ModelChorusModelChorus
ChallengeChatLeaderboardBenchmarksHistoryHow it works
Terms of ServicePrivacy PolicyAPI

Copyright 2026 MeetKai Inc.

Benchmarks/GPT-oss-120B/Portuguese (Portugal) tasks

GPT-oss-120B

6 tasks

Each row below is a single benchmark task this model was evaluated on. The Score column averages every metric the task reports (accuracy, F1, exact-match, etc.). Click a row to browse the individual questions and the model's responses.

Average
77.7
ScoreLanguageTaskMetrics
91.8Portuguese (Portugal)
portuguese_enem
portuguese mcq
exact_match: 91.8sample_len: 1432.0
exact_match: 91.8sample_len: 1432.0
90.1Portuguese (Portugal)
portuguese_bluex
portuguese mcq
exact_match: 90.1sample_len: 724.0
exact_match: 90.1sample_len: 724.0
85.4Portuguese (Portugal)
portuguese_hatebr
portuguese classification
f1_macro: 85.4sample_len: 1400.0
f1_macro: 85.4sample_len: 1400.0
70.2Portuguese (Portugal)
portuguese_tweetsentbr
portuguese classification
f1_macro: 70.2sample_len: 2010.0
f1_macro: 70.2sample_len: 2010.0
65.8Portuguese (Portugal)
portuguese_hate_speech
portuguese classification
f1_macro: 65.8sample_len: 851.0
f1_macro: 65.8sample_len: 851.0
62.9Portuguese (Portugal)
portuguese_oab_exams
portuguese mcq
exact_match: 62.9sample_len: 2210.0
exact_match: 62.9sample_len: 2210.0