ModelChorusModelChorus
ChallengeChatLeaderboardBenchmarksHistoryHow it works
Terms of ServicePrivacy PolicyAPI

Copyright 2026 MeetKai Inc.

Benchmarks/GPT-oss-120B/Ukrainian (Ukraine) tasks

GPT-oss-120B

6 tasks

Each row below is a single benchmark task this model was evaluated on. The Score column averages every metric the task reports (accuracy, F1, exact-match, etc.). Click a row to browse the individual questions and the model's responses.

Average
82.0
ScoreLanguageTaskMetrics
98.1Ukrainian (Ukraine)
ukrainian_polywrite
ukrainian open generation
open_quality_score: 98.1sample_len: 154.0
open_quality_score: 98.1sample_len: 154.0
91.8Ukrainian (Ukraine)
ukrainian_belebele
ukrainian mcq
f1_macro: 91.8sample_len: 900.0
f1_macro: 91.8sample_len: 900.0
88.9Ukrainian (Ukraine)
ukrainian_sib200
ukrainian classification
f1_macro: 88.9sample_len: 204.0
f1_macro: 88.9sample_len: 204.0
84.7Ukrainian (Ukraine)
ukrainian_global_mmlu
ukrainian mcq
f1_macro: 84.7sample_len: 2850.0
f1_macro: 84.7sample_len: 2850.0
70.0Ukrainian (Ukraine)
ukrainian_zno
ukrainian mcq
f1_macro: 70.0sample_len: 751.0
f1_macro: 70.0sample_len: 751.0
58.4Ukrainian (Ukraine)
ukrainian_squad
ukrainian qa
exact_match: 46.2f1: 70.6sample_len: 3812.0
exact_match: 46.2f1: 70.6sample_len: 3812.0