ModelChorusModelChorus
ChallengeChatLeaderboardBenchmarksHistoryHow it works
Terms of ServicePrivacy PolicyAPI

Copyright 2026 MeetKai Inc.

Benchmarks/GPT-oss-120B/Albanian (Albania) tasks

GPT-oss-120B

5 tasks

Each row below is a single benchmark task this model was evaluated on. The Score column averages every metric the task reports (accuracy, F1, exact-match, etc.). Click a row to browse the individual questions and the model's responses.

Average
86.0
ScoreLanguageTaskMetrics
96.5Albanian (Albania)
albanian_polywrite
albanian open generation
open_quality_score: 96.5sample_len: 155.0
open_quality_score: 96.5sample_len: 155.0
89.6Albanian (Albania)
albanian_belebele
albanian mcq
f1_macro: 89.6sample_len: 900.0
f1_macro: 89.6sample_len: 900.0
87.2Albanian (Albania)
albanian_sib200
albanian classification
f1_macro: 87.2sample_len: 204.0
f1_macro: 87.2sample_len: 204.0
82.8Albanian (Albania)
albanian_global_mmlu
albanian mcq
f1_macro: 82.8sample_len: 400.0
f1_macro: 82.8sample_len: 400.0
74.0Albanian (Albania)
albanian_aya
albanian open generation
llm_judge_score: 74.0sample_len: 200.0
llm_judge_score: 74.0sample_len: 200.0