Copyright 2026 MeetKai Inc.

GPT-oss-120B

7 tasks

Each row below is a single benchmark task this model was evaluated on. The Score column averages the quality metrics the task reports (accuracy, F1, exact match, etc.); the sample_len value listed alongside them is not part of that average. Click a row to browse the individual questions and the model's responses.
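As a rough illustration of how a row's Score can be derived from its metrics, here is a minimal sketch. The function name `task_score` and the `NON_METRIC_KEYS` set are hypothetical, and excluding `sample_len` is an assumption inferred from the listed numbers, not a documented rule of the site.

```python
from statistics import mean

# Assumption: sample_len describes the evaluation set, not model quality,
# so it is left out of the average. This is inferred from the listed scores.
NON_METRIC_KEYS = {"sample_len"}


def task_score(metrics: dict[str, float]) -> float:
    """Average every reported quality metric for one task (hypothetical helper)."""
    values = [value for name, value in metrics.items() if name not in NON_METRIC_KEYS]
    if not values:
        raise ValueError("task reports no quality metrics")
    return mean(values)


# Metric values copied from the yoruba_afriqa and yoruba_sib200 rows.
afriqa = {"exact_match": 12.7, "f1": 18.8, "sample_len": 332.0}
sib200 = {"f1_macro": 78.6, "sample_len": 204.0}

print(f"yoruba_afriqa: {task_score(afriqa):.2f}")
print(f"yoruba_sib200: {task_score(sib200):.2f}")
```

The page shows scores to one decimal place, presumably computed from unrounded metrics, so a value recomputed from the displayed figures can differ in the last digit.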

Average score: 46.7

| Score | Language | Task | Metrics |
| --- | --- | --- | --- |
| 78.6 | Yoruba (Nigeria) | yoruba_sib200 (yoruba classification) | f1_macro: 78.6, sample_len: 204.0 |
| 68.9 | Yoruba (Nigeria) | yoruba_naijasenti (yoruba sentiment) | f1_macro: 68.9, sample_len: 4515.0 |
| 60.8 | Yoruba (Nigeria) | yoruba_afrimgsm (yoruba afrimgsm) | exact_match: 60.8, sample_len: 250.0 |
| 60.5 | Yoruba (Nigeria) | yoruba_afrixnli (yoruba nli) | f1_macro: 60.5, sample_len: 600.0 |
| 23.5 | Yoruba (Nigeria) | yoruba_afrimmlu (yoruba mcq) | f1_macro: 23.5, sample_len: 500.0 |
| 19.1 | Yoruba (Nigeria) | yoruba_belebele (yoruba mcq) | f1_macro: 19.1, sample_len: 900.0 |
| 15.7 | Yoruba (Nigeria) | yoruba_afriqa (yoruba qa) | exact_match: 12.7, f1: 18.8, sample_len: 332.0 |
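
The reported average of 46.7 is consistent with an unweighted mean of the seven per-task scores. The sketch below checks that arithmetic. The helper name `overall_average` is hypothetical, and the assumption that tasks are weighted equally (rather than by sample_len) is inferred from the numbers, not stated by the page.

```python
# Per-task scores copied from the Score column above.
task_scores = {
    "yoruba_sib200": 78.6,
    "yoruba_naijasenti": 68.9,
    "yoruba_afrimgsm": 60.8,
    "yoruba_afrixnli": 60.5,
    "yoruba_afrimmlu": 23.5,
    "yoruba_belebele": 19.1,
    "yoruba_afriqa": 15.7,
}


def overall_average(scores: dict[str, float]) -> float:
    """Unweighted mean over tasks (assumed; each task counts once)."""
    return sum(scores.values()) / len(scores)


print(f"{overall_average(task_scores):.1f}")
```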