ModelChorusModelChorus
ChallengeChatLeaderboardBenchmarksHistoryHow it works
Terms of ServicePrivacy PolicyAPI

Copyright 2026 MeetKai Inc.

Benchmarks/GPT-oss-120B/Swahili (Tanzania) tasks

GPT-oss-120B

3 tasks

Each row below is a single benchmark task this model was evaluated on. The Score column averages every metric the task reports (accuracy, F1, exact-match, etc.). Click a row to browse the individual questions and the model's responses.

Average
74.6
ScoreLanguageTaskMetrics
89.3Swahili (Tanzania)
swahili_sib200
swahili classification
f1_macro: 89.3sample_len: 204.0
f1_macro: 89.3sample_len: 204.0
69.6Swahili (Tanzania)
swahili_afrimgsm
swahili afrimgsm
exact_match: 69.6sample_len: 250.0
exact_match: 69.6sample_len: 250.0
65.0Swahili (Tanzania)
swahili_afrixnli
swahili nli
f1_macro: 65.0sample_len: 600.0
f1_macro: 65.0sample_len: 600.0