ModelChorusModelChorus
ChallengeChatLeaderboardBenchmarksHistoryHow it works
Terms of ServicePrivacy PolicyAPI

Copyright 2026 MeetKai Inc.

Benchmarks/GPT-oss-120B/swahili_afrixnli

swahili_afrixnli

GPT-oss-120B · swahili nli · Swahili (Tanzania) · 0 samples

Every row in the list is one question from the benchmark. The check or cross icon shows whether the model's answer matched the target; click a row to read the full prompt, expected answer, and what the model actually produced.

f1_macro
65.0
sample_len
600.0
Average
65.0
No per-question samples have been synced for this task yet.