ModelChorus

Copyright 2026 MeetKai Inc.

arabic_tydiqa

Functionary Swahili Large · Arabic QA · Arabic (Saudi Arabia) · 921 samples

Every row in the list is one question from the benchmark. The check or cross icon shows whether the model's answer matched the target; click a row to read the full prompt, expected answer, and what the model actually produced.

exact_match: 38.3
f1: 63.9
sample_len: 921.0
Average: 51.1
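
The page does not say how exact_match and f1 are calculated. As a rough illustration only, the sketch below assumes the common SQuAD-style convention: exact match after light normalization, and F1 over overlapping tokens, both averaged across samples and scaled to 0-100. The function names (normalize, exact_match, token_f1, aggregate) are hypothetical, and the normalization here handles only ASCII punctuation, so a real Arabic evaluation would likely need more (for example diacritic and Arabic punctuation handling). The "average" field is taken as the mean of the two scores, which is consistent with the reported figures: (38.3 + 63.9) / 2 = 51.1.

```python
import string
from collections import Counter


def normalize(text: str) -> list[str]:
    # Assumed normalization: lowercase, drop ASCII punctuation, split on whitespace.
    table = str.maketrans("", "", string.punctuation)
    return text.lower().translate(table).split()


def exact_match(prediction: str, target: str) -> float:
    # 1.0 when the normalized token sequences are identical, else 0.0.
    return float(normalize(prediction) == normalize(target))


def token_f1(prediction: str, target: str) -> float:
    # Harmonic mean of token-level precision and recall (bag-of-tokens overlap).
    pred, gold = normalize(prediction), normalize(target)
    if not pred or not gold:
        # If either side is empty, score 1.0 only when both are empty.
        return float(pred == gold)
    overlap = sum((Counter(pred) & Counter(gold)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(gold)
    return 2 * precision * recall / (precision + recall)


def aggregate(pairs: list[tuple[str, str]]) -> dict[str, float]:
    # pairs holds (model answer, expected answer); scores are scaled to 0-100.
    n = len(pairs)
    em = 100 * sum(exact_match(p, t) for p, t in pairs) / n
    f1 = 100 * sum(token_f1(p, t) for p, t in pairs) / n
    return {"exact_match": em, "f1": f1, "average": (em + f1) / 2}
```

Under this convention a partially correct answer earns F1 credit but no exact-match credit, which is one plausible reason for f1 sitting well above exact_match on this benchmark.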