ModelChorusModelChorus
ChallengeChatLeaderboardBenchmarksHistoryHow it works
Terms of ServicePrivacy PolicyAPI

Copyright 2026 MeetKai Inc.

Benchmarks/Functionary Swahili Large/Spanish (Spain) tasks

Functionary Swahili Large

1 task

Each row below is a single benchmark task this model was evaluated on. The Score column averages every metric the task reports (accuracy, F1, exact-match, etc.). Click a row to browse the individual questions and the model's responses.

Average
62.9
ScoreLanguageTaskMetrics
62.9Spanish (Spain)
spanish_xquad_es
spanish xquad es
exact_match: 53.0f1: 72.8sample_len: 1190.0
exact_match: 53.0f1: 72.8sample_len: 1190.0