ModelChorusModelChorus
ChallengeChatLeaderboardBenchmarksHistoryHow it works
Terms of ServicePrivacy PolicyAPI

Copyright 2026 MeetKai Inc.

Benchmarks/GPT-oss-120B/spanish_xquad_es

spanish_xquad_es

GPT-oss-120B · spanish xquad es · Spanish (Spain) · 1190 samples

Every row in the list is one question from the benchmark. The check or cross icon shows whether the model's answer matched the target; click a row to read the full prompt, expected answer, and what the model actually produced.

exact_match
54.4
f1
75.4
sample_len
1190.0
Average
64.9
Loading samples…