ModelChorusModelChorus
ChallengeChatLeaderboardBenchmarksHistoryHow it works
Terms of ServicePrivacy PolicyAPI

Copyright 2026 MeetKai Inc.

Benchmarks/Rnj 1 Instruct/Spanish (Spain) tasks

Rnj 1 Instruct

3 tasks

Each row below is a single benchmark task this model was evaluated on. The Score column averages every metric the task reports (accuracy, F1, exact-match, etc.). Click a row to browse the individual questions and the model's responses.

Average
58.1
ScoreLanguageTaskMetrics
70.2Spanish (Spain)
spanish_belebele
spanish mcq
f1_macro: 70.2sample_len: 900.0
f1_macro: 70.2sample_len: 900.0
55.1Spanish (Spain)
spanish_xquad_es
spanish xquad es
exact_match: 46.1f1: 64.1sample_len: 1190.0
exact_match: 46.1f1: 64.1sample_len: 1190.0
49.0Spanish (Spain)
spanish_global_mmlu
spanish mcq
f1_macro: 49.0sample_len: 400.0
f1_macro: 49.0sample_len: 400.0