Copyright 2026 MeetKai Inc.

ifeval

Rnj 1 Instruct · ifeval · English (US) · 0 samples

Every row in the list is one question from the benchmark. The check or cross icon shows whether the model's answer matched the target; click a row to read the full prompt, expected answer, and what the model actually produced.

Metric                     Score
inst_level_loose_acc       76.3
inst_level_strict_acc      72.3
prompt_level_loose_acc     66.7
prompt_level_strict_acc    63.0
sample_len                 541.0
Average                    69.6
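The page does not state how Average is derived. The listed numbers are consistent with it being the unweighted mean of the four accuracy metrics, with sample_len left out. A minimal sketch under that assumption (the variable names are illustrative and not part of any ModelChorus API):

```python
from statistics import mean

# Accuracy scores copied from the metrics listed above.
accuracy_scores = {
    "inst_level_loose_acc": 76.3,
    "inst_level_strict_acc": 72.3,
    "prompt_level_loose_acc": 66.7,
    "prompt_level_strict_acc": 63.0,
}

# Assumption: sample_len (541.0) is a length statistic, not an accuracy,
# so it is excluded from the average.
average = round(mean(accuracy_scores.values()), 1)
print(average)  # 69.6
```

Including sample_len in the mean would give a value far above 100, which supports the reading that only the four accuracy metrics are averaged.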
No per-question samples have been synced for this task yet.