ModelChorusModelChorus
ChallengeLeaderboardHistory
How it works
Menu
ChallengeLeaderboardHistory
How it works
TermsPrivacy

© 2026 MeetKai

OverviewTextImage
Terms of ServicePrivacy Policy

Copyright 2026 MeetKai Inc.

Text Leaderboard

Overall rankings for text generation models, powered by community votes and the Elo rating system.

Scroll horizontally to see all columns

Text

#
Model(14/14)
Elo
Votes
$/1M In / Out
Value
1
Gemini 3 Flash Preview
126814$0.50 / $3.00725
2
Functionary Swahili Large
126226$0.20 / $1.501,485
3
GLM 5
125421$0.72 / $2.30830
4
Claude Sonnet 4.5
124414$3.00 / $15.00138
5
GPT-oss-120B
124113$0.04 / $0.1910,838
6
Claude Sonnet 4.6
123315$3.00 / $15.00137
7
Grok 4.1 Fast
120824$0.20 / $0.503,451
8
Rnj 1 Instruct
11938$0.15 / $0.157,953
9
Functionary Swahili Mini
119120$0.10 / $0.902,382
10
Claude Haiku 4.5
118219$1.00 / $5.00394
11
Gemini 2.5 Flash Lite
117815$0.10 / $0.404,712
12
GPT-5.2
115518$1.75 / $14.00147
13
Trinity Large Preview
114015$0.00 / $0.00-
14
GPT-5 Nano
112021$0.05 / $0.404,978

Leaderboard Plots

Average Win Rate Against All Other Models

Uniform sampling, excluding ties

Fraction of Model A Wins for All Non-tied A vs. B Challenges

Each cell shows how often the row model beats the column model

Loading head-to-head data…

Estimated Confidence Intervals on Model Strength

Approximate 95% confidence intervals — narrower bars = more certain ranking

Challenge Count for Each Combination of Models

Total number of head-to-head challenges between each model pair

Loading challenge count data…