Last updated about 1 month ago

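| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | GPT-5.4 | 1446 | ±14 | 1.7K | 1.7% | 2.6% | 55 tps | 0.8s | 1M | $2.50 | $15.00 |
| 2 | 1 | Claude Opus 4.6 (Thinking) | 1440 | ±9 | 5.1K | 1.2% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00 |
| 3 | 2 | Claude Opus 4.6 | 1420 | ±11 | 6.5K | 1.1% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 |
| 4 | 4 | GPT-5.4 (High) | 1415 | ±12 | 2.1K | 1.4% | 4.6% | 68 tps | 7.9s | 1M | $2.50 | $15.00 |
| 5 | 5 | Claude Sonnet 4.6 (Thinking) | 1377 | ±9 | 4.9K | 1.3% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00 |
| 6 | 8 | GPT-5.1 (Medium) | 1346 | ±14 | 785 | 1.9% | <0.1% | 86 tps | 3.8s | 400K | $0.83 | $6.67 |
| 7 | 4 | Claude Sonnet 4.6 | 1345 | ±11 | 4.7K | 1.3% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 |
| 8 | 10 | Claude Sonnet 4.5 (Thinking) | 1319 | ±4 | 6.7K | 2.4% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 9 | 6 | Gemini 3.1 Pro | 1317 | ±8 | 7.9K | 1.6% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |
| 10 | 8 | GPT-5.1 | 1295 | ±7 | 4.3K | 2.2% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 11 | 10 | Gemini 3 Pro | 1285 | ±9 | 17.6K | 1.5% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 12 | 7 | Claude Opus 4.5 (Thinking) | 1272 | ±7 | 11.3K | 2.0% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 |
| 13 | 14 | Gemini 3 Flash Preview Thinking | 1272 | ±9 | 7.9K | 1.8% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 14 | 14 | Gemini 3 Pro (Low) | 1262 | ±6 | 6.1K | 2.2% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 15 | 10 | GPT-5.2 Instant | 1262 | ±7 | 6.9K | 1.8% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 16 | 17 | Claude Opus 4.5 | 1259 | ±7 | 4.1K | 2.9% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 17 | 16 | GPT-5.2 | 1254 | ±11 | 4.5K | 1.8% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 18 | 8 | GPT-5.1 (High) | 1252 | ±8 | 6.4K | 1.9% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |
| 19 | 17 | Gemini 3 Flash Preview | 1248 | ±12 | 3.9K | 2.2% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 20 | 22 | GPT-5 Chat | 1243 | ±7 | 11.4K | 2.5% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 21 | 13 | GPT-5.3 Instant | 1240 | ±14 | 4.5K | 1.9% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 22 | 17 | GPT-5.2 (High) | 1236 | ±8 | 11.2K | 1.8% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 23 | 37 | Claude Sonnet 4.5 | 1230 | ±6 | 4.7K | 4.1% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 24 | 43 | Gemini 2.5 Flash Thinking Preview 0925 | 1211 | ±7 | 2.6K | 2.9% | <0.1% | 111 tps | 4.7s | 1M | $0.30 | $2.50 |
| 25 | 17 | Grok 4.20 Beta Reasoning | 1209 | ±21 | 915 | 2.1% | 1.1% | 77 tps | 4.5s | 2M | $2.00 | $5.50 |
| 26 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1196 | ±6 | 3K | 2.6% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 27 | 32 | Gemini 2.5 Pro High | 1191 | ±6 | 6.1K | 3.2% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 28 | 56 | Gemini 2.5 Pro Low | 1186 | ±7 | 2.9K | 3.3% | <0.1% | 89 tps | 2.4s | 1M | $1.25 | $10.00 |
| 29 | 44 | Gemini 2.5 Pro | 1184 | ±5 | 13.8K | 2.9% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 30 | 62 | GPT-5.1 Instant | 1168 | ±7 | 4.3K | 2.5% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 31 | 52 | Claude Haiku 4.5 | 1163 | ±8 | 5.3K | 4.1% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 32 | 71 | Gemini 2.5 Flash Thinking | 1161 | ±4 | 6.8K | 3.0% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 33 | 60 | MiniMax M2.1 | 1159 | ±11 | 2.1K | 2.1% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 34 | 33 | Grok 4.20 Multi Agent Beta | 1158 | ±23 | 665 | 1.5% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 |
| 35 | 21 | Claude Opus 4 (Thinking) | 1156 | ±18 | 715 | 3.4% | <0.1% | 28 tps | 1.3s | 200K | $15.00 | $75.00 |
| 36 | 42 | GPT-5.2 (Extra High) | 1146 | ±10 | 3.5K | 2.3% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 37 | 86 | Claude Sonnet 4 | 1138 | ±7 | 7.4K | 2.3% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 38 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1134 | ±9 | 3.5K | 3.4% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 39 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1132 | ±12 | 1.7K | 3.9% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 40 | 22 | GLM 5 | 1132 | ±12 | 1.7K | 1.4% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |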
Showing the top 40 of 188 models.