

Last updated about 1 month ago

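| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | 43 | Qwen3 Max Instruct Preview | 1203 | ±6 | 16.1K | 4.6% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |
| 242 | 43 | Gemini 2.5 Pro High | 1204 | ±3 | 21.1K | 5.7% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 243 | 43 | Gemini 3 Flash Preview | 1205 | ±11 | 7.2K | 3.7% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 244 | 43 | Claude Sonnet 4 | 1205 | ±3 | 43.2K | 3.7% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 245 | 36 | Kimi K2.5 Instant | 1210 | ±8 | 1.8K | 3.2% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 246 | 36 | Qwen3.5 27B | 1211 | ±16 | 910 | 4.7% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 247 | 36 | GPT-5.2 Codex (Medium) | 1211 | ±12 | 2.4K | 3.0% | 5.7% | 37 tps | 6.3s | 400K | $1.75 | $14.00 |
| 248 | 36 | GPT-5 Codex (Medium) | 1214 | ±6 | 8.8K | 3.9% | 4.1% | 122 tps | 5.2s | 400K | $1.25 | $10.00 |
| 249 | 36 | Qwen3.5 122B A17B | 1216 | ±15 | 1.9K | 3.1% | 1.5% | 82 tps | 1.4s | 256K | $0.40 | $3.20 |
| 250 | 36 | Qwen3 VL 235B A22B Instruct | 1220 | ±7 | 5.6K | 6.7% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 251 | 36 | GPT-5.2 (Extra High) | 1221 | ±9 | 8K | 3.5% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 252 | 31 | MiniMax M2.5 Lightning | 1228 | ±14 | 1.7K | 3.2% | 1.5% | 51 tps | 2.0s | 205K | $0.60 | $2.40 |
| 253 | 31 | Qwen3 Next 80B A3B Instruct | 1231 | ±5 | 8.8K | 5.8% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 254 | 31 | GPT-5 Chat | 1231 | ±4 | 35K | 4.5% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 255 | 31 | Grok 4.1 Fast Non-Reasoning | 1239 | ±6 | 9.4K | 5.4% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 256 | 31 | GPT-5.1 Codex (High) | 1240 | ±8 | 37K | 3.3% | 3.2% | 96 tps | 3.9s | 400K | $1.25 | $10.00 |
| 257 | 27 | GPT-5.2 Codex (High) | 1257 | ±12 | 3.1K | 2.8% | 8.8% | 41 tps | 12.9s | 400K | $1.75 | $14.00 |
| 258 | 27 | GPT-5 (High) | 1259 | ±4 | 16.2K | 3.5% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 259 | 27 | GPT-5 Codex (High) | 1260 | ±7 | 18.5K | 3.3% | 3.2% | 122 tps | 7.1s | 400K | $1.25 | $10.00 |
| 260 | 27 | Claude Sonnet 4 (Thinking) | 1261 | ±3 | 25.9K | 2.9% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 261 | 19 | GPT-5.3 Instant | 1271 | ±12 | 4.2K | 2.5% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 262 | 19 | GPT-5.3 Codex (Medium) | 1278 | ±27 | 1.1K | 2.3% | 2.3% | 62 tps | 10.3s | 400K | $1.75 | $14.00 |
| 263 | 19 | MiniMax M2.5 | 1283 | ±28 | 510 | 3.8% | 1.4% | 70 tps | 1.9s | 205K | $0.28 | $1.20 |
| 264 | 19 | Claude Haiku 4.5 | 1283 | ±3 | 16.4K | 4.5% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 265 | 19 | Gemini 3 Flash Preview Thinking | 1286 | ±6 | 32.7K | 3.3% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 266 | 19 | GPT-5.1 (High) | 1290 | ±6 | 19.1K | 3.5% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |
| 267 | 19 | Gemini 3 Pro (Low) | 1291 | ±6 | 11.9K | 4.2% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 268 | 19 | Kimi K2.5 | 1291 | ±11 | 16.5K | 3.4% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 269 | 17 | GPT-5.2 (High) | 1297 | ±8 | 30.7K | 2.8% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 270 | 17 | Claude Sonnet 4.5 | 1307 | ±3 | 20.9K | 5.0% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 271 | 15 | GPT-5.1 | 1319 | ±7 | 12.9K | 3.4% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 272 | 15 | GLM 5 | 1324 | ±14 | 11.7K | 3.3% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 273 | 13 | Gemini 3 Pro | 1337 | ±5 | 59.4K | 2.6% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 274 | 13 | GPT-5.2 | 1340 | ±8 | 11.3K | 3.2% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 275 | 12 | Claude Haiku 4.5 (Extended Thinking) | 1353 | ±4 | 14.3K | 3.8% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 276 | 10 | GPT-5.2 Instant | 1358 | ±6 | 15.7K | 3.3% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 277 | 10 | Claude Sonnet 4.5 (Thinking) | 1362 | ±4 | 58.2K | 3.3% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 278 | 9 | GPT-5.3 Codex (High) | 1381 | ±9 | 3.2K | 1.2% | 2.0% | 61 tps | 17.8s | 400K | $1.75 | $14.00 |
| 279 | 7 | Claude Opus 4.5 | 1409 | ±5 | 15.1K | 2.2% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 280 | 7 | Gemini 3.1 Pro | 1418 | ±9 | 22K | 2.5% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |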
View All (286 models)