


Last updated about 1 month ago

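| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 201 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1152 | ±5 | 7K | 6.6% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 202 | 71 | Gemini 2.5 Flash Thinking | 1153 | ±7 | 3.7K | 3.6% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 203 | 56 | MiniMax M2.1 Lightning | 1157 | ±13 | 970 | 1.0% | 1.7% | 52 tps | 2.1s | 205K | $0.30 | $2.40 |
| 204 | 44 | Kimi K2 Thinking Turbo | 1160 | ±8 | 13.2K | 3.5% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 205 | 29 | Qwen3 VL 235B A22B Instruct | 1161 | ±6 | 4.5K | 8.8% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 206 | 71 | MiniMax M2.5 FP8 | 1162 | ±17 | 575 | 3.4% | 3.6% | 33 tps | 1.7s | 205K | $0.45 | $1.75 |
| 207 | 86 | Seed 2.0 Lite (Medium) | 1166 | ±14 | 575 | 2.5% | 6.6% | 33 tps | 1.6s | 256K | $0.25 | $2.00 |
| 208 | 44 | Gemini 2.5 Pro | 1168 | ±4 | 19.1K | 9.1% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 209 | 33 | Kimi K2.5 | 1169 | ±6 | 5.3K | 2.8% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 210 | 32 | Gemini 2.5 Pro High | 1169 | ±4 | 16.8K | 7.8% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 211 | 26 | GPT-5 (High) | 1171 | ±4 | 12K | 3.7% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 212 | 42 | GPT-5.2 (Extra High) | 1189 | ±6 | 6.7K | 3.5% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 213 | 13 | GPT-5.3 Instant | 1191 | ±8 | 2.2K | 1.8% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 214 | 22 | GPT-5 Chat | 1193 | ±3 | 23.9K | 6.8% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 215 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1199 | ±13 | 950 | 3.1% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 216 | 17 | Claude Opus 4.5 | 1207 | ±5 | 6.6K | 3.6% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 217 | 17 | Grok 4.20 Beta Reasoning | 1209 | ±21 | 850 | 1.7% | 1.1% | 77 tps | 4.5s | 2M | $2.00 | $5.50 |
| 218 | 16 | GPT-5.2 | 1219 | ±6 | 6.2K | 3.8% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 219 | 37 | Claude Sonnet 4.5 | 1219 | ±5 | 13.7K | 6.6% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 220 | 17 | GPT-5.2 (High) | 1221 | ±4 | 19.4K | 3.1% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 221 | 14 | Gemini 3 Flash Preview Thinking | 1225 | ±6 | 13.8K | 3.4% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 222 | 17 | Gemini 3 Flash Preview | 1226 | ±6 | 5.3K | 4.2% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 223 | 48 | Claude Sonnet 4 (Thinking) | 1228 | ±6 | 5.2K | 3.4% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 224 | 22 | GLM 5 | 1231 | ±12 | 2.3K | 2.5% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 225 | 10 | GPT-5.2 Instant | 1254 | ±5 | 9K | 3.9% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 226 | 10 | Claude Sonnet 4.5 (Thinking) | 1264 | ±2 | 25.1K | 4.5% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 227 | 14 | Gemini 3 Pro (Low) | 1266 | ±5 | 8.4K | 4.6% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 228 | 8 | GPT-5.1 (High) | 1273 | ±4 | 16.6K | 4.0% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |
| 229 | 8 | GPT-5.1 | 1278 | ±5 | 8.5K | 4.7% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 230 | 7 | Claude Opus 4.5 (Thinking) | 1280 | ±4 | 23K | 2.2% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 |
| 231 | 10 | Gemini 3 Pro | 1282 | ±4 | 37.7K | 3.1% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 232 | 5 | Claude Sonnet 4.6 (Thinking) | 1324 | ±9 | 3K | 2.6% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00 |
| 233 | 6 | Gemini 3.1 Pro | 1359 | ±9 | 5.8K | 2.3% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |
| 234 | 4 | Claude Sonnet 4.6 | 1364 | ±8 | 3K | 1.1% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 |
| 235 | 2 | GPT-5.4 | 1371 | ±13 | 1.1K | 1.3% | 2.6% | 55 tps | 0.8s | 1M | $2.50 | $15.00 |
| 236 | 2 | Claude Opus 4.6 | 1445 | ±8 | 4.3K | 1.2% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 |
| 237 | 1 | Claude Opus 4.6 (Thinking) | 1504 | ±9 | 3.2K | 1.5% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00 |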
View All (237 models)
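
When two entries have close VIBE Scores, the Confidence Interval column suggests whether the gap is meaningful. The following is a minimal sketch, assuming the interval is symmetric around the score (score minus and plus the number after "±"); the leaderboard does not state how the interval is computed, and the `Entry` class and `intervals_overlap` function are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    name: str
    score: int       # value from the VIBE Score column
    half_width: int  # value after "±" in the Confidence Interval column

    def interval(self) -> tuple[int, int]:
        # Assumption: the interval is symmetric around the score.
        return (self.score - self.half_width, self.score + self.half_width)


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score intervals share at least one point."""
    a_lo, a_hi = a.interval()
    b_lo, b_hi = b.interval()
    return a_lo <= b_hi and b_lo <= a_hi


# Values copied from the leaderboard rows.
gemini_31_pro = Entry("Gemini 3.1 Pro", 1359, 9)
sonnet_46 = Entry("Claude Sonnet 4.6", 1364, 8)
opus_46 = Entry("Claude Opus 4.6", 1445, 8)
opus_46_thinking = Entry("Claude Opus 4.6 (Thinking)", 1504, 9)

print(intervals_overlap(gemini_31_pro, sonnet_46))   # True
print(intervals_overlap(opus_46, opus_46_thinking))  # False
```

Under this assumption, the intervals for Gemini 3.1 Pro and Claude Sonnet 4.6 overlap, so their ordering is less certain than the gap between the two Claude Opus 4.6 entries, whose intervals are disjoint.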