Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1504
Claude Opus 4.6 (Thinking)
1445
Claude Opus 4.6
1371
GPT-5.4
1364
Claude Sonnet 4.6
1359
Gemini 3.1 Pro
1324
Claude Sonnet 4.6 (Thinking)
1282
Gemini 3 Pro
1280
Claude Opus 4.5 (Thinking)
1278
GPT-5.1
1273
GPT-5.1 (High)
1266
Gemini 3 Pro (Low)
1264
Claude Sonnet 4.5 (Thinking)
1254
GPT-5.2 Instant
1231
GLM 5
1228
Claude Sonnet 4 (Thinking)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1504±93.2K1.5%2.5%56 tps1.6s200K$5.00$25.00
22Claude Opus 4.61445±84.3K1.2%2.1%48 tps1.7s200K$5.00$25.00
32GPT-5.41371±131.1K1.3%2.6%55 tps0.8s1M$2.50$15.00
44Claude Sonnet 4.61364±83K1.1%1.6%47 tps1.2s200K$3.00$15.00
56Gemini 3.1 Pro1359±95.8K2.3%3.5%35 tps4.1s1M$2.00$12.00
65Claude Sonnet 4.6 (Thinking)1324±93K2.6%4.7%57 tps1.1s200K$3.00$15.00
710Gemini 3 Pro1282±437.7K3.1%2.1%50 tps3.6s1M$2.00$12.00
87Claude Opus 4.5 (Thinking)1280±423K2.2%1.8%49 tps1.4s200K$5.00$25.00
98GPT-5.11278±58.5K4.7%2.3%71 tps1.4s400K$1.42$11.33
108GPT-5.1 (High)1273±416.6K4.0%3.2%76 tps6.9s400K$1.25$10.00
1114Gemini 3 Pro (Low)1266±58.4K4.6%2.4%51 tps3.5s1M$2.00$12.00
1210Claude Sonnet 4.5 (Thinking)1264±225.1K4.5%1.9%44 tps1.1s200K$3.00$15.00
1310GPT-5.2 Instant1254±59K3.9%1.7%52 tps2.0s400K$1.75$14.00
1422GLM 51231±122.3K2.5%3.4%36 tps2.7s200K$0.72$2.55
1548Claude Sonnet 4 (Thinking)1228±65.2K3.4%1.5%52 tps1.5s200K$3.00$13.67
1617Gemini 3 Flash Preview1226±65.3K4.2%1.3%138 tps1.4s1M$0.50$3.00
1714Gemini 3 Flash Preview Thinking1225±613.8K3.4%1.6%3 tps6.2s1M$0.50$3.00
1817GPT-5.2 (High)1221±419.4K3.1%6.7%18 tps16.3s400K$1.75$14.00
1937Claude Sonnet 4.51219±513.7K6.6%1.4%41 tps1.3s200K$1.80$9.00
2016GPT-5.21219±66.2K3.8%4.1%18 tps2.7s400K$1.75$14.00
2117Grok 4.20 Beta Reasoning1209±218501.7%1.1%77 tps4.5s2M$2.00$5.50
2217Claude Opus 4.51207±56.6K3.6%1.5%45 tps1.5s200K$5.00$25.00
2356Gemini 3.1 Flash Lite Preview Thinking1199±139503.1%1.7%75 tps4.7s1M$0.25$1.50
2422GPT-5 Chat1193±323.9K6.8%1.3%95 tps0.9s400K$1.25$10.00
2513GPT-5.3 Instant1191±82.2K1.8%0.9%63 tps0.8s400K$1.75$14.00
2642GPT-5.2 (Extra High) 1189±66.7K3.5%13.2%17 tps20.5s400K$1.75$14.00
2726GPT-5 (High)1171±412K3.7%4.5%81 tps35.9s400K$1.25$10.00
2832Gemini 2.5 Pro High1169±416.8K7.8%1.5%48 tps2.3s1M$1.25$10.00
2944Gemini 2.5 Pro1168±419.1K9.1%2.3%45 tps2.6s1M$1.25$10.00
3086Seed 2.0 Lite (Medium)1166±145752.5%6.6%33 tps1.6s256K$0.25$2.00
3171MiniMax M2.5 FP81162±175753.4%3.6%33 tps1.7s205K$0.45$1.75
3229Qwen3 VL 235B A22B Instruct1161±64.5K8.8%3.1%75 tps1.9s129K$0.37$1.81
3356MiniMax M2.1 Lightning1157±139701.0%1.7%52 tps2.1s205K$0.30$2.40
3471Gemini 2.5 Flash Thinking1153±73.7K3.6%2.2%88 tps6.4s1M$0.30$2.50
3526Claude Haiku 4.5 (Extended Thinking)1152±57K6.6%1.4%115 tps0.7s200K$1.00$5.00
3642Qwen3 Max Instruct Preview1150±413.5K5.8%1.1%31 tps1.7s256K$1.43$6.61
3760MiniMax M2.11149±610.4K4.3%2.1%66 tps2.6s205K$0.30$1.20
3844Grok 4.1 Fast Reasoning1149±621.2K4.2%1.5%58 tps7.3s2M$0.20$0.50
3940Qwen3 235B A22B Instruct 25071146±38.8K12.2%6.8%13 tps1.9s262K$0.13$0.52
4033Grok 4.20 Multi Agent Beta1143±167651.9%1.2%56 tps8.8s2M$2.00$6.00
View All (170 models)