[Chart: VIBE Score for the top 15 models, from Claude Opus 4.6 (Thinking) at 1485 down to GPT-5.2 at 1213]

Last updated about 1 month ago

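| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | Claude Opus 4.6 (Thinking) | 1485 | ±11 | 2.3K | 1.3% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00 |
| 2 | 2 | Claude Opus 4.6 | 1483 | ±11 | 3.2K | 0.9% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 |
| 3 | 6 | Gemini 3.1 Pro | 1344 | ±16 | 2.8K | 2.4% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |
| 4 | 4 | Claude Sonnet 4.6 | 1295 | ±16 | 1.8K | 0.8% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 |
| 5 | 10 | GPT-5.2 Instant | 1274 | ±8 | 3.9K | 2.2% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 6 | 8 | GPT-5.1 (High) | 1274 | ±6 | 4.2K | 2.5% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |
| 7 | 14 | Gemini 3 Pro (Low) | 1270 | ±12 | 3.5K | 2.4% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 8 | 8 | GPT-5.1 | 1268 | ±6 | 3.3K | 1.5% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 9 | 10 | Gemini 3 Pro | 1265 | ±5 | 12K | 1.3% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 10 | 10 | Claude Sonnet 4.5 (Thinking) | 1260 | ±4 | 8.3K | 1.8% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 11 | 7 | Claude Opus 4.5 (Thinking) | 1248 | ±6 | 9.3K | 2.3% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 |
| 12 | 5 | Claude Sonnet 4.6 (Thinking) | 1235 | ±18 | 1.7K | 2.0% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00 |
| 13 | 17 | Claude Opus 4.5 | 1228 | ±9 | 2.9K | 1.8% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 14 | 22 | GPT-5 Chat | 1220 | ±6 | 10.4K | 2.1% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 15 | 16 | GPT-5.2 | 1213 | ±12 | 2.7K | 2.2% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 16 | 14 | Gemini 3 Flash Preview Thinking | 1212 | ±8 | 4.2K | 1.8% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 17 | 17 | Gemini 3 Flash Preview | 1198 | ±14 | 2K | 2.0% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 18 | 32 | Gemini 2.5 Pro High | 1189 | ±6 | 4.5K | 2.5% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 19 | 22 | GLM 5 | 1184 | ±22 | 795 | 2.5% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 20 | 17 | GPT-5.2 (High) | 1175 | ±10 | 6.4K | 1.7% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 21 | 42 | GPT-5.2 (Extra High) | 1168 | ±12 | 2.4K | 1.8% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 22 | 13 | GPT-5.3 Instant | 1160 | ±19 | 1.5K | 1.9% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 23 | 44 | Gemini 2.5 Pro | 1158 | ±5 | 7.8K | 3.3% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 24 | 52 | GPT-5 | 1157 | ±7 | 5.3K | 2.9% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 25 | 37 | Claude Sonnet 4.5 | 1156 | ±5 | 5.1K | 2.8% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 26 | 71 | Gemini 2.5 Flash Thinking | 1148 | ±7 | 2.6K | 4.2% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 27 | 48 | Claude Sonnet 4 (Thinking) | 1144 | ±6 | 4.2K | 3.5% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 28 | 29 | Nova Experimental Chat 12-10 | 1135 | ±19 | 720 | 2.0% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0 |
| 29 | 60 | Gemini 2.5 Flash Preview 0925 | 1131 | ±10 | 2.1K | 2.7% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 30 | 26 | Grok 4.1 Fast Non-Reasoning | 1130 | ±15 | 2K | 4.3% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 31 | 33 | Qwen3 30B A3B Instruct 2507 | 1127 | ±9 | 2.5K | 2.9% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 32 | 81 | OpenAI o3-pro | 1126 | ±10 | 2.3K | 3.1% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 33 | 40 | Qwen3 235B A22B Instruct 2507 | 1118 | ±11 | 2.5K | 2.1% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 34 | 68 | Qwen Plus (Aug'24) | 1115 | ±8 | 1.9K | 2.6% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 35 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1115 | ±12 | 2.2K | 2.7% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 36 | 29 | Qwen3 VL 235B A22B Instruct | 1114 | ±8 | 1.3K | 2.5% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 37 | 71 | Qwen3.5 397B A17B | 1112 | ±24 | 580 | 2.5% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 38 | 26 | GPT-5 (High) | 1110 | ±7 | 4.3K | 3.1% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 39 | 95 | Gemini 2.5 Flash | 1098 | ±9 | 4.8K | 2.7% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 40 | 68 | Grok 4 | 1093 | ±5 | 5.7K | 4.0% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |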
Showing the top 40 of 107 models.
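
When reading the Confidence Interval column, one rough heuristic is to check whether two models' score ranges overlap before treating a rank difference as meaningful. The sketch below assumes the ± value is a symmetric half-width around the VIBE Score. The leaderboard does not state how the interval is computed or at what confidence level, so this is an illustration, not a significance test. The `Rating` type and `intervals_overlap` helper are hypothetical names introduced here, and the figures are read from the first three leaderboard rows.

```python
from typing import NamedTuple


class Rating(NamedTuple):
    name: str
    score: float       # VIBE Score column
    half_width: float  # the ± value, assumed to be a symmetric half-width


def intervals_overlap(a: Rating, b: Rating) -> bool:
    """Return True when the two score ranges share at least one point."""
    return abs(a.score - b.score) <= a.half_width + b.half_width


# Values as read from the first three leaderboard rows.
opus_thinking = Rating("Claude Opus 4.6 (Thinking)", 1485, 11)
opus = Rating("Claude Opus 4.6", 1483, 11)
gemini = Rating("Gemini 3.1 Pro", 1344, 16)

print(intervals_overlap(opus_thinking, opus))  # True
print(intervals_overlap(opus, gemini))         # False
```

Under this assumption, the top two entries are not clearly separated by score, while the gap between the second and third entries is well outside both ranges.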