Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1485
Claude Opus 4.6 (Thinking)
1483
Claude Opus 4.6
1466
GPT-5.4 (High)
1344
Gemini 3.1 Pro
1295
Claude Sonnet 4.6
1280
GPT-5.1 (Medium)
1274
GPT-5.2 Instant
1274
GPT-5.1 (High)
1270
Gemini 3 Pro (Low)
1268
GPT-5.1
1265
Gemini 3 Pro
1264
Nova Experimental Chat 11-10
1260
Claude Sonnet 4.5 (Thinking)
1248
Claude Opus 4.5 (Thinking)
1235
Claude Sonnet 4.6 (Thinking)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1485±112.3K1.3%2.5%56 tps1.6s200K$5.00$25.00
22Claude Opus 4.61483±113.2K0.9%2.1%48 tps1.7s200K$5.00$25.00
34GPT-5.4 (High)1466±195152.8%4.6%68 tps7.9s1M$2.50$15.00
46Gemini 3.1 Pro1344±162.8K2.4%3.5%35 tps4.1s1M$2.00$12.00
54Claude Sonnet 4.61295±161.8K0.8%1.6%47 tps1.2s200K$3.00$15.00
68GPT-5.1 (Medium)1280±127401.3%<0.1%86 tps3.8s400K$0.83$6.67
710GPT-5.2 Instant1274±83.9K2.2%1.7%52 tps2.0s400K$1.75$14.00
88GPT-5.1 (High)1274±64.2K2.5%3.2%76 tps6.9s400K$1.25$10.00
914Gemini 3 Pro (Low)1270±123.5K2.4%2.4%51 tps3.5s1M$2.00$12.00
108GPT-5.11268±63.3K1.5%2.3%71 tps1.4s400K$1.42$11.33
1110Gemini 3 Pro1265±512K1.3%2.1%50 tps3.6s1M$2.00$12.00
1216Nova Experimental Chat 11-101264±131.1K1.4%0.4%84 tps8.9s98K$0$0
1310Claude Sonnet 4.5 (Thinking)1260±48.3K1.8%1.9%44 tps1.1s200K$3.00$15.00
147Claude Opus 4.5 (Thinking)1248±69.3K2.3%1.8%49 tps1.4s200K$5.00$25.00
155Claude Sonnet 4.6 (Thinking)1235±181.7K2.0%4.7%57 tps1.1s200K$3.00$15.00
1617Claude Opus 4.51228±92.9K1.8%1.5%45 tps1.5s200K$5.00$25.00
1722GPT-5 Chat1220±610.4K2.1%1.3%95 tps0.9s400K$1.25$10.00
1816GPT-5.21213±122.7K2.2%4.1%18 tps2.7s400K$1.75$14.00
1914Gemini 3 Flash Preview Thinking1212±84.2K1.8%1.6%3 tps6.2s1M$0.50$3.00
2019Mistral Medium 3.11198±102.2K2.8%<0.1%77 tps0.7s128K$0.40$2.00
2117Gemini 3 Flash Preview1198±142K2.0%1.3%138 tps1.4s1M$0.50$3.00
2232Gemini 2.5 Pro High1189±64.5K2.5%1.5%48 tps2.3s1M$1.25$10.00
2322GLM 51184±227952.5%3.4%36 tps2.7s200K$0.72$2.55
2417GPT-5.2 (High)1175±106.4K1.7%6.7%18 tps16.3s400K$1.75$14.00
2542GPT-5.2 (Extra High) 1168±122.4K1.8%13.2%17 tps20.5s400K$1.75$14.00
2637Nova Experimental Chat 10-201162±91.1K3.1%<0.1%30 tps0.5s98K$0$0
2756Gemini 2.5 Pro Low1162±82.4K3.3%<0.1%89 tps2.4s1M$1.25$10.00
2813GPT-5.3 Instant1160±191.5K1.9%0.9%63 tps0.8s400K$1.75$14.00
2984Claude Sonnet 3.7 (Thinking)1159±71.9K5.3%<0.1%41 tps2.6s200K$3.00$15.00
3044Gemini 2.5 Pro1158±57.8K3.3%2.3%45 tps2.6s1M$1.25$10.00
3152GPT-51157±75.3K2.9%3.1%78 tps23.1s400K$1.25$9.67
3237Claude Sonnet 4.51156±55.1K2.8%1.4%41 tps1.3s200K$1.80$9.00
3371Gemini 2.5 Flash Thinking1148±72.6K4.2%2.2%88 tps6.4s1M$0.30$2.50
3448Claude Sonnet 4 (Thinking)1144±64.2K3.5%1.5%52 tps1.5s200K$3.00$13.67
3580GPT-5 (Minimal)1141±111.9K3.0%<0.1%67 tps1.4s400K$1.25$10.00
36111Claude Sonnet 3.71140±92K5.0%<0.1%39 tps1.6s200K$3.00$15.00
3743Gemini 2.5 Flash Thinking Preview 09251138±72.3K3.4%<0.1%111 tps4.7s1M$0.30$2.50
3829Nova Experimental Chat 12-101135±197202.0%2.4%84 tps12.9s98K$0$0
3960Gemini 2.5 Flash Preview 09251131±102.1K2.7%1.2%5 tps0.9s1M$0.13$0.97
4026Grok 4.1 Fast Non-Reasoning1130±152K4.3%0.9%101 tps0.5s2M$0.20$0.50
View All (133 models)