Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1228
Claude Opus 4.5
1235
Claude Sonnet 4.6 (Thinking)
1248
Claude Opus 4.5 (Thinking)
1260
Claude Sonnet 4.5 (Thinking)
1265
Gemini 3 Pro
1268
GPT-5.1
1270
Gemini 3 Pro (Low)
1274
GPT-5.1 (High)
1274
GPT-5.2 Instant
1295
Claude Sonnet 4.6
1344
Gemini 3.1 Pro
1483
Claude Opus 4.6
1485
Claude Opus 4.6 (Thinking)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
12117Claude Opus 4.51228±92.9K1.8%1.5%45 tps1.5s200K$5.00$25.00
1225Claude Sonnet 4.6 (Thinking)1235±181.7K2.0%4.7%57 tps1.1s200K$3.00$15.00
1237Claude Opus 4.5 (Thinking)1248±69.3K2.3%1.8%49 tps1.4s200K$5.00$25.00
12410Claude Sonnet 4.5 (Thinking)1260±48.3K1.8%1.9%44 tps1.1s200K$3.00$15.00
12510Gemini 3 Pro1265±512K1.3%2.1%50 tps3.6s1M$2.00$12.00
1268GPT-5.11268±63.3K1.5%2.3%71 tps1.4s400K$1.42$11.33
12714Gemini 3 Pro (Low)1270±123.5K2.4%2.4%51 tps3.5s1M$2.00$12.00
1288GPT-5.1 (High)1274±64.2K2.5%3.2%76 tps6.9s400K$1.25$10.00
12910GPT-5.2 Instant1274±83.9K2.2%1.7%52 tps2.0s400K$1.75$14.00
1304Claude Sonnet 4.61295±161.8K0.8%1.6%47 tps1.2s200K$3.00$15.00
1316Gemini 3.1 Pro1344±162.8K2.4%3.5%35 tps4.1s1M$2.00$12.00
1322Claude Opus 4.61483±113.2K0.9%2.1%48 tps1.7s200K$5.00$25.00
1331Claude Opus 4.6 (Thinking)1485±112.3K1.3%2.5%56 tps1.6s200K$5.00$25.00
View All (133 models)