Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1635
GPT-5.4
1616
Claude Sonnet 4.6
1611
Claude Opus 4.6
1596
Claude Opus 4.6 (Thinking)
1540
Claude Sonnet 4.6 (Thinking)
1522
GPT-5.4 (High)
1485
Claude Opus 4.5
1464
Gemini 3.1 Pro
1462
Claude Opus 4.5 (Thinking)
1425
Claude Haiku 4.5 (Extended Thinking)
1417
Claude Sonnet 4.5 (Thinking)
1404
Claude Sonnet 4.5
1393
GPT-5.2 Instant
1393
GPT-5.3 Codex (High)
1378
GPT-5.2

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11GPT-5.41635±134K1.6%2.6%55 tps0.8s1M$2.50$15.00
21Claude Sonnet 4.61616±913.6K1.3%1.6%47 tps1.2s200K$3.00$15.00
31Claude Opus 4.61611±718.4K0.9%2.1%48 tps1.7s200K$5.00$25.00
44Claude Opus 4.6 (Thinking)1596±1012.8K1.2%2.5%56 tps1.6s200K$5.00$25.00
55Claude Sonnet 4.6 (Thinking)1540±811.3K2.6%4.7%57 tps1.1s200K$3.00$15.00
66GPT-5.4 (High)1522±133K3.1%4.6%68 tps7.9s1M$2.50$15.00
77Claude Opus 4.51485±611.7K1.8%1.5%45 tps1.5s200K$5.00$25.00
87Gemini 3.1 Pro1464±915.6K1.9%3.5%35 tps4.1s1M$2.00$12.00
96Claude Opus 4.5 (Thinking)1462±543.2K1.6%1.8%49 tps1.4s200K$5.00$25.00
1012Claude Haiku 4.5 (Extended Thinking)1425±710K3.7%1.4%115 tps0.7s200K$1.00$5.00
1110Claude Sonnet 4.5 (Thinking)1417±537.7K2.9%1.9%44 tps1.1s200K$3.00$15.00
1217Claude Sonnet 4.51404±612.9K5.0%1.4%41 tps1.3s200K$1.80$9.00
1310GPT-5.2 Instant1393±912.1K3.2%1.7%52 tps2.0s400K$1.75$14.00
149GPT-5.3 Codex (High)1393±142.8K0.9%2.0%61 tps17.8s400K$1.75$14.00
1513GPT-5.21378±88.5K2.9%4.1%18 tps2.7s400K$1.75$14.00
1615GPT-5.11370±79.1K3.6%2.3%71 tps1.4s400K$1.42$11.33
1713Gemini 3 Pro1362±934.5K2.5%2.1%50 tps3.6s1M$2.00$12.00
1813Claude Opus 4 (Thinking)1346±81.8K2.2%<0.1%28 tps1.3s200K$15.00$75.00
1917GPT-5.2 (High)1344±916.4K2.8%6.7%18 tps16.3s400K$1.75$14.00
2019Claude Haiku 4.51344±710.5K4.5%1.1%100 tps0.9s200K$1.00$5.00
2115GLM 51329±143.9K3.1%3.4%36 tps2.7s200K$0.72$2.55
2236Claude Opus 4.11328±74.9K4.2%3.0%17 tps3.7s200K$15.00$75.00
2319GPT-5.3 Instant1321±153.4K2.2%0.9%63 tps0.8s400K$1.75$14.00
2419Gemini 3 Pro (Low)1319±88.9K4.2%2.4%51 tps3.5s1M$2.00$12.00
2519Kimi K2.51307±156.4K3.2%6.5%33 tps1.7s262K$0.34$2.57
2621GPT-5.1 (Medium)1306±152K7.4%<0.1%86 tps3.8s400K$0.83$6.67
2727Claude Sonnet 4 (Thinking)1304±519.2K2.6%1.5%52 tps1.5s200K$3.00$13.67
2819GPT-5.3 Codex (Medium)1299±158902.2%2.3%62 tps10.3s400K$1.75$14.00
2919Gemini 3 Flash Preview Thinking1289±914.1K3.6%1.6%3 tps6.2s1M$0.50$3.00
3019GPT-5.1 (High)1289±109.9K4.1%3.2%76 tps6.9s400K$1.25$10.00
3129Claude Opus 41287±57.9K2.4%<0.1%25 tps1.5s200K$15.00$75.00
3229Claude Opus 4.1 (Thinking)1285±74.9K4.9%<0.1%20 tps3.9s200K$15.00$75.00
3327GPT-5 Codex (High)1282±511K3.5%3.2%122 tps7.1s400K$1.25$10.00
3436Qwen3.5 122B A17B1282±181.3K2.7%1.5%82 tps1.4s256K$0.40$3.20
3531GPT-5 Chat1272±420.8K4.7%1.3%95 tps0.9s400K$1.25$10.00
3631GPT-5.1 Codex (High)1271±814.1K3.4%3.2%96 tps3.9s400K$1.25$10.00
3743Claude Sonnet 41270±623.4K3.2%1.8%49 tps1.3s200K$3.00$15.00
3844GPT-4.5 Preview1265±151.1K2.6%<0.1%36 tps3.0s200K$75.00$150.00
3936GPT-5.2 Codex (Medium)1264±162.1K2.6%5.7%37 tps6.3s400K$1.75$14.00
4062OpenAI o1-mini1260±67.9K5.3%<0.1%118 tpsN/A128K$1.13$4.51
View All (382 models)