Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1635
GPT-5.4
1616
Claude Sonnet 4.6
1611
Claude Opus 4.6
1596
Claude Opus 4.6 (Thinking)
1540
Claude Sonnet 4.6 (Thinking)
1485
Claude Opus 4.5
1464
Gemini 3.1 Pro
1462
Claude Opus 4.5 (Thinking)
1425
Claude Haiku 4.5 (Extended Thinking)
1417
Claude Sonnet 4.5 (Thinking)
1404
Claude Sonnet 4.5
1393
GPT-5.2 Instant
1393
GPT-5.3 Codex (High)
1378
GPT-5.2
1370
GPT-5.1

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11GPT-5.41635±134K1.6%2.6%55 tps0.8s1M$2.50$15.00
21Claude Sonnet 4.61616±913.6K1.3%1.6%47 tps1.2s200K$3.00$15.00
31Claude Opus 4.61611±718.4K0.9%2.1%48 tps1.7s200K$5.00$25.00
44Claude Opus 4.6 (Thinking)1596±1012.8K1.2%2.5%56 tps1.6s200K$5.00$25.00
55Claude Sonnet 4.6 (Thinking)1540±811.3K2.6%4.7%57 tps1.1s200K$3.00$15.00
67Claude Opus 4.51485±611.7K1.8%1.5%45 tps1.5s200K$5.00$25.00
77Gemini 3.1 Pro1464±915.6K1.9%3.5%35 tps4.1s1M$2.00$12.00
86Claude Opus 4.5 (Thinking)1462±543.2K1.6%1.8%49 tps1.4s200K$5.00$25.00
912Claude Haiku 4.5 (Extended Thinking)1425±710K3.7%1.4%115 tps0.7s200K$1.00$5.00
1010Claude Sonnet 4.5 (Thinking)1417±537.7K2.9%1.9%44 tps1.1s200K$3.00$15.00
1117Claude Sonnet 4.51404±612.9K5.0%1.4%41 tps1.3s200K$1.80$9.00
1210GPT-5.2 Instant1393±912.1K3.2%1.7%52 tps2.0s400K$1.75$14.00
139GPT-5.3 Codex (High)1393±142.8K0.9%2.0%61 tps17.8s400K$1.75$14.00
1413GPT-5.21378±88.5K2.9%4.1%18 tps2.7s400K$1.75$14.00
1515GPT-5.11370±79.1K3.6%2.3%71 tps1.4s400K$1.42$11.33
1613Gemini 3 Pro1362±934.5K2.5%2.1%50 tps3.6s1M$2.00$12.00
1717GPT-5.2 (High)1344±916.4K2.8%6.7%18 tps16.3s400K$1.75$14.00
1819Claude Haiku 4.51344±710.5K4.5%1.1%100 tps0.9s200K$1.00$5.00
1915GLM 51329±143.9K3.1%3.4%36 tps2.7s200K$0.72$2.55
2019GPT-5.3 Instant1321±153.4K2.2%0.9%63 tps0.8s400K$1.75$14.00
2119Gemini 3 Pro (Low)1319±88.9K4.2%2.4%51 tps3.5s1M$2.00$12.00
2219Kimi K2.51307±156.4K3.2%6.5%33 tps1.7s262K$0.34$2.57
2327Claude Sonnet 4 (Thinking)1304±519.2K2.6%1.5%52 tps1.5s200K$3.00$13.67
2419GPT-5.3 Codex (Medium)1299±158902.2%2.3%62 tps10.3s400K$1.75$14.00
2519Gemini 3 Flash Preview Thinking1289±914.1K3.6%1.6%3 tps6.2s1M$0.50$3.00
2619GPT-5.1 (High)1289±109.9K4.1%3.2%76 tps6.9s400K$1.25$10.00
2727GPT-5 Codex (High)1282±511K3.5%3.2%122 tps7.1s400K$1.25$10.00
2836Qwen3.5 122B A17B1282±181.3K2.7%1.5%82 tps1.4s256K$0.40$3.20
2931GPT-5 Chat1272±420.8K4.7%1.3%95 tps0.9s400K$1.25$10.00
3031GPT-5.1 Codex (High)1271±814.1K3.4%3.2%96 tps3.9s400K$1.25$10.00
3143Claude Sonnet 41270±623.4K3.2%1.8%49 tps1.3s200K$3.00$15.00
3236GPT-5.2 Codex (Medium)1264±162.1K2.6%5.7%37 tps6.3s400K$1.75$14.00
3327GPT-5.2 Codex (High)1255±142.6K2.6%8.8%41 tps12.9s400K$1.75$14.00
3443GPT-5.1 Codex Max1254±94.3K4.0%3.0%118 tps4.1s400K$1.25$10.00
3543Gemini 3 Flash Preview1245±115.5K3.7%1.3%138 tps1.4s1M$0.50$3.00
3631Qwen3 Next 80B A3B Instruct1241±113.5K8.9%0.6%84 tps1.1s256K$0.20$1.42
3777Grok 4.20 Multi Agent Beta1240±236853.5%1.2%56 tps8.8s2M$2.00$6.00
3849Grok 4 Fast Non-Reasoning1232±73.9K9.8%1.5%93 tps0.6s2M$0.27$0.67
3974Qwen3.5 397B A17B1231±201.7K2.8%4.3%57 tps1.4s256K$0.52$3.00
4036Qwen3.5 27B1230±176904.2%3.7%55 tps2.6s256K$0.30$2.40
View All (273 models)