Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1596
Claude Opus 4.6
1594
Claude Sonnet 4.6
1594
GPT-5.4
1566
Claude Opus 4.6 (Thinking)
1506
Claude Sonnet 4.6 (Thinking)
1464
GPT-5.4 (High)
1446
Claude Opus 4.5 (Thinking)
1418
Gemini 3.1 Pro
1409
Claude Opus 4.5
1381
GPT-5.3 Codex (High)
1362
Claude Sonnet 4.5 (Thinking)
1358
GPT-5.2 Instant
1353
Claude Haiku 4.5 (Extended Thinking)
1352
Claude Opus 4 (Thinking)
1340
GPT-5.2

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.61596±621.6K1.1%2.1%48 tps1.7s200K$5.00$25.00
21Claude Sonnet 4.61594±1015.7K1.4%1.6%47 tps1.2s200K$3.00$15.00
31GPT-5.41594±144.4K1.6%2.6%55 tps0.8s1M$2.50$15.00
44Claude Opus 4.6 (Thinking)1566±816.5K1.6%2.5%56 tps1.6s200K$5.00$25.00
55Claude Sonnet 4.6 (Thinking)1506±816.2K3.5%4.7%57 tps1.1s200K$3.00$15.00
66GPT-5.4 (High)1464±124.9K3.9%4.6%68 tps7.9s1M$2.50$15.00
76Claude Opus 4.5 (Thinking)1446±460.8K1.9%1.8%49 tps1.4s200K$5.00$25.00
87Gemini 3.1 Pro1418±922K2.5%3.5%35 tps4.1s1M$2.00$12.00
97Claude Opus 4.51409±515.1K2.2%1.5%45 tps1.5s200K$5.00$25.00
109GPT-5.3 Codex (High)1381±93.2K1.2%2.0%61 tps17.8s400K$1.75$14.00
1110Claude Sonnet 4.5 (Thinking)1362±458.2K3.3%1.9%44 tps1.1s200K$3.00$15.00
1210GPT-5.2 Instant1358±615.7K3.3%1.7%52 tps2.0s400K$1.75$14.00
1312Claude Haiku 4.5 (Extended Thinking)1353±414.3K3.8%1.4%115 tps0.7s200K$1.00$5.00
1413Claude Opus 4 (Thinking)1352±52.6K2.6%<0.1%28 tps1.3s200K$15.00$75.00
1513GPT-5.21340±811.3K3.2%4.1%18 tps2.7s400K$1.75$14.00
1613Gemini 3 Pro1337±559.4K2.6%2.1%50 tps3.6s1M$2.00$12.00
1715GLM 51324±1411.7K3.3%3.4%36 tps2.7s200K$0.72$2.55
1815GPT-5.11319±712.9K3.4%2.3%71 tps1.4s400K$1.42$11.33
1917Claude Sonnet 4.51307±320.9K5.0%1.4%41 tps1.3s200K$1.80$9.00
2017GPT-5.2 (High)1297±830.7K2.8%6.7%18 tps16.3s400K$1.75$14.00
2121GPT-5.1 (Medium)1291±93.2K6.4%<0.1%86 tps3.8s400K$0.83$6.67
2219Gemini 3 Pro (Low)1291±611.9K4.2%2.4%51 tps3.5s1M$2.00$12.00
2319GPT-5.1 (High)1290±619.1K3.5%3.2%76 tps6.9s400K$1.25$10.00
2419Gemini 3 Flash Preview Thinking1286±632.7K3.3%1.6%3 tps6.2s1M$0.50$3.00
2519Claude Haiku 4.51283±316.4K4.5%1.1%100 tps0.9s200K$1.00$5.00
2619MiniMax M2.51283±285103.8%1.4%70 tps1.9s205K$0.28$1.20
2719GPT-5.3 Codex (Medium)1278±271.1K2.3%2.3%62 tps10.3s400K$1.75$14.00
2829Claude Opus 41274±412.4K2.7%<0.1%25 tps1.5s200K$15.00$75.00
2929Claude Opus 4.1 (Thinking)1272±57.7K5.2%<0.1%20 tps3.9s200K$15.00$75.00
3019GPT-5.3 Instant1271±124.2K2.5%0.9%63 tps0.8s400K$1.75$14.00
3127Claude Sonnet 4 (Thinking)1261±325.9K2.9%1.5%52 tps1.5s200K$3.00$13.67
3227GPT-5 Codex (High)1260±718.5K3.3%3.2%122 tps7.1s400K$1.25$10.00
3327GPT-5 (High)1259±416.2K3.5%4.5%81 tps35.9s400K$1.25$10.00
3427GPT-5.2 Codex (High)1257±123.1K2.8%8.8%41 tps12.9s400K$1.75$14.00
3536Claude Opus 4.11254±47.1K4.6%3.0%17 tps3.7s200K$15.00$75.00
3631GPT-5.1 Codex (High)1240±837K3.3%3.2%96 tps3.9s400K$1.25$10.00
3731Grok 4.1 Fast Non-Reasoning1239±69.4K5.4%0.9%101 tps0.5s2M$0.20$0.50
3831GPT-5 Chat1231±435K4.5%1.3%95 tps0.9s400K$1.25$10.00
3937Nova Experimental Chat 11-101230±85.2K6.3%0.4%84 tps8.9s98K$0$0
4037Polaris Alpha1226±147555.6%<0.1%48 tps1.1s256K$0$0
View All (305 models)