Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1388
Claude Opus 4.6 (Thinking)
1339
GPT-5.4
1328
Claude Opus 4.6
1325
Claude Sonnet 4.6 (Thinking)
1268
Gemini 3.1 Pro
1266
GPT-5.2 Instant
1258
GPT-5.1 (High)
1243
GPT-5 (High)
1233
GPT-5.1
1232
Claude Sonnet 4.6
1226
Qwen3 30B A3B Instruct 2507
1217
Gemini 3 Pro
1208
GPT-5 Chat
1202
Qwen3 VL 235B A22B Instruct
1191
GPT-5.3 Instant

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1388±101.8K0.8%2.5%56 tps1.6s200K$5.00$25.00
22GPT-5.41339±155300.9%2.6%55 tps0.8s1M$2.50$15.00
32Claude Opus 4.61328±82.3K0.9%2.1%48 tps1.7s200K$5.00$25.00
45Claude Sonnet 4.6 (Thinking)1325±71.6K1.2%4.7%57 tps1.1s200K$3.00$15.00
56Gemini 3.1 Pro1268±84.3K0.7%3.5%35 tps4.1s1M$2.00$12.00
610GPT-5.2 Instant1266±46.2K0.7%1.7%52 tps2.0s400K$1.75$14.00
78GPT-5.1 (High)1258±65.3K1.3%3.2%76 tps6.9s400K$1.25$10.00
826GPT-5 (High)1243±73K2.3%4.5%81 tps35.9s400K$1.25$10.00
98GPT-5.11233±83.3K1.4%2.3%71 tps1.4s400K$1.42$11.33
104Claude Sonnet 4.61232±111.6K0.9%1.6%47 tps1.2s200K$3.00$15.00
1133Qwen3 30B A3B Instruct 25071226±85.6K2.2%1.2%55 tps1.3s131K$0.13$0.72
1210Gemini 3 Pro1217±511.7K0.9%2.1%50 tps3.6s1M$2.00$12.00
1322GPT-5 Chat1208±510.4K2.2%1.3%95 tps0.9s400K$1.25$10.00
1429Qwen3 VL 235B A22B Instruct1202±101.8K4.5%3.1%75 tps1.9s129K$0.37$1.81
1513GPT-5.3 Instant1191±111.6K1.2%0.9%63 tps0.8s400K$1.75$14.00
1617Grok 4.20 Beta Reasoning1190±175400.9%1.1%77 tps4.5s2M$2.00$5.50
1714Gemini 3 Pro (Low)1189±64.8K0.9%2.4%51 tps3.5s1M$2.00$12.00
1832Gemini 2.5 Pro High1182±36.7K2.2%1.5%48 tps2.3s1M$1.25$10.00
1940Qwen3 235B A22B Instruct 25071178±65.1K1.9%6.8%13 tps1.9s262K$0.13$0.52
20106Claude Sonnet 3.5 v21177±81.6K1.2%<0.1%46 tps1.4s200K$3.00$15.00
2117GPT-5.2 (High)1177±77.4K0.8%6.7%18 tps16.3s400K$1.75$14.00
2214Gemini 3 Flash Preview Thinking1173±54.4K0.6%1.6%3 tps6.2s1M$0.50$3.00
2381GPT-4o1170±92.3K2.8%1.0%49 tps2.4s128K$3.71$12.57
2416GPT-5.21168±63K1.2%4.1%18 tps2.7s400K$1.75$14.00
2517Gemini 3 Flash Preview1166±72.4K0.6%1.3%138 tps1.4s1M$0.50$3.00
2660Gemini 2.5 Flash Preview 09251163±72.7K2.9%1.2%5 tps0.9s1M$0.13$0.97
277Claude Opus 4.5 (Thinking)1155±55.3K1.6%1.8%49 tps1.4s200K$5.00$25.00
2826Grok 4.1 Fast Non-Reasoning1151±63.2K1.8%0.9%101 tps0.5s2M$0.20$0.50
2968Qwen Plus (Aug'24)1150±57.5K1.4%1.4%53 tps1.3s30K$0.40$1.20
3017Claude Opus 4.51144±82.4K2.1%1.5%45 tps1.5s200K$5.00$25.00
3137Qwen3 Omni 30B A3B Thinking1139±71.6K1.2%3.7%67 tps1.2s66K$0.97$1.79
3229Nova Experimental Chat 12-101138±91.9K0.5%2.4%84 tps12.9s98K$0$0
3310Claude Sonnet 4.5 (Thinking)1136±56.8K2.7%1.9%44 tps1.1s200K$3.00$15.00
3442GPT-5.2 (Extra High) 1131±53.7K0.9%13.2%17 tps20.5s400K$1.75$14.00
3526Claude Haiku 4.5 (Extended Thinking)1129±53.6K1.6%1.4%115 tps0.7s200K$1.00$5.00
3642Qwen3 Max Instruct Preview1126±44.3K2.8%1.1%31 tps1.7s256K$1.43$6.61
3744Gemini 2.5 Pro1126±416.2K1.5%2.3%45 tps2.6s1M$1.25$10.00
3844Grok 4.1 Fast Reasoning1119±65.4K1.5%1.5%58 tps7.3s2M$0.20$0.50
3937Claude Sonnet 4.51116±65K3.1%1.4%41 tps1.3s200K$1.80$9.00
4056DeepSeek V3.1 Turbo1114±64K2.1%0.9%173 tps1.3s164K$2.00$3.75
View All (142 models)