Last updated about 1 month ago

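| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | 40 | Qwen3 235B A22B Instruct 2507 | 1147 | ±2 | 24.5K | 1.4% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 242 | 7 | Claude Opus 4.5 (Thinking) | 1147 | ±4 | 21.9K | 1.6% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 |
| 243 | 33 | Kimi K2.5 | 1147 | ±3 | 16.1K | 1.2% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 244 | 10 | Claude Sonnet 4.5 (Thinking) | 1148 | ±2 | 27.2K | 2.5% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 245 | 22 | GLM 5 | 1149 | ±4 | 6.4K | 1.2% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 246 | 48 | Step 3.5 Flash | 1150 | ±6 | 2.9K | 1.7% | 2.2% | 109 tps | 0.6s | 256K | $0.05 | $0.15 |
| 247 | 42 | Qwen3 Max Instruct Preview | 1151 | ±2 | 26.4K | 2.0% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |
| 248 | 40 | DeepSeek V3.2 | 1152 | ±3 | 16.5K | 1.1% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09 |
| 249 | 44 | DeepSeek V3.1 Terminus Chat | 1152 | ±2 | 14.5K | 1.8% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 250 | 44 | Gemini 2.5 Pro | 1153 | ±2 | 38.9K | 1.8% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 251 | 52 | Grok 4 Fast Non-Reasoning | 1156 | ±2 | 16.7K | 2.2% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 |
| 252 | 26 | GPT-5 (High) | 1156 | ±3 | 12.3K | 2.3% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 253 | 62 | Qwen3 Omni 30B A3B Instruct | 1157 | ±6 | 2.3K | 1.9% | 3.9% | 65 tps | 1.2s | 66K | $0.35 | $0.97 |
| 254 | 37 | Kimi K2.5 Instant | 1158 | ±6 | 4K | 1.5% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 255 | 44 | Grok 4.1 Fast Reasoning | 1160 | ±2 | 23.1K | 1.8% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 |
| 256 | 33 | Qwen3 30B A3B Instruct 2507 | 1167 | ±2 | 24.4K | 1.7% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 257 | 17 | GPT-5.2 (High) | 1168 | ±2 | 31.4K | 1.1% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 258 | 17 | Gemini 3 Flash Preview | 1173 | ±3 | 12.8K | 0.7% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 259 | 32 | Gemini 2.5 Pro High | 1175 | ±1 | 27.6K | 2.0% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 260 | 48 | gpt-oss-120b | 1182 | ±2 | 26.3K | 1.3% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 261 | 33 | Qwen3 Next 80B A3B Instruct | 1185 | ±2 | 19.8K | 1.9% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 262 | 37 | Qwen3 Omni 30B A3B Thinking | 1188 | ±5 | 5.3K | 1.2% | 3.7% | 67 tps | 1.2s | 66K | $0.97 | $1.79 |
| 263 | 16 | GPT-5.2 | 1193 | ±2 | 16.3K | 0.9% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 264 | 14 | Gemini 3 Pro (Low) | 1195 | ±3 | 20.3K | 1.1% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 265 | 14 | Gemini 3 Flash Preview Thinking | 1195 | ±3 | 19.7K | 1.0% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 266 | 33 | Grok 4.20 Multi Agent Beta | 1197 | ±7 | 1.7K | 1.8% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 |
| 267 | 29 | Nova Experimental Chat 12-10 | 1206 | ±3 | 9K | 1.2% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0 |
| 268 | 13 | GPT-5.3 Instant | 1206 | ±4 | 5.5K | 1.0% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 269 | 26 | Grok 4.1 Fast Non-Reasoning | 1207 | ±3 | 20.1K | 1.7% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 270 | 22 | GPT-5 Chat | 1208 | ±2 | 58K | 1.4% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 271 | 29 | Qwen3 VL 235B A22B Instruct | 1211 | ±3 | 10.2K | 2.5% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 272 | 4 | Claude Sonnet 4.6 | 1212 | ±5 | 6K | 1.1% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 |
| 273 | 17 | Grok 4.20 Beta Reasoning | 1216 | ±7 | 2.1K | 1.7% | 1.1% | 77 tps | 4.5s | 2M | $2.00 | $5.50 |
| 274 | 10 | Gemini 3 Pro | 1222 | ±3 | 44.5K | 1.1% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 275 | 5 | Claude Sonnet 4.6 (Thinking) | 1256 | ±5 | 5.8K | 1.4% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00 |
| 276 | 10 | GPT-5.2 Instant | 1260 | ±3 | 27.1K | 0.8% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 277 | 6 | Gemini 3.1 Pro | 1270 | ±5 | 12.3K | 1.1% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |
| 278 | 2 | Claude Opus 4.6 | 1274 | ±4 | 7.1K | 1.4% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 |
| 279 | 8 | GPT-5.1 | 1288 | ±2 | 19.7K | 1.3% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 280 | 8 | GPT-5.1 (High) | 1289 | ±2 | 23K | 1.4% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |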
Total: 283 models.