Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1144
gpt-oss-120b
1144
DeepSeek V3.2
1147
Qwen3 235B A22B Instruct 2507
1148
Qwen3 Max Instruct Preview
1150
DeepSeek V3.1 Terminus Chat
1151
Kimi K2.5
1151
Qwen3.5 122B A17B
1158
Qwen3 30B A3B Instruct 2507
1159
Gemini 3 Flash Preview
1159
Gemini 2.5 Pro High
1161
Qwen3 Next 80B A3B Instruct
1164
Step 3.5 Flash
1165
MiniMax M2.1 Lightning
1167
MiniMax M2.7
1168
Qwen3 Omni 30B A3B Instruct

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
24148gpt-oss-120b1144±240.7K3.7%0.7%213 tps0.5s131K$0.11$0.50
24240DeepSeek V3.21144±320.7K1.9%1.4%83 tps5.1s131K$0.43$1.09
24340Qwen3 235B A22B Instruct 25071147±232.2K4.7%6.8%13 tps1.9s262K$0.13$0.52
24442Qwen3 Max Instruct Preview1148±236.6K3.5%1.1%31 tps1.7s256K$1.43$6.61
24544DeepSeek V3.1 Terminus Chat1150±317.8K4.2%3.4%27 tps1.5s131K$0.86$1.80
24633Kimi K2.51151±332.5K1.8%6.5%33 tps1.7s262K$0.34$2.57
24752Qwen3.5 122B A17B1151±44.7K1.6%1.5%82 tps1.4s256K$0.40$3.20
24833Qwen3 30B A3B Instruct 25071158±231.6K4.1%1.2%55 tps1.3s131K$0.13$0.72
24917Gemini 3 Flash Preview1159±317.8K2.1%1.3%138 tps1.4s1M$0.50$3.00
25032Gemini 2.5 Pro High1159±242.7K4.5%1.5%48 tps2.3s1M$1.25$10.00
25133Qwen3 Next 80B A3B Instruct1161±224.9K3.8%0.6%84 tps1.1s256K$0.20$1.42
25248Step 3.5 Flash1164±54K1.5%2.2%109 tps0.6s256K$0.05$0.15
25356MiniMax M2.1 Lightning1165±54.9K1.2%1.7%52 tps2.1s205K$0.30$2.40
25429MiniMax M2.71167±81.1K1.8%3.0%34 tps2.5s205K$0.30$1.20
25562Qwen3 Omni 30B A3B Instruct1168±53K2.3%3.9%65 tps1.2s66K$0.35$0.97
25637Kimi K2.5 Instant1171±46.2K1.8%2.9%32 tps3.0s262K$0.50$3.00
25726Claude Haiku 4.5 (Extended Thinking)1173±224.3K3.1%1.4%115 tps0.7s200K$1.00$5.00
25817Claude Opus 4.51173±222.5K2.2%1.5%45 tps1.5s200K$5.00$25.00
25917Grok 4.20 Beta Reasoning1175±73.3K1.8%1.1%77 tps4.5s2M$2.00$5.50
26016GPT-5.21176±222.6K1.8%4.1%18 tps2.7s400K$1.75$14.00
26126GPT-5 (High)1177±222.1K3.1%4.5%81 tps35.9s400K$1.25$10.00
262106GPT-5.4 nano1177±106502.3%0.7%149 tps0.5s400K$0.20$1.25
26326Grok 4.1 Fast Non-Reasoning1177±225.7K3.0%0.9%101 tps0.5s2M$0.20$0.50
26417GPT-5.2 (High)1180±254.6K1.9%6.7%18 tps16.3s400K$1.75$14.00
26514Gemini 3 Pro (Low)1180±328.9K2.2%2.4%51 tps3.5s1M$2.00$12.00
26622GLM 51182±417.3K2.1%3.4%36 tps2.7s200K$0.72$2.55
26733Grok 4.20 Multi Agent Beta1183±92.6K2.0%1.2%56 tps8.8s2M$2.00$6.00
26837Qwen3 Omni 30B A3B Thinking1186±37.5K2.1%3.7%67 tps1.2s66K$0.97$1.79
26929Nova Experimental Chat 12-101188±39.8K1.9%2.4%84 tps12.9s98K$0$0
27029Qwen3 VL 235B A22B Instruct1188±313.5K5.2%3.1%75 tps1.9s129K$0.37$1.81
27114Gemini 3 Flash Preview Thinking1190±247K2.3%1.6%3 tps6.2s1M$0.50$3.00
27222Grok 4.20 Beta Non-reasoning1192±111.3K3.1%1.1%151 tps0.6s2M$2.00$6.00
27322GPT-5 Chat1196±175.1K3.4%1.3%95 tps0.9s400K$1.25$10.00
27413GPT-5.3 Instant1199±69.3K1.7%0.9%63 tps0.8s400K$1.75$14.00
27517GPT-5.4 mini1203±108852.7%0.8%148 tps0.5s400K$0.75$4.50
27610Gemini 3 Pro1207±178K2.2%2.1%50 tps3.6s1M$2.00$12.00
27722MiniMax M2.7-highspeed1207±101.1K2.1%2.3%50 tps2.1s205K$0.60$2.40
27810Claude Sonnet 4.5 (Thinking)1228±166.2K3.4%1.9%44 tps1.1s200K$3.00$15.00
27910GPT-5.2 Instant1232±239.3K1.6%1.7%52 tps2.0s400K$1.75$14.00
2806Gemini 3.1 Pro1245±326K2.0%3.5%35 tps4.1s1M$2.00$12.00
View All (288 models)