Models

Last updated about 1 month ago

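| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 281 | 98 | Grok 3 | 1098 | ±4 | 19.1K | 5.5% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 282 | 98 | Gemini 2.5 Flash | 1098 | ±4 | 35.9K | 3.2% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 283 | 90 | Qwen3 Coder 480B A35B Instruct | 1099 | ±8 | 3.1K | 4.5% | 3.3% | 61 tps | 2.0s | 262K | $0.71 | $1.34 |
| 284 | 90 | DeepSeek V3 0324 | 1100 | ±4 | 15.1K | 4.3% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93 |
| 285 | 90 | Step 3.5 Flash | 1102 | ±24 | 810 | 3.6% | 2.2% | 109 tps | 0.6s | 256K | $0.05 | $0.15 |
| 286 | 90 | GPT-4o | 1102 | ±5 | 8.5K | 3.7% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 287 | 90 | Grok 3 Fast | 1102 | ±14 | 2.5K | 4.7% | 1.7% | 52 tps | 2.4s | 131K | $5.00 | $25.00 |
| 288 | 90 | Gemini 2.5 Flash Lite | 1103 | ±5 | 21.3K | 6.2% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 289 | 114 | GPT-5 Mini Low | 1104 | ±8 | 2.8K | 7.2% | <0.1% | 69 tps | 3.2s | 400K | $0.25 | $2.00 |
| 290 | 90 | Qwen Max | 1107 | ±4 | 18.3K | 4.2% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 291 | 90 | DeepSeek V3.2 Exp Chat | 1107 | ±4 | 5.5K | 6.1% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 292 | 85 | Qwen3 Omni 30B A3B Thinking | 1110 | ±10 | 2.3K | 6.0% | 3.7% | 67 tps | 1.2s | 66K | $0.97 | $1.79 |
| 293 | 85 | DeepSeek V3.1 Chat | 1110 | ±7 | 4.9K | 6.6% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 294 | 108 | Gemini 2.5 Pro Preview 0325 | 1111 | ±11 | 1.5K | 3.2% | <0.1% | 3 tps | 16.6s | 1M | $1.25 | $10.00 |
| 295 | 85 | GPT-5.2 Codex (Low) | 1113 | ±19 | 1.2K | 3.2% | 4.5% | 41 tps | 5.0s | 400K | $1.75 | $14.00 |
| 296 | 85 | GPT-5 Mini Minimal | 1114 | ±12 | 3.2K | 8.5% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 297 | 85 | Gemini 2.5 Flash Thinking | 1118 | ±4 | 13.7K | 3.6% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 298 | 97 | Gemini 2.5 Pro Preview 0605 | 1121 | ±10 | 1.7K | 2.3% | <0.1% | 0 tps | 3.7s | 1M | $1.25 | $10.00 |
| 299 | 77 | Gemini 2.5 Flash Lite Preview 0925 | 1122 | ±7 | 8.5K | 6.6% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 300 | 77 | GPT-4.1 | 1123 | ±5 | 32.8K | 5.2% | 3.7% | 112 tps | 1.3s | 1M | $2.00 | $8.00 |
| 301 | 97 | Ministral 8B 2512 | 1125 | ±15 | 510 | 7.3% | <0.1% | 174 tps | 0.5s | 128K | $0.15 | $0.15 |
| 302 | 77 | Grok 4 | 1125 | ±3 | 39.6K | 4.4% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 303 | 77 | Qwen3 Max Thinking Preview | 1127 | ±10 | 6.3K | 5.7% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 304 | 77 | Grok 4.20 Multi Agent Beta | 1129 | ±19 | 945 | 3.6% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 |
| 305 | 77 | DeepSeek V3.1 Turbo | 1130 | ±7 | 4.8K | 5.3% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 306 | 77 | GPT-5 Mini | 1131 | ±5 | 8.6K | 5.4% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 307 | 77 | Mistral Large 3 | 1131 | ±8 | 5.4K | 5.8% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 308 | 97 | Grok 3 Beta | 1134 | ±9 | 2K | 0.8% | <0.1% | 58 tps | 0.8s | 131K | $3.00 | $15.00 |
| 309 | 93 | Gemini 2.5 Flash Preview Thinking | 1136 | ±10 | 1.4K | 1.8% | <0.1% | 26 tps | 1.8s | 1M | $0.15 | $1.76 |
| 310 | 74 | Gemini 2.5 Flash Preview 0925 | 1140 | ±6 | 7.6K | 6.0% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 311 | 74 | Qwen3.5 397B A17B | 1142 | ±14 | 2.5K | 2.9% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 312 | 74 | Qwen Plus (Aug'24) | 1146 | ±5 | 17.2K | 4.7% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 313 | 69 | DeepSeek V3.1 Terminus Chat | 1158 | ±5 | 6.5K | 6.9% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 314 | 86 | GPT-5 (Minimal) | 1158 | ±5 | 8.3K | 7.4% | <0.1% | 67 tps | 1.4s | 400K | $1.25 | $10.00 |
| 315 | 86 | Gemini 2.5 Flash Preview | 1161 | ±8 | 3K | 1.1% | <0.1% | 138 tps | 6.9s | 1M | $0.15 | $0.60 |
| 316 | 69 | GLM 4.7 | 1161 | ±7 | 16.8K | 3.7% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 317 | 69 | GPT-5 Codex (Low) | 1163 | ±10 | 5K | 4.1% | 2.7% | 112 tps | 3.5s | 400K | $1.25 | $10.00 |
| 318 | 69 | Qwen3.5 35B A3B | 1164 | ±25 | 865 | 3.9% | 2.1% | 116 tps | 2.1s | 256K | $0.63 | $1.13 |
| 319 | 69 | gpt-oss-120b | 1165 | ±5 | 19.2K | 5.0% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 320 | 60 | Grok 4.20 Beta Reasoning | 1167 | ±22 | 1.2K | 4.1% | 1.1% | 77 tps | 4.5s | 2M | $2.00 | $5.50 |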
View All (404 models)