Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1129
Claude Haiku 4.5 (Extended Thinking)
1126
Qwen3 Max Instruct Preview
1126
Gemini 2.5 Pro
1119
Grok 4.1 Fast Reasoning
1116
Claude Sonnet 4.5
1114
DeepSeek V3.1 Turbo
1113
DeepSeek V3.2
1112
Gemini 2.5 Flash Lite
1111
Qwen Max
1111
Qwen3 32B
1110
GLM 5
1107
GPT-5 Mini Minimal
1103
DeepSeek V3 0324 Turbo
1098
Qwen3 32B Fast
1097
DeepSeek V3.1 Chat

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
4126Claude Haiku 4.5 (Extended Thinking)1129±53.6K1.6%1.4%115 tps0.7s200K$1.00$5.00
4242Qwen3 Max Instruct Preview1126±44.3K2.8%1.1%31 tps1.7s256K$1.43$6.61
4344Gemini 2.5 Pro1126±416.2K1.5%2.3%45 tps2.6s1M$1.25$10.00
4444Grok 4.1 Fast Reasoning1119±65.4K1.5%1.5%58 tps7.3s2M$0.20$0.50
4537Claude Sonnet 4.51116±65K3.1%1.4%41 tps1.3s200K$1.80$9.00
4656DeepSeek V3.1 Turbo1114±64K2.1%0.9%173 tps1.3s164K$2.00$3.75
4740DeepSeek V3.21113±53.6K0.8%1.4%83 tps5.1s131K$0.43$1.09
48101Gemini 2.5 Flash Lite1112±67.6K1.7%1.3%210 tps0.7s1M$0.10$0.40
4993Qwen Max1111±67.6K1.4%1.5%49 tps1.5s33K$1.60$6.40
5095Qwen3 32B1111±175151.9%3.9%30 tps3.1s41K$0.12$0.42
5122GLM 51110±71.8K0.8%3.4%36 tps2.7s200K$0.72$2.55
5284GPT-5 Mini Minimal1107±109703.5%1.2%63 tps1.4s400K$0.25$2.00
5393DeepSeek V3 0324 Turbo1103±54.4K1.9%6.3%12 tps2.4s164K$0.73$1.79
54121Qwen3 32B Fast1098±89K1.0%11.6%30 tps3.1s41K$0.10$0.25
5586DeepSeek V3.1 Chat1097±71.9K2.3%2.8%21 tps1.6s131K$0.38$1.00
5656DeepSeek V3.2 Thinking1096±63.8K0.9%9.0%30 tps2.6s131K$0.28$0.42
57111LongCat Flash Chat1095±71.7K2.8%0.8%85 tps0.9s131K$0.14$0.68
58121QwQ 32B1091±59.9K0.9%5.4%41 tps2.1s16K$0.43$0.56
5933Kimi K2.51090±64.5K0.7%6.5%33 tps1.7s262K$0.34$2.57
6086Nemotron 3 Nano (Thinking)1089±91.5K0.7%2.0%200 tps0.5s256K$0$0
6162GPT-5.1 Instant1085±63.7K1.1%1.3%50 tps1.9s400K$1.25$10.00
62106DeepSeek V3 03241084±45.7K1.4%5.8%12 tps2.7s164K$0.38$0.93
6352GPT-51083±57.6K2.2%3.1%78 tps23.1s400K$1.25$9.67
6486Claude Sonnet 41083±512K1.6%1.8%49 tps1.3s200K$3.00$15.00
6560MiniMax M2.11080±65.2K0.6%2.1%66 tps2.6s205K$0.30$1.20
6648Grok 4 Fast Reasoning1077±63.3K2.8%2.1%102 tps3.1s2M$0.30$0.75
67121NVIDIA Llama 3.3 Nemotron Super 49B v1.51076±127551.9%2.0%50 tps0.6s131K$0.09$0.33
6852Claude Haiku 4.51076±84.2K2.2%1.1%100 tps0.9s200K$1.00$5.00
6952Grok 4 Fast Non-Reasoning1075±62.9K3.3%1.5%93 tps0.6s2M$0.27$0.67
7095DeepSeek-R1 Turbo1075±61.9K2.4%2.6%29 tps1.8s64K$2.85$4.75
7156Gemini 3.1 Flash Lite Preview Thinking1071±135601.8%1.7%75 tps4.7s1M$0.25$1.50
7268Grok 41070±413.8K1.6%3.9%29 tps11.1s256K$3.00$15.00
7395Gemini 2.5 Flash1068±411.2K1.2%1.3%2 tps3.7s1M$0.30$2.50
7456MiniMax M2.1 Lightning1067±128550.6%1.7%52 tps2.1s205K$0.30$2.40
7579Qwen3 Max Thinking Preview1067±63.1K1.4%3.1%40 tps2.1s256K$1.20$6.00
7671Gemini 2.5 Flash Lite Preview 09251066±63.3K2.8%1.2%209 tps0.7s1M$0.25$0.35
77124Qwen3 235B A22B Thinking 25071065±71.8K1.9%2.5%53 tps1.6s131K$0.59$5.70
7844Kimi K2 Thinking Turbo1065±63K1.9%2.0%75 tps1.4s262K$1.15$8.00
7965Mistral Large 31064±71.8K2.2%2.1%51 tps1.0s256K$0.50$1.50
80118GPT-4.1 mini1062±55.5K1.8%1.1%67 tps0.9s1M$0.34$1.60
View All (193 models)