

Last updated about 1 month ago

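| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41 | 40 | DeepSeek V3.2 | 1113 | ±5 | 3.6K | 0.8% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09 |
| 42 | 101 | Gemini 2.5 Flash Lite | 1112 | ±6 | 7.6K | 1.7% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 43 | 93 | Qwen Max | 1111 | ±6 | 7.6K | 1.4% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 44 | 95 | Qwen3 32B | 1111 | ±17 | 515 | 1.9% | 3.9% | 30 tps | 3.1s | 41K | $0.12 | $0.42 |
| 45 | 22 | GLM 5 | 1110 | ±7 | 1.8K | 0.8% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 46 | 84 | GPT-5 Mini Minimal | 1107 | ±10 | 970 | 3.5% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 47 | 93 | DeepSeek V3 0324 Turbo | 1103 | ±5 | 4.4K | 1.9% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 48 | 86 | DeepSeek V3.1 Chat | 1097 | ±7 | 1.9K | 2.3% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 49 | 111 | LongCat Flash Chat | 1095 | ±7 | 1.7K | 2.8% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 50 | 62 | GPT-5.1 Instant | 1085 | ±6 | 3.7K | 1.1% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 51 | 106 | DeepSeek V3 0324 | 1084 | ±4 | 5.7K | 1.4% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93 |
| 52 | 52 | GPT-5 | 1083 | ±5 | 7.6K | 2.2% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 53 | 86 | Claude Sonnet 4 | 1083 | ±5 | 12K | 1.6% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 54 | 60 | MiniMax M2.1 | 1080 | ±6 | 5.2K | 0.6% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 55 | 48 | Grok 4 Fast Reasoning | 1077 | ±6 | 3.3K | 2.8% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 56 | 121 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 1076 | ±12 | 755 | 1.9% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 57 | 52 | Claude Haiku 4.5 | 1076 | ±8 | 4.2K | 2.2% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 58 | 52 | Grok 4 Fast Non-Reasoning | 1075 | ±6 | 2.9K | 3.3% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 |
| 59 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1071 | ±13 | 560 | 1.8% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 60 | 68 | Grok 4 | 1070 | ±4 | 13.8K | 1.6% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 61 | 95 | Gemini 2.5 Flash | 1068 | ±4 | 11.2K | 1.2% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 62 | 56 | MiniMax M2.1 Lightning | 1067 | ±12 | 855 | 0.6% | 1.7% | 52 tps | 2.1s | 205K | $0.30 | $2.40 |
| 63 | 79 | Qwen3 Max Thinking Preview | 1067 | ±6 | 3.1K | 1.4% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 64 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1066 | ±6 | 3.3K | 2.8% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 65 | 124 | Qwen3 235B A22B Thinking 2507 | 1065 | ±7 | 1.8K | 1.9% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 66 | 118 | GPT-4.1 mini | 1062 | ±5 | 5.5K | 1.8% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 67 | 81 | OpenAI o3-pro | 1061 | ±14 | 1.3K | 2.7% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 68 | 106 | Grok 3 | 1054 | ±6 | 7.1K | 1.7% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 69 | 95 | Kimi K2 Thinking | 1054 | ±9 | 1.9K | 3.8% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 70 | 71 | DeepSeek V3.1 | 1053 | ±13 | 1.8K | 1.6% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 71 | 44 | DeepSeek V3.1 Terminus Chat | 1053 | ±6 | 2.6K | 2.6% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 72 | 62 | MiniMax M2 | 1046 | ±6 | 3.8K | 1.9% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 73 | 119 | ERNIE 4.5 300B A47B | 1046 | ±6 | 5.3K | 1.3% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 74 | 133 | GPT-4.1 nano | 1046 | ±8 | 5.1K | 2.0% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 75 | 48 | Claude Sonnet 4 (Thinking) | 1044 | ±5 | 8.4K | 2.3% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 76 | 71 | Gemini 2.5 Flash Thinking | 1042 | ±5 | 6.5K | 1.5% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 77 | 71 | Qwen3.5 397B A17B | 1040 | ±10 | 1.4K | 1.4% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 78 | 119 | GLM 4.7 FP8 | 1039 | ±9 | 515 | 1.0% | 6.9% | 40 tps | 1.3s | 200K | $0.30 | $1.20 |
| 79 | 113 | Mistral Medium | 1035 | ±5 | 3.6K | 1.8% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 80 | 65 | GLM 4.6 | 1030 | ±8 | 2.6K | 2.8% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |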
Showing ranks 41 to 80 of 142 models.