Last updated about 1 month ago

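| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 129 | Command A | 1005 | ±5 | 8.6K | 1.7% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 82 | 129 | Qwen3 Max Thinking | 1012 | ±6 | 2.1K | 0.2% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 83 | 101 | DeepSeek V3 (Turbo) | 1013 | ±12 | 705 | 1.4% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 84 | 126 | DeepSeek V3 | 1013 | ±6 | 8.8K | 1.3% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 85 | 133 | Kimi K2 0905 | 1013 | ±11 | 2.1K | 3.7% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 86 | 148 | Qwen3 30B A3B Thinking 2507 | 1017 | ±9 | 2.2K | 1.8% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 87 | 139 | GLM 4.6V | 1018 | ±12 | 1.6K | 1.2% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 88 | 71 | Seed 1.8 251228 | 1018 | ±6 | 4.4K | 1.0% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 89 | 113 | GLM 4.5 | 1019 | ±6 | 2.5K | 1.6% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 90 | 153 | Qwen 2.5 32B Instruct | 1019 | ±8 | 1.4K | 1.8% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 91 | 143 | Gemini 2.0 Flash | 1022 | ±7 | 2.5K | 2.5% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 92 | 126 | Qwen3 VL 235B A22B Thinking | 1024 | ±11 | 1.6K | 4.2% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 93 | 71 | GPT-5 Mini | 1025 | ±6 | 3.2K | 2.0% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 94 | 68 | GLM 4.7 | 1026 | ±6 | 4.5K | 0.8% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 95 | 95 | DeepSeek V3.2 Exp Thinking | 1029 | ±11 | 1.4K | 0.7% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 96 | 86 | Qwen3 235B A22B | 1030 | ±9 | 3.1K | 1.6% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 97 | 65 | GLM 4.6 | 1030 | ±8 | 2.6K | 2.8% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 98 | 113 | Mistral Medium | 1035 | ±5 | 3.6K | 1.8% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 99 | 106 | DeepSeek V3.1 Terminus Thinking | 1035 | ±11 | 1.4K | 2.8% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 100 | 119 | GLM 4.7 FP8 | 1039 | ±9 | 515 | 1.0% | 6.9% | 40 tps | 1.3s | 200K | $0.30 | $1.20 |
| 101 | 71 | Qwen3.5 397B A17B | 1040 | ±10 | 1.4K | 1.4% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 102 | 71 | Gemini 2.5 Flash Thinking | 1042 | ±5 | 6.5K | 1.5% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 103 | 48 | Claude Sonnet 4 (Thinking) | 1044 | ±5 | 8.4K | 2.3% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 104 | 133 | GPT-4.1 nano | 1046 | ±8 | 5.1K | 2.0% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 105 | 119 | ERNIE 4.5 300B A47B | 1046 | ±6 | 5.3K | 1.3% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 106 | 62 | MiniMax M2 | 1046 | ±6 | 3.8K | 1.9% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 107 | 65 | DeepSeek V3.2 Exp Chat | 1047 | ±9 | 2.2K | 3.1% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 108 | 126 | Qwen3 30B A3B | 1051 | ±7 | 3.9K | 1.3% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 109 | 44 | DeepSeek V3.1 Terminus Chat | 1053 | ±6 | 2.6K | 2.6% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 110 | 71 | DeepSeek V3.1 | 1053 | ±13 | 1.8K | 1.6% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 111 | 95 | Kimi K2 Thinking | 1054 | ±9 | 1.9K | 3.8% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 112 | 106 | Grok 3 | 1054 | ±6 | 7.1K | 1.7% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 113 | 81 | OpenAI o3-pro | 1061 | ±14 | 1.3K | 2.7% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 114 | 118 | GPT-4.1 mini | 1062 | ±5 | 5.5K | 1.8% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 115 | 65 | Mistral Large 3 | 1064 | ±7 | 1.8K | 2.2% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 116 | 44 | Kimi K2 Thinking Turbo | 1065 | ±6 | 3K | 1.9% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 117 | 124 | Qwen3 235B A22B Thinking 2507 | 1065 | ±7 | 1.8K | 1.9% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 118 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1066 | ±6 | 3.3K | 2.8% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 119 | 79 | Qwen3 Max Thinking Preview | 1067 | ±6 | 3.1K | 1.4% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 120 | 56 | MiniMax M2.1 Lightning | 1067 | ±12 | 855 | 0.6% | 1.7% | 52 tps | 2.1s | 205K | $0.30 | $2.40 |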
Showing ranks 81 to 120 of 193 models.
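
Many neighboring entries in the leaderboard have VIBE Scores only a few points apart, so the Confidence Interval column matters when comparing them. The following is a minimal sketch of one way to check whether two entries' intervals overlap. It assumes the "±" value is a symmetric half-width around the VIBE Score, which the page does not state; the `Entry` class and `intervals_overlap` function are hypothetical names introduced for illustration, and the example values are read from the Command A, Qwen3 Max Thinking, and MiniMax M2.1 Lightning rows.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row, reduced to the fields needed here (hypothetical type)."""
    name: str
    score: float       # VIBE Score column
    half_width: float  # "±" value from the Confidence Interval column (assumed symmetric)

    @property
    def low(self) -> float:
        return self.score - self.half_width

    @property
    def high(self) -> float:
        return self.score + self.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two closed intervals share at least one point."""
    return a.low <= b.high and b.low <= a.high


command_a = Entry("Command A", 1005, 5)                   # interval [1000, 1010]
qwen3_max = Entry("Qwen3 Max Thinking", 1012, 6)          # interval [1006, 1018]
minimax_m21 = Entry("MiniMax M2.1 Lightning", 1067, 12)   # interval [1055, 1079]

print(intervals_overlap(command_a, qwen3_max))    # True
print(intervals_overlap(command_a, minimax_m21))  # False
```

Under that assumption, overlapping intervals suggest that the ordering of two adjacent entries may not be meaningful, while disjoint intervals point to a clearer separation. This is only a rough reading aid, not a statement of how the leaderboard itself computes or interprets its intervals.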