Last updated about 1 month ago

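| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 81 | OpenAI o3-pro | 1061 | ±14 | 1.3K | 2.7% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 82 | 106 | Grok 3 | 1054 | ±6 | 7.1K | 1.7% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 83 | 95 | Kimi K2 Thinking | 1054 | ±9 | 1.9K | 3.8% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 84 | 71 | DeepSeek V3.1 | 1053 | ±13 | 1.8K | 1.6% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 85 | 44 | DeepSeek V3.1 Terminus Chat | 1053 | ±6 | 2.6K | 2.6% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 86 | 126 | Qwen3 30B A3B | 1051 | ±7 | 3.9K | 1.3% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 87 | 65 | DeepSeek V3.2 Exp Chat | 1047 | ±9 | 2.2K | 3.1% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 88 | 62 | MiniMax M2 | 1046 | ±6 | 3.8K | 1.9% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 89 | 119 | ERNIE 4.5 300B A47B | 1046 | ±6 | 5.3K | 1.3% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 90 | 133 | GPT-4.1 nano | 1046 | ±8 | 5.1K | 2.0% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 91 | 48 | Claude Sonnet 4 (Thinking) | 1044 | ±5 | 8.4K | 2.3% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 92 | 71 | Gemini 2.5 Flash Thinking | 1042 | ±5 | 6.5K | 1.5% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 93 | 71 | Qwen3.5 397B A17B | 1040 | ±10 | 1.4K | 1.4% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 94 | 119 | GLM 4.7 FP8 | 1039 | ±9 | 515 | 1.0% | 6.9% | 40 tps | 1.3s | 200K | $0.30 | $1.20 |
| 95 | 106 | DeepSeek V3.1 Terminus Thinking | 1035 | ±11 | 1.4K | 2.8% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 96 | 113 | Mistral Medium | 1035 | ±5 | 3.6K | 1.8% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 97 | 65 | GLM 4.6 | 1030 | ±8 | 2.6K | 2.8% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 98 | 86 | Qwen3 235B A22B | 1030 | ±9 | 3.1K | 1.6% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 99 | 95 | DeepSeek V3.2 Exp Thinking | 1029 | ±11 | 1.4K | 0.7% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 100 | 68 | GLM 4.7 | 1026 | ±6 | 4.5K | 0.8% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 101 | 71 | GPT-5 Mini | 1025 | ±6 | 3.2K | 2.0% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 102 | 126 | Qwen3 VL 235B A22B Thinking | 1024 | ±11 | 1.6K | 4.2% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 103 | 143 | Gemini 2.0 Flash | 1022 | ±7 | 2.5K | 2.5% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 104 | 153 | Qwen 2.5 32B Instruct | 1019 | ±8 | 1.4K | 1.8% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 105 | 113 | GLM 4.5 | 1019 | ±6 | 2.5K | 1.6% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 106 | 71 | Seed 1.8 251228 | 1018 | ±6 | 4.4K | 1.0% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 107 | 139 | GLM 4.6V | 1018 | ±12 | 1.6K | 1.2% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 108 | 148 | Qwen3 30B A3B Thinking 2507 | 1017 | ±9 | 2.2K | 1.8% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 109 | 133 | Kimi K2 0905 | 1013 | ±11 | 2.1K | 3.7% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 110 | 126 | DeepSeek V3 | 1013 | ±6 | 8.8K | 1.3% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 111 | 101 | DeepSeek V3 (Turbo) | 1013 | ±12 | 705 | 1.4% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 112 | 129 | Qwen3 Max Thinking | 1012 | ±6 | 2.1K | 0.2% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 113 | 129 | Command A | 1005 | ±5 | 8.6K | 1.7% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 114 | 143 | Seed 1.6 250615 | 1005 | ±20 | 880 | 2.2% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 115 | 133 | DeepSeek V3.2 Speciale | 1003 | ±10 | 1.3K | 2.2% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 116 | 113 | Kimi K2 Fast | 1003 | ±4 | 10K | 1.8% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 117 | 113 | Gemini 2.5 Flash Lite Thinking | 1003 | ±8 | 3.7K | 2.4% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13 |
| 118 | 133 | Qwen3 14B | 1002 | ±6 | 3.6K | 1.6% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 119 | 148 | DeepSeek-R1 | 1001 | ±6 | 5K | 1.7% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 120 | 157 | Qwen3 Next 80B A3B Thinking | 1000 | ±7 | 3.2K | 3.0% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |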
Showing ranks 81 to 120 of 193 models.