Last updated about 1 month ago

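| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 86 | DeepSeek V3.1 Chat | 1084 | ±6 | 3.7K | 10.1% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 82 | 81 | Qwen3.5 27B | 1082 | ±26 | 550 | 2.7% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 83 | 48 | Step 3.5 Flash | 1079 | ±23 | 645 | 2.3% | 2.2% | 109 tps | 0.6s | 256K | $0.05 | $0.15 |
| 84 | 65 | DeepSeek V3.2 Exp Chat | 1079 | ±4 | 4.3K | 8.8% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 85 | 93 | Qwen Max | 1077 | ±5 | 8.8K | 9.1% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 86 | 86 | Qwen3 235B A22B | 1074 | ±10 | 2.8K | 14.4% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 87 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1070 | ±5 | 6.7K | 8.6% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 88 | 95 | Qwen3 32B | 1070 | ±18 | 620 | 7.5% | 3.9% | 30 tps | 3.1s | 41K | $0.12 | $0.42 |
| 89 | 95 | Kimi K2 Thinking | 1064 | ±12 | 1.6K | 6.8% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 90 | 95 | DeepSeek V3.2 Exp Thinking | 1063 | ±8 | 5K | 3.4% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 91 | 118 | GPT-4.1 mini | 1060 | ±4 | 11.7K | 6.8% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 92 | 113 | Gemini 2.5 Flash Lite Thinking | 1059 | ±5 | 6.6K | 9.5% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13 |
| 93 | 93 | DeepSeek V3 0324 Turbo | 1055 | ±5 | 9.3K | 10.3% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 94 | 71 | Qwen3.5 397B A17B | 1055 | ±11 | 1.6K | 2.1% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 95 | 139 | Seed 2.0 Mini (Medium) | 1053 | ±21 | 605 | 4.0% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 96 | 111 | Grok 3 Fast | 1051 | ±17 | 1.1K | 2.6% | 1.7% | 52 tps | 2.4s | 131K | $5.00 | $25.00 |
| 97 | 81 | OpenAI o3-pro | 1048 | ±8 | 2.2K | 3.5% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 98 | 37 | Qwen3 Omni 30B A3B Thinking | 1047 | ±11 | 1.3K | 5.9% | 3.7% | 67 tps | 1.2s | 66K | $0.97 | $1.79 |
| 99 | 106 | DeepSeek V3.1 Terminus Thinking | 1047 | ±7 | 2.5K | 11.6% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 100 | 119 | GLM 4.7 FP8 | 1046 | ±19 | 490 | 3.0% | 6.9% | 40 tps | 1.3s | 200K | $0.30 | $1.20 |
| 101 | 79 | Qwen3 Max Thinking Preview | 1044 | ±5 | 5.1K | 7.7% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 102 | 143 | Seed 1.6 250615 | 1042 | ±13 | 1.2K | 4.8% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 103 | 165 | Pixtral Large | 1042 | ±8 | 2.5K | 5.1% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 104 | 124 | Kimi K2 0905 Turbo | 1041 | ±4 | 6.8K | 12.4% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50 |
| 105 | 143 | Gemini 2.0 Flash | 1038 | ±6 | 3.7K | 8.9% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 106 | 101 | Gemini 2.5 Flash Lite | 1038 | ±5 | 12.8K | 12.6% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 107 | 153 | OpenAI o1 | 1037 | ±15 | 1.2K | 4.8% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 108 | 157 | GPT-5 Nano | 1035 | ±6 | 3.8K | 10.6% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 109 | 101 | DeepSeek V3 (Turbo) | 1034 | ±11 | 1K | 5.9% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 110 | 119 | ERNIE 4.5 300B A47B | 1032 | ±6 | 6.1K | 8.7% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 111 | 95 | DeepSeek-R1 Turbo | 1032 | ±10 | 1.4K | 5.5% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 112 | 71 | DeepSeek V3.1 | 1029 | ±10 | 1.1K | 4.5% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 113 | 129 | Command A | 1029 | ±5 | 11K | 8.4% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 114 | 126 | DeepSeek V3 | 1028 | ±7 | 5.9K | 5.7% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 115 | 86 | Amazon Nova 2 Lite | 1027 | ±10 | 2.6K | 7.9% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 116 | 113 | GLM 4.5 AirX | 1020 | ±10 | 805 | 9.0% | 3.3% | 75 tps | 1.2s | 131K | $1.10 | $4.50 |
| 117 | 133 | GPT-4.1 nano | 1020 | ±4 | 8.8K | 9.7% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 118 | 124 | Qwen3 235B A22B Thinking 2507 | 1018 | ±11 | 1.1K | 4.2% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 119 | 148 | OpenAI o3 | 1016 | ±11 | 1.3K | 4.6% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 120 | 129 | DeepSeek V3.1 Thinking | 1014 | ±7 | 3.9K | 14.0% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |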
Showing ranks 81 to 120 of 237 models.