Models


Last updated about 1 month ago

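| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 19 | Kimi K2.5 | 1291 | ±11 | 16.5K | 3.4% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 2 | 31 | Qwen3 Next 80B A3B Instruct | 1231 | ±5 | 8.8K | 5.8% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 3 | 31 | MiniMax M2.5 Lightning | 1228 | ±14 | 1.7K | 3.2% | 1.5% | 51 tps | 2.0s | 205K | $0.60 | $2.40 |
| 4 | 36 | Qwen3.5 122B A17B | 1216 | ±15 | 1.9K | 3.1% | 1.5% | 82 tps | 1.4s | 256K | $0.40 | $3.20 |
| 5 | 36 | Qwen3.5 27B | 1211 | ±16 | 910 | 4.7% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 6 | 36 | Kimi K2.5 Instant | 1210 | ±8 | 1.8K | 3.2% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 7 | 49 | Kimi K2 Thinking Turbo | 1192 | ±6 | 20.3K | 3.4% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 8 | 60 | DeepSeek V3.2 Thinking | 1178 | ±9 | 23.3K | 4.0% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 9 | 69 | gpt-oss-120b | 1165 | ±5 | 19.2K | 5.0% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 10 | 77 | Mistral Large 3 | 1131 | ±8 | 5.4K | 5.8% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 11 | 90 | DeepSeek V3.2 Exp Chat | 1107 | ±4 | 5.5K | 6.1% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 12 | 90 | Step 3.5 Flash | 1102 | ±24 | 810 | 3.6% | 2.2% | 109 tps | 0.6s | 256K | $0.05 | $0.15 |
| 13 | 98 | Qwen3 235B A22B | 1093 | ±6 | 4.5K | 8.0% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 14 | 98 | DeepSeek V3.2 Exp Thinking | 1089 | ±7 | 5.9K | 3.5% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 15 | 112 | Kimi K2 Fast | 1073 | ±5 | 35K | 6.4% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 16 | 112 | gpt-oss-20b | 1066 | ±6 | 7.7K | 7.1% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 17 | 119 | DeepSeek V3.1 Terminus Thinking | 1061 | ±9 | 2.9K | 9.4% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 18 | 119 | Qwen3 32B Fast | 1052 | ±6 | 11.4K | 5.2% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 19 | 135 | QwQ 32B | 1035 | ±4 | 11.6K | 6.4% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 20 | 135 | Qwen3 VL 30B A3B Instruct | 1034 | ±15 | 1K | 6.7% | 1.8% | 80 tps | 2.6s | 129K | $0.18 | $0.67 |
| 21 | 135 | DeepSeek V3 | 1032 | ±5 | 17.6K | 3.7% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 22 | 144 | Command A | 1024 | ±4 | 22.4K | 4.8% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 23 | 148 | Nemotron 3 Nano (Thinking) | 1012 | ±13 | 2K | 6.7% | 2.0% | 200 tps | 0.5s | 256K | $0 | $0 |
| 24 | 148 | DeepSeek-R1 Turbo | 1003 | ±9 | 2.5K | 5.6% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 25 | 148 | Qwen3 30B A3B | 994 | ±8 | 6.3K | 6.9% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 26 | 159 | Mistral Small 3.1 24B Instruct | 980 | ±11 | 2.9K | 4.3% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 |
| 27 | 159 | DeepSeek-R1 0528 | 979 | ±6 | 5.5K | 3.5% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 28 | 167 | Qwen 2.5 32B Instruct | 972 | ±7 | 4.1K | 6.5% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 29 | 167 | Llama 4 Maverick | 971 | ±5 | 21K | 5.0% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 30 | 167 | Pixtral Large | 969 | ±14 | 3.5K | 3.9% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 31 | 167 | Qwen3 VL 30B A3B Thinking | 967 | ±11 | 1.9K | 8.9% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 32 | 167 | Qwen3 14B | 962 | ±8 | 5.3K | 8.4% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 33 | 167 | Llama 3.1 8B Turbo | 958 | ±14 | 2.2K | 2.0% | 2.1% | 650 tps | 0.5s | 128K | $0.13 | $0.14 |
| 34 | 179 | DeepSeek-R1 | 939 | ±6 | 6.4K | 4.3% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 35 | 179 | DeepSeek Prover v2 | 935 | ±10 | 1.3K | 3.0% | 5.2% | 14 tps | 1.3s | 164K | $0.40 | $1.56 |
| 36 | 189 | Codestral | 931 | ±19 | 855 | 6.0% | 5.2% | 151 tps | 0.9s | 262K | $0.15 | $0.45 |
| 37 | 189 | Mistral Small 3.1 | 925 | ±14 | 2.4K | 4.0% | 7.4% | 13 tps | 2.6s | 32K | $0.17 | $0.28 |
| 38 | 189 | Open Mistral Nemo | 918 | ±21 | 1.7K | 4.5% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 |
| 39 | 201 | GLM 4.6V Flash | 892 | ±10 | 2.5K | 7.6% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 |
| 40 | 201 | Llama 3.2 11B Instruct | 885 | ±15 | 2.1K | 4.1% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 |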
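
One way to read the Confidence Interval column is to check whether two models' score ranges overlap before treating a score gap as meaningful. The sketch below is only an illustration under stated assumptions: it treats the "±" value as a symmetric half-width around the VIBE Score and uses non-overlap as a rough heuristic. The page does not describe how the intervals are computed, and the `Entry` class and `intervals_overlap` helper are hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row, reduced to the fields needed for the comparison."""
    name: str
    score: int       # VIBE Score column
    half_width: int  # the "±" value, ASSUMED to be a symmetric half-width

    @property
    def low(self) -> int:
        return self.score - self.half_width

    @property
    def high(self) -> int:
        return self.score + self.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score ranges share at least one point."""
    return a.low <= b.high and b.low <= a.high


# Values copied from the leaderboard rows above.
deepseek_exp_chat = Entry("DeepSeek V3.2 Exp Chat", 1107, 4)
qwen3_235b = Entry("Qwen3 235B A22B", 1093, 6)
deepseek_exp_thinking = Entry("DeepSeek V3.2 Exp Thinking", 1089, 7)

for a, b in [(qwen3_235b, deepseek_exp_thinking), (deepseek_exp_chat, qwen3_235b)]:
    verdict = "overlap" if intervals_overlap(a, b) else "do not overlap"
    print(f"{a.name} [{a.low}, {a.high}] and {b.name} [{b.low}, {b.high}] {verdict}")
```

Under these assumptions, Qwen3 235B A22B and DeepSeek V3.2 Exp Thinking have overlapping ranges, so their 4-point gap is within the stated uncertainty, while DeepSeek V3.2 Exp Chat sits clear of both.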
Showing 40 of 76 models.