Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1291
Kimi K2.5
1231
Qwen3 Next 80B A3B Instruct
1228
MiniMax M2.5 Lightning
1216
Qwen3.5 122B A17B
1211
Qwen3.5 27B
1210
Kimi K2.5 Instant
1192
Kimi K2 Thinking Turbo
1178
DeepSeek V3.2 Thinking
1165
gpt-oss-120b
1134
Grok 3 Beta
1131
Mistral Large 3
1107
DeepSeek V3.2 Exp Chat
1102
Step 3.5 Flash
1093
Qwen3 235B A22B
1089
DeepSeek V3.2 Exp Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
119Kimi K2.51291±1116.5K3.4%6.5%33 tps1.7s262K$0.34$2.57
231Qwen3 Next 80B A3B Instruct1231±58.8K5.8%0.6%84 tps1.1s256K$0.20$1.42
331MiniMax M2.5 Lightning1228±141.7K3.2%1.5%51 tps2.0s205K$0.60$2.40
436Qwen3.5 122B A17B1216±151.9K3.1%1.5%82 tps1.4s256K$0.40$3.20
536Qwen3.5 27B1211±169104.7%3.7%55 tps2.6s256K$0.30$2.40
636Kimi K2.5 Instant1210±81.8K3.2%2.9%32 tps3.0s262K$0.50$3.00
749Kimi K2 Thinking Turbo1192±620.3K3.4%2.0%75 tps1.4s262K$1.15$8.00
860DeepSeek V3.2 Thinking1178±923.3K4.0%9.0%30 tps2.6s131K$0.28$0.42
969gpt-oss-120b1165±519.2K5.0%0.7%213 tps0.5s131K$0.11$0.50
1097Grok 3 Beta1134±92K0.8%<0.1%58 tps0.8s131K$3.00$15.00
1177Mistral Large 31131±85.4K5.8%2.1%51 tps1.0s256K$0.50$1.50
1290DeepSeek V3.2 Exp Chat1107±45.5K6.1%2.6%29 tps1.5s131K$0.27$0.39
1390Step 3.5 Flash1102±248103.6%2.2%109 tps0.6s256K$0.05$0.15
1498Qwen3 235B A22B1093±64.5K8.0%5.3%71 tps0.9s41K$0.23$0.63
1598DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
16112Kimi K2 Fast1073±535K6.4%0.8%365 tps0.5s131K$1.00$3.00
17112gpt-oss-20b1066±67.7K7.1%0.5%216 tps0.5s131K$0.06$0.26
18119DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
19151Llama 3 8B Turbo1059±246001.6%<0.1%97 tps0.1s8K$0.12$0.13
20119Qwen3 32B Fast1052±611.4K5.2%11.6%30 tps3.1s41K$0.10$0.25
21164Llama 3 70B Turbo1037±64.3K1.0%<0.1%31 tps0.0s8K$0.73$0.83
22135QwQ 32B1035±411.6K6.4%5.4%41 tps2.1s16K$0.43$0.56
23174Qwen 2.5 72B Turbo1035±226705.0%<0.1%84 tps0.8s131K$0.60$0.60
24135Qwen3 VL 30B A3B Instruct1034±151K6.7%1.8%80 tps2.6s129K$0.18$0.67
25135DeepSeek V31032±517.6K3.7%0.9%69 tps1.1s64K$0.59$1.49
26144Command A1024±422.4K4.8%2.2%42 tps0.8s256K$2.00$7.33
27148Nemotron 3 Nano (Thinking)1012±132K6.7%2.0%200 tps0.5s256K$0$0
28189K2 Think1005±161.4K5.6%<0.1%418 tps2.8sN/A$0$0
29148DeepSeek-R1 Turbo1003±92.5K5.6%2.6%29 tps1.8s64K$2.85$4.75
30148Qwen3 30B A3B994±86.3K6.9%5.1%163 tps1.0s41K$0.06$0.21
31159Mistral Small 3.1 24B Instruct980±112.9K4.3%7.5%15 tps2.4s131K$0.06$0.18
32159DeepSeek-R1 0528979±65.5K3.5%1.3%93 tps0.5s64K$1.60$3.67
33167Qwen 2.5 32B Instruct972±74.1K6.5%2.5%48 tps1.0s131K$0.21$0.25
34167Llama 4 Maverick971±521K5.0%1.2%88 tps2.4s1M$0.23$0.83
35167Pixtral Large969±143.5K3.9%2.5%57 tps1.3s128K$1.50$4.50
36167Qwen3 VL 30B A3B Thinking967±111.9K8.9%4.5%84 tps2.9s127K$0.20$1.47
37167Qwen3 14B962±85.3K8.4%1.7%109 tps0.8s41K$0.04$0.15
38167Llama 3.1 8B Turbo958±142.2K2.0%2.1%650 tps0.5s128K$0.13$0.14
39230NVIDIA Llama 3.3 Nemotron Super 49B v1942±93.6K2.3%<0.1%13 tpsN/A131K$0.07$0.20
40179DeepSeek-R1939±66.4K4.3%0.8%133 tps0.6s64K$0.91$3.07
View All (99 models)