Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1169
Kimi K2.5
1160
Kimi K2 Thinking Turbo
1141
Qwen3 Next 80B A3B Instruct
1128
MiniMax M2.5 Lightning
1124
Qwen3.5 122B A17B
1124
Kimi K2.5 Instant
1117
DeepSeek V3.2 Thinking
1108
Mistral Large 3
1086
gpt-oss-120b
1082
Qwen3.5 27B
1079
Step 3.5 Flash
1079
DeepSeek V3.2 Exp Chat
1074
Qwen3 235B A22B
1063
DeepSeek V3.2 Exp Thinking
1047
DeepSeek V3.1 Terminus Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
133Kimi K2.51169±65.3K2.8%6.5%33 tps1.7s262K$0.34$2.57
244Kimi K2 Thinking Turbo1160±813.2K3.5%2.0%75 tps1.4s262K$1.15$8.00
333Qwen3 Next 80B A3B Instruct1141±47.6K7.7%0.6%84 tps1.1s256K$0.20$1.42
479MiniMax M2.5 Lightning1128±149952.5%1.5%51 tps2.0s205K$0.60$2.40
552Qwen3.5 122B A17B1124±171.1K3.2%1.5%82 tps1.4s256K$0.40$3.20
637Kimi K2.5 Instant1124±131.4K2.4%2.9%32 tps3.0s262K$0.50$3.00
756DeepSeek V3.2 Thinking1117±610K3.8%9.0%30 tps2.6s131K$0.28$0.42
865Mistral Large 31108±74K6.3%2.1%51 tps1.0s256K$0.50$1.50
948gpt-oss-120b1086±415.1K7.5%0.7%213 tps0.5s131K$0.11$0.50
1081Qwen3.5 27B1082±265502.7%3.7%55 tps2.6s256K$0.30$2.40
1148Step 3.5 Flash1079±236452.3%2.2%109 tps0.6s256K$0.05$0.15
1265DeepSeek V3.2 Exp Chat1079±44.3K8.8%2.6%29 tps1.5s131K$0.27$0.39
1386Qwen3 235B A22B1074±102.8K14.4%5.3%71 tps0.9s41K$0.23$0.63
1495DeepSeek V3.2 Exp Thinking1063±85K3.4%7.2%26 tps3.0s131K$0.28$0.42
15106DeepSeek V3.1 Terminus Thinking1047±72.5K11.6%5.9%27 tps1.8s131K$0.56$1.68
16165Pixtral Large1042±82.5K5.1%2.5%57 tps1.3s128K$1.50$4.50
1795DeepSeek-R1 Turbo1032±101.4K5.5%2.6%29 tps1.8s64K$2.85$4.75
18129Command A1029±511K8.4%2.2%42 tps0.8s256K$2.00$7.33
19126DeepSeek V31028±75.9K5.7%0.9%69 tps1.1s64K$0.59$1.49
20139Qwen3 VL 30B A3B Instruct1012±171K6.5%1.8%80 tps2.6s129K$0.18$0.67
21113Kimi K2 Fast1006±426.2K13.8%0.8%365 tps0.5s131K$1.00$3.00
22170Llama 3.1 8B Turbo998±121.1K2.8%2.1%650 tps0.5s128K$0.13$0.14
23133DeepSeek-R1 0528983±121.3K4.6%1.3%93 tps0.5s64K$1.60$3.67
24201Gemma 3 27B IT983±1590510.4%2.0%60 tps0.8s128K$0.17$0.29
25222Sky T1 32B Preview972±1680510.6%7.8%73 tps0.6s16K$0.12$0.18
26177Mistral Small 3.1 24B Instruct966±121K10.6%7.5%15 tps2.4s131K$0.06$0.18
27161Mistral Small 3.1960±1691511.2%7.4%13 tps2.6s32K$0.17$0.28
28161Llama 4 Maverick956±411.2K8.2%1.2%88 tps2.4s1M$0.23$0.83
29121QwQ 32B955±75K15.3%5.4%41 tps2.1s16K$0.43$0.56
30101gpt-oss-20b954±56.1K10.8%0.5%216 tps0.5s131K$0.06$0.26
31165Qwen3 VL 30B A3B Thinking949±81.5K11.2%4.5%84 tps2.9s127K$0.20$1.47
32194Llama 3.2 11B Instruct943±1474514.4%1.5%152 tps0.5s8K$0.16$0.16
33126Qwen3 30B A3B939±53.7K12.1%5.1%163 tps1.0s41K$0.06$0.21
34148DeepSeek-R1939±121.6K5.5%0.8%133 tps0.6s64K$0.91$3.07
3586Nemotron 3 Nano (Thinking)938±141.3K7.6%2.0%200 tps0.5s256K$0$0
36133Qwen3 14B933±112.7K17.1%1.7%109 tps0.8s41K$0.04$0.15
37121Qwen3 32B Fast932±54.5K12.9%11.6%30 tps3.1s41K$0.10$0.25
38153Qwen 2.5 32B Instruct916±91.9K18.0%2.5%48 tps1.0s131K$0.21$0.25
39214Llama 3.3 70B Instruct Turbo914±2464011.7%2.0%78 tps1.0s131K$0.88$0.88
40246DeepSeek-R1 Distill Llama 70B907±235357.0%3.6%27 tps1.6s32K$0.73$0.95
41186Gemma 3 27B902±1861513.4%1.8%35 tps1.1s66K$0.06$0.10
42214C4AI Aya Expanse 32B889±111.3K10.0%1.5%43 tps0.5s128K$0.50$1.50
43225Command R872±1677012.5%5.8%54 tps0.6s128K$0.30$0.99
44229Ministral 8B868±1580013.5%1.4%177 tps0.4s128K$0.14$0.14
45246Mixtral 8x22B Instruct866±3157010.2%1.8%142 tps0.7s66K$0.45$0.45
46194Mistral Small 3 24B Instruct858±1858512.7%2.6%77 tps0.6s33K$0.07$0.14
47260Hermes 4 405B Reasoning FP8844±122.1K18.8%3.6%32 tps0.8s131K$1.00$3.00
48186GLM 4.6V Flash837±92K9.0%3.7%64 tps2.1s128K$0.04$0.40
49235Command R+828±2166510.1%2.8%36 tps0.7s128K$2.08$9.45
50274Pixtral 12B828±112.2K6.8%2.2%101 tps1.2s131K$0.08$0.08
51235Gemma 3 4B827±111.2K9.9%1.3%138 tps0.7s131K$0.02$0.04
52214Qwen 2.5 7B823±1568511.6%3.7%40 tps1.9s131K$0.08$0.27
53225Open Mistral Nemo818±2459012.6%1.5%171 tps0.5s131K$0.15$0.15
54256Mixtral 8x7B Instruct815±2851012.1%0.2%79 tps0.7s33K$0.23$0.31
55225Command R 7B808±131.3K12.9%1.1%76 tps0.4s128K$0.04$0.15
56235Mixtral 8x7B801±2952512.5%2.2%142 tps0.6s33K$0.23$0.23
57260Mistral Small799±2346512.3%1.7%142 tps0.6s32K$0.43$1.30
58271Hermes 3 405B Instruct792±2149010.1%2.3%20 tps1.1s131K$0.80$0.80
59256Phi 4780±1756511.7%5.1%28 tps1.3s128K$0.10$0.32
60246Ministral 3B772±1778512.3%0.8%248 tps0.4s131K$0.08$0.08
61256Gemma 3 1B758±2396513.1%0.6%176 tps1.0s33K$0.06$0.10
62201Mistral Small 24B Instruct752±2248015.0%1.5%84 tps0.4s33K$0.80$0.80
63253Gemma 2 27B727±1558511.4%1.4%44 tps1.4s8K$0.80$0.80
64265Mixtral-8x7B Instruct v0.1675±3946512.3%1.3%54 tps0.4s33K$0.60$0.60
65288Qwen 2.5 VL 3B Instruct670±122.5K9.2%3.0%44 tps2.5s128K$0.21$0.63
66281MythoMax L2 13B569±3278515.1%1.2%22 tps1.1s4K$0.18$0.18
67291Phi 4 Mini Reasoning475±121.9K22.5%9.7%30 tps0.9s128K$0.07$0.30
Show Less