More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

Last updated about 1 month ago

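| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | 22 | MiniMax M2.7-highspeed | 1207 | ±10 | 1.1K | 2.1% | 2.3% | 50 tps | 2.1s | 205K | $0.60 | $2.40 |
| 2 | 37 | Kimi K2.5 Instant | 1171 | ±4 | 6.2K | 1.8% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 3 | 48 | Step 3.5 Flash | 1164 | ±5 | 4K | 1.5% | 2.2% | 109 tps | 0.6s | 256K | $0.05 | $0.15 |
| 4 | 33 | Qwen3 Next 80B A3B Instruct | 1161 | ±2 | 24.9K | 3.8% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 5 | 52 | Qwen3.5 122B A17B | 1151 | ±4 | 4.7K | 1.6% | 1.5% | 82 tps | 1.4s | 256K | $0.40 | $3.20 |
| 6 | 33 | Kimi K2.5 | 1151 | ±3 | 32.5K | 1.8% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 7 | 48 | gpt-oss-120b | 1144 | ±2 | 40.7K | 3.7% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 8 | 44 | Kimi K2 Thinking Turbo | 1137 | ±3 | 29.8K | 2.5% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 9 | 56 | DeepSeek V3.2 Thinking | 1127 | ±3 | 37.6K | 2.6% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 10 | 86 | Nemotron 3 Nano (Thinking) | 1127 | ±3 | 7.5K | 2.4% | 2.0% | 200 tps | 0.5s | 256K | $0 | $0 |
| 11 | 65 | DeepSeek V3.2 Exp Chat | 1124 | ±3 | 14.3K | 4.0% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 12 | 65 | Mistral Large 3 | 1122 | ±3 | 14.3K | 3.3% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 13 | 79 | MiniMax M2.5 Lightning | 1121 | ±4 | 5.6K | 1.3% | 1.5% | 51 tps | 2.0s | 205K | $0.60 | $2.40 |
| 14 | 81 | Qwen3.5 27B | 1097 | ±6 | 2.3K | 2.4% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 15 | 101 | gpt-oss-20b | 1093 | ±2 | 20.3K | 4.6% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 16 | 86 | Qwen3 235B A22B | 1090 | ±3 | 11.9K | 5.1% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 17 | 106 | DeepSeek V3.1 Terminus Thinking | 1071 | ±3 | 8.5K | 4.9% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 18 | 95 | DeepSeek-R1 Turbo | 1069 | ±3 | 7.3K | 2.9% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 19 | 95 | DeepSeek V3.2 Exp Thinking | 1068 | ±3 | 9.9K | 2.7% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 20 | 121 | QwQ 32B | 1068 | ±2 | 28.3K | 4.4% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 21 | 121 | Qwen3 32B Fast | 1059 | ±2 | 25K | 3.8% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 22 | 126 | Qwen3 30B A3B | 1051 | ±2 | 15.1K | 4.4% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 23 | 129 | Command A | 1039 | ±2 | 82.1K | 2.2% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 24 | 133 | Qwen3 14B | 1037 | ±2 | 12.5K | 5.7% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 25 | 113 | Kimi K2 Fast | 1037 | ±2 | 107.2K | 4.5% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 26 | 126 | DeepSeek V3 | 1035 | ±2 | 57.9K | 1.7% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 27 | 170 | Llama 3.1 8B Turbo | 1027 | ±3 | 8.2K | 1.4% | 2.1% | 650 tps | 0.5s | 128K | $0.13 | $0.14 |
| 28 | 139 | Qwen3 VL 30B A3B Instruct | 1022 | ±7 | 2.1K | 4.8% | 1.8% | 80 tps | 2.6s | 129K | $0.18 | $0.67 |
| 29 | 133 | DeepSeek-R1 0528 | 1020 | ±3 | 12.3K | 2.1% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 30 | 165 | Qwen3 VL 30B A3B Thinking | 1020 | ±4 | 3.5K | 6.5% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 31 | 153 | Qwen 2.5 32B Instruct | 1011 | ±3 | 16.8K | 3.1% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 32 | 161 | Mistral Small 3.1 | 1006 | ±3 | 9.7K | 2.0% | 7.4% | 13 tps | 2.6s | 32K | $0.17 | $0.28 |
| 33 | 148 | DeepSeek-R1 | 1001 | ±3 | 13.5K | 2.4% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 34 | 161 | Llama 4 Maverick | 999 | ±1 | 73.4K | 2.5% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 35 | 165 | Pixtral Large | 994 | ±4 | 9.9K | 2.6% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 36 | 194 | Llama 3 70B | 993 | ±9 | 1.9K | 1.3% | 4.5% | 21 tps | 1.7s | 8K | $1.08 | $1.38 |
| 37 | 201 | Qwen 2.5 7B Turbo | 992 | ±7 | 2.6K | 2.8% | 0.5% | 125 tps | 0.4s | 131K | $0.30 | $0.30 |
| 38 | 186 | Gemma 3 27B | 983 | ±6 | 3.5K | 3.7% | 1.8% | 35 tps | 1.1s | 66K | $0.06 | $0.10 |
| 39 | 177 | Mistral Small 3.1 24B Instruct | 982 | ±2 | 11.2K | 1.8% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 |
| 40 | 179 | Llama 3.1 70B Instruct | 976 | ±14 | 925 | 2.6% | 6.3% | 30 tps | 0.8s | 128K | $0.17 | $0.22 |
| 41 | 194 | Mistral Small 3 24B Instruct | 972 | ±4 | 7.7K | 1.5% | 2.6% | 77 tps | 0.6s | 33K | $0.07 | $0.14 |
| 42 | 194 | Llama 3.2 11B Instruct | 967 | ±2 | 9.6K | 1.9% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 |
| 43 | 161 | DeepSeek Prover v2 | 961 | ±6 | 3.3K | 1.8% | 5.2% | 14 tps | 1.3s | 164K | $0.40 | $1.56 |
| 44 | 201 | Mistral Small 24B Instruct | 958 | ±4 | 6.8K | 2.1% | 1.5% | 84 tps | 0.4s | 33K | $0.80 | $0.80 |
| 45 | 225 | Open Mistral Nemo | 946 | ±3 | 7.3K | 2.1% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 |
| 46 | 235 | Hermes 2 Pro Llama 3 8B | 945 | ±2 | 8.8K | 1.0% | <0.1% | 76 tps | 1.0s | 131K | $0.08 | $0.09 |
| 47 | 225 | Command R 7B | 940 | ±3 | 14K | 1.9% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 |
| 48 | 201 | Gemma 3 27B IT | 938 | ±3 | 9.7K | 1.7% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 |
| 49 | 214 | Qwen 2.5 7B | 934 | ±4 | 7.5K | 2.3% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 |
| 50 | 186 | GLM 4.6V Flash | 933 | ±4 | 7.9K | 3.7% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 |
| 51 | 214 | C4AI Aya Expanse 32B | 930 | ±2 | 17.9K | 1.6% | 1.5% | 43 tps | 0.5s | 128K | $0.50 | $1.50 |
| 52 | 246 | Ministral 3B | 929 | ±3 | 8.6K | 2.2% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 |
| 53 | 240 | Mistral Nemo | 928 | ±3 | 4.2K | 1.2% | <0.1% | 112 tps | 0.4s | 131K | $0.07 | $0.13 |
| 54 | 240 | Llama 3.3 70B Instruct | 927 | ±11 | 900 | 2.7% | 5.3% | 28 tps | 1.3s | 128K | $0.38 | $0.55 |
| 55 | 214 | Llama 3.3 70B Instruct Turbo | 925 | ±5 | 4.2K | 3.3% | 2.0% | 78 tps | 1.0s | 131K | $0.88 | $0.88 |
| 56 | 222 | Sky T1 32B Preview | 923 | ±3 | 11.2K | 1.6% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 57 | 235 | Mixtral 8x7B | 923 | ±4 | 5.3K | 2.2% | 2.2% | 142 tps | 0.6s | 33K | $0.23 | $0.23 |
| 58 | 229 | Ministral 8B | 922 | ±3 | 7.7K | 2.5% | 1.4% | 177 tps | 0.4s | 128K | $0.14 | $0.14 |
| 59 | 225 | Command R | 920 | ±3 | 9.8K | 2.0% | 5.8% | 54 tps | 0.6s | 128K | $0.30 | $0.99 |
| 60 | 229 | Llama 3.1 8B | 915 | ±8 | 1.3K | 2.9% | 1.9% | 61 tps | 1.0s | 8K | $0.07 | $0.09 |
| 61 | 235 | Gemma 3 4B | 909 | ±2 | 12.6K | 1.9% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 |
| 62 | 256 | Gemma 3 1B | 909 | ±4 | 7.1K | 3.0% | 0.6% | 176 tps | 1.0s | 33K | $0.06 | $0.10 |
| 63 | 253 | Gemma 2 27B | 906 | ±3 | 6.9K | 1.8% | 1.4% | 44 tps | 1.4s | 8K | $0.80 | $0.80 |
| 64 | 235 | Command R+ | 902 | ±5 | 6.6K | 2.2% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 |
| 65 | 246 | Mixtral 8x22B Instruct | 900 | ±5 | 6K | 2.2% | 1.8% | 142 tps | 0.7s | 66K | $0.45 | $0.45 |
| 66 | 256 | Mixtral 8x7B Instruct | 894 | ±6 | 6.2K | 2.0% | 0.2% | 79 tps | 0.7s | 33K | $0.23 | $0.31 |
| 67 | 256 | Phi 4 | 890 | ±3 | 8.4K | 1.8% | 5.1% | 28 tps | 1.3s | 128K | $0.10 | $0.32 |
| 68 | 260 | Mistral Small | 881 | ±3 | 4.8K | 2.5% | 1.7% | 142 tps | 0.6s | 32K | $0.43 | $1.30 |
| 69 | 265 | Mixtral-8x7B Instruct v0.1 | 880 | ±5 | 5.5K | 2.4% | 1.3% | 54 tps | 0.4s | 33K | $0.60 | $0.60 |
| 70 | 246 | DeepSeek-R1 Distill Llama 70B | 874 | ±5 | 6.5K | 3.4% | 3.6% | 27 tps | 1.6s | 32K | $0.73 | $0.95 |
| 71 | 260 | Hermes 4 405B Reasoning FP8 | 862 | ±4 | 6.8K | 8.0% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 72 | 271 | Hermes 3 405B Instruct | 838 | ±3 | 6K | 1.8% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 |
| 73 | 281 | Gemma 2 9B | 831 | ±9 | 1.6K | 3.7% | <0.1% | 100 tps | 0.4s | 8K | $0.09 | $0.09 |
| 74 | 281 | MythoMax L2 13B | 814 | ±4 | 9.8K | 2.5% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |
| 75 | 281 | Goliath 120B | 794 | ±5 | 3.1K | 2.5% | 2.7% | 21 tps | 2.2s | 6K | $6.56 | $9.38 |
| 76 | 285 | Phi 4 Mini Instruct | 778 | ±5 | 4.1K | 3.2% | 7.4% | 40 tps | 1.1s | 128K | $0.07 | $0.30 |
| 77 | 274 | Pixtral 12B | 755 | ±1 | 24.6K | 5.7% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 78 | 287 | Phi 4 Reasoning | 697 | ±8 | 4K | 3.5% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 |
| 79 | 291 | Phi 4 Mini Reasoning | 635 | ±4 | 7.9K | 7.9% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |
| 80 | 288 | Qwen 2.5 VL 3B Instruct | 629 | ±7 | 5.1K | 6.8% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |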
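
When reading neighboring rows, it can help to check whether two models' score intervals overlap before treating a gap as meaningful. The sketch below is only an illustration: it assumes the Confidence Interval column is a symmetric half-width around the VIBE Score, which the page does not state, and the `Entry` type and `intervals_overlap` helper are hypothetical names, not part of any Yupp API. Overlap is a rough heuristic here, not a formal significance test.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    """One leaderboard row, reduced to the fields needed here (hypothetical type)."""
    name: str
    score: float
    half_width: float  # the "±" value, ASSUMED to be a symmetric half-width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when [score - hw, score + hw] of a and b share at least one point."""
    return abs(a.score - b.score) <= a.half_width + b.half_width


# Values copied from the leaderboard rows.
minimax = Entry("MiniMax M2.7-highspeed", 1207, 10)
kimi_instant = Entry("Kimi K2.5 Instant", 1171, 4)
step_flash = Entry("Step 3.5 Flash", 1164, 5)
qwen3_next = Entry("Qwen3 Next 80B A3B Instruct", 1161, 2)

print(intervals_overlap(minimax, kimi_instant))  # gap of 36 vs. combined width 14
print(intervals_overlap(step_flash, qwen3_next))  # gap of 3 vs. combined width 7
```

Under that assumption, the top entry is separated from the second by more than the combined interval widths, while ranks 3 and 4 are not clearly distinguishable by score alone.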