Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1291
Kimi K2.5
1231
Qwen3 Next 80B A3B Instruct
1228
MiniMax M2.5 Lightning
1216
Qwen3.5 122B A17B
1211
Qwen3.5 27B
1210
Kimi K2.5 Instant
1192
Kimi K2 Thinking Turbo
1178
DeepSeek V3.2 Thinking
1165
gpt-oss-120b
1134
Grok 3 Beta
1131
Mistral Large 3
1107
DeepSeek V3.2 Exp Chat
1102
Step 3.5 Flash
1093
Qwen3 235B A22B
1089
DeepSeek V3.2 Exp Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
119Kimi K2.51291±1116.5K3.4%6.5%33 tps1.7s262K$0.34$2.57
231Qwen3 Next 80B A3B Instruct1231±58.8K5.8%0.6%84 tps1.1s256K$0.20$1.42
331MiniMax M2.5 Lightning1228±141.7K3.2%1.5%51 tps2.0s205K$0.60$2.40
436Qwen3.5 122B A17B1216±151.9K3.1%1.5%82 tps1.4s256K$0.40$3.20
536Qwen3.5 27B1211±169104.7%3.7%55 tps2.6s256K$0.30$2.40
636Kimi K2.5 Instant1210±81.8K3.2%2.9%32 tps3.0s262K$0.50$3.00
749Kimi K2 Thinking Turbo1192±620.3K3.4%2.0%75 tps1.4s262K$1.15$8.00
860DeepSeek V3.2 Thinking1178±923.3K4.0%9.0%30 tps2.6s131K$0.28$0.42
969gpt-oss-120b1165±519.2K5.0%0.7%213 tps0.5s131K$0.11$0.50
1097Grok 3 Beta1134±92K0.8%<0.1%58 tps0.8s131K$3.00$15.00
1177Mistral Large 31131±85.4K5.8%2.1%51 tps1.0s256K$0.50$1.50
1290DeepSeek V3.2 Exp Chat1107±45.5K6.1%2.6%29 tps1.5s131K$0.27$0.39
1390Step 3.5 Flash1102±248103.6%2.2%109 tps0.6s256K$0.05$0.15
1498Qwen3 235B A22B1093±64.5K8.0%5.3%71 tps0.9s41K$0.23$0.63
1598DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
16112Kimi K2 Fast1073±535K6.4%0.8%365 tps0.5s131K$1.00$3.00
17112gpt-oss-20b1066±67.7K7.1%0.5%216 tps0.5s131K$0.06$0.26
18119DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
19151Llama 3 8B Turbo1059±246001.6%<0.1%97 tps0.1s8K$0.12$0.13
20119Qwen3 32B Fast1052±611.4K5.2%11.6%30 tps3.1s41K$0.10$0.25
21164Llama 3 70B Turbo1037±64.3K1.0%<0.1%31 tps0.0s8K$0.73$0.83
22135QwQ 32B1035±411.6K6.4%5.4%41 tps2.1s16K$0.43$0.56
23174Qwen 2.5 72B Turbo1035±226705.0%<0.1%84 tps0.8s131K$0.60$0.60
24135Qwen3 VL 30B A3B Instruct1034±151K6.7%1.8%80 tps2.6s129K$0.18$0.67
25135DeepSeek V31032±517.6K3.7%0.9%69 tps1.1s64K$0.59$1.49
26144Command A1024±422.4K4.8%2.2%42 tps0.8s256K$2.00$7.33
27148Nemotron 3 Nano (Thinking)1012±132K6.7%2.0%200 tps0.5s256K$0$0
28189K2 Think1005±161.4K5.6%<0.1%418 tps2.8sN/A$0$0
29148DeepSeek-R1 Turbo1003±92.5K5.6%2.6%29 tps1.8s64K$2.85$4.75
30148Qwen3 30B A3B994±86.3K6.9%5.1%163 tps1.0s41K$0.06$0.21
31159Mistral Small 3.1 24B Instruct980±112.9K4.3%7.5%15 tps2.4s131K$0.06$0.18
32159DeepSeek-R1 0528979±65.5K3.5%1.3%93 tps0.5s64K$1.60$3.67
33167Qwen 2.5 32B Instruct972±74.1K6.5%2.5%48 tps1.0s131K$0.21$0.25
34167Llama 4 Maverick971±521K5.0%1.2%88 tps2.4s1M$0.23$0.83
35167Pixtral Large969±143.5K3.9%2.5%57 tps1.3s128K$1.50$4.50
36167Qwen3 VL 30B A3B Thinking967±111.9K8.9%4.5%84 tps2.9s127K$0.20$1.47
37167Qwen3 14B962±85.3K8.4%1.7%109 tps0.8s41K$0.04$0.15
38167Llama 3.1 8B Turbo958±142.2K2.0%2.1%650 tps0.5s128K$0.13$0.14
39230NVIDIA Llama 3.3 Nemotron Super 49B v1942±93.6K2.3%<0.1%13 tpsN/A131K$0.07$0.20
40179DeepSeek-R1939±66.4K4.3%0.8%133 tps0.6s64K$0.91$3.07
41179DeepSeek Prover v2935±101.3K3.0%5.2%14 tps1.3s164K$0.40$1.56
42189Codestral931±198556.0%5.2%151 tps0.9s262K$0.15$0.45
43245NVIDIA Llama 3.1 Nemotron 70B928±75.3K2.0%<0.1%9 tps0.1s128K$0.33$0.39
44189Mistral Small 3.1925±142.4K4.0%7.4%13 tps2.6s32K$0.17$0.28
45189Open Mistral Nemo918±211.7K4.5%1.5%171 tps0.5s131K$0.15$0.15
46264Arcee AI Spotlight910±84.6K4.3%<0.1%121 tps0.4s131K$0.18$0.18
47264OLMo 3 32B Think910±254706.0%<0.1%84 tps0.6s66K$0.15$0.50
48264Llama 3.1 405B Instruct Turbo896±112K3.9%<0.1%26 tps0.8s131K$3.50$3.50
49264Arcee AI Virtuoso-Medium896±122K2.6%<0.1%3 tpsN/A131K$0.50$0.80
50201GLM 4.6V Flash892±102.5K7.6%3.7%64 tps2.1s128K$0.04$0.40
51280Arcee AI Blitz889±83K2.1%<0.1%6 tpsN/A33K$0.45$0.75
52201Llama 3.2 11B Instruct885±152.1K4.1%1.5%152 tps0.5s8K$0.16$0.16
53210Mistral Small 3 24B Instruct880±101.7K3.6%2.6%77 tps0.6s33K$0.07$0.14
54210Mistral Nemo875±159152.7%<0.1%112 tps0.4s131K$0.07$0.13
55210Qwen 2.5 7B Turbo870±256156.1%0.5%125 tps0.4s131K$0.30$0.30
56210Hermes 2 Pro Llama 3 8B864±211.8K2.5%<0.1%76 tps1.0s131K$0.08$0.09
57210Mistral Small 24B Instruct864±161.5K4.1%1.5%84 tps0.4s33K$0.80$0.80
58293Hermes 2 Mixtral 8x7B DPO863±171.2K1.3%<0.1%1 tpsN/A33K$0.60$0.60
59312Yi Large858±121.5K<0.1%<0.1%34 tpsN/A33K$1.50$1.50
60312Command Light856±161.1K4.9%<0.1%23 tpsN/A4K$0.10$0.20
61210Gemma 3 27B856±271.1K6.9%1.8%35 tps1.1s66K$0.06$0.10
62210Mixtral 8x7B855±181.3K5.1%2.2%142 tps0.6s33K$0.23$0.23
63210Mixtral 8x7B Instruct854±161.4K4.4%0.2%79 tps0.7s33K$0.23$0.31
64234Gemma 3 27B IT853±102.3K3.9%2.0%60 tps0.8s128K$0.17$0.29
65234Llama 3.3 70B Instruct Turbo851±191.2K6.0%2.0%78 tps1.0s131K$0.88$0.88
66234Command R 7B849±153.3K4.8%1.1%76 tps0.4s128K$0.04$0.15
67240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95
68324Typhoon 2 70B Instruct835±151.4K4.0%<0.1%19 tps0.1s8K$0.88$0.88
69324OLMo 2 0425 1B Instruct833±215602.6%<0.1%68 tps0.0s4K$0$0
70240Mixtral-8x7B Instruct v0.1832±231.3K4.6%1.3%54 tps0.4s33K$0.60$0.60
71240Qwen 2.5 7B831±172K5.1%3.7%40 tps1.9s131K$0.08$0.27
72240Sky T1 32B Preview829±142.4K4.5%7.8%73 tps0.6s16K$0.12$0.18
73240Ministral 8B825±172.2K5.5%1.4%177 tps0.4s128K$0.14$0.14
74240C4AI Aya Expanse 32B821±73.8K4.0%1.5%43 tps0.5s128K$0.50$1.50
75240Gemma 2 27B815±171.5K4.1%1.4%44 tps1.4s8K$0.80$0.80
76252Ministral 3B806±162.3K5.1%0.8%248 tps0.4s131K$0.08$0.08
77252Gemma 3 1B802±112K6.1%0.6%176 tps1.0s33K$0.06$0.10
78252Phi 4798±161.7K3.4%5.1%28 tps1.3s128K$0.10$0.32
79262Command R778±182.2K4.9%5.8%54 tps0.6s128K$0.30$0.99
80262Mistral Small770±121.2K4.5%1.7%142 tps0.6s32K$0.43$1.30
81262Hermes 4 405B Reasoning FP8759±112.7K12.8%3.6%32 tps0.8s131K$1.00$3.00
82262Goliath 120B754±247455.7%2.7%21 tps2.2s6K$6.56$9.38
83269Gemma 3 4B742±103.3K4.7%1.3%138 tps0.7s131K$0.02$0.04
84269Mixtral 8x22B Instruct738±171.4K5.6%1.8%142 tps0.7s66K$0.45$0.45
85269Command R+738±151.6K5.6%2.8%36 tps0.7s128K$2.08$9.45
86361Command734±187654.4%<0.1%25 tpsN/A4K$0.83$1.33
87269Pixtral 12B722±213K6.3%2.2%101 tps1.2s131K$0.08$0.08
88374Mythalion 13B709±101.1K1.3%<0.1%63 tps0.5s4K$0.56$1.13
89276Hermes 3 405B Instruct702±201.4K4.1%2.3%20 tps1.1s131K$0.80$0.80
90374Phi 4 Multimodal Instruct697±162.1K6.8%<0.1%17 tps1.4s128K$0.03$0.05
91386DeepSeek-R1 Distill Qwen 7B633±195655.0%<0.1%0 tpsN/A131K$0.05$0.10
92390Llema 7B601±218504.5%<0.1%1 tps15.0s4K$0.80$1.20
93279MythoMax L2 13B600±212.3K5.8%1.2%22 tps1.1s4K$0.18$0.18
94279Phi 4 Mini Instruct599±211K7.1%7.4%40 tps1.1s128K$0.07$0.30
95279Phi 4 Reasoning573±172.1K5.5%21.0%29 tps1.0s33K$0.06$0.25
96284Qwen 2.5 VL 3B Instruct523±254.1K6.1%3.0%44 tps2.5s128K$0.21$0.63
97399DeepSeek-R1 Distill Qwen 1.5B481±197305.2%<0.1%20 tps0.0s131K$0.18$0.18
98284CodeLlama 7B Instruct Solidity463±544858.5%3.6%33 tps0.7s16K$0.80$1.20
99286Phi 4 Mini Reasoning447±153.4K12.0%9.7%30 tps0.9s128K$0.07$0.30
Show Less