Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

447
Phi 4 Mini Reasoning
463
CodeLlama 7B Instruct Solidity
523
Qwen 2.5 VL 3B Instruct
573
Phi 4 Reasoning
599
Phi 4 Mini Instruct
600
MythoMax L2 13B
702
Hermes 3 405B Instruct
722
Pixtral 12B
738
Command R+
738
Mixtral 8x22B Instruct
742
Gemma 3 4B
754
Goliath 120B
759
Hermes 4 405B Reasoning FP8
770
Mistral Small
778
Command R

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1286Phi 4 Mini Reasoning447±153.4K12.0%9.7%30 tps0.9s128K$0.07$0.30
2284CodeLlama 7B Instruct Solidity463±544858.5%3.6%33 tps0.7s16K$0.80$1.20
3284Qwen 2.5 VL 3B Instruct523±254.1K6.1%3.0%44 tps2.5s128K$0.21$0.63
4279Phi 4 Reasoning573±172.1K5.5%21.0%29 tps1.0s33K$0.06$0.25
5279Phi 4 Mini Instruct599±211K7.1%7.4%40 tps1.1s128K$0.07$0.30
6279MythoMax L2 13B600±212.3K5.8%1.2%22 tps1.1s4K$0.18$0.18
7276Hermes 3 405B Instruct702±201.4K4.1%2.3%20 tps1.1s131K$0.80$0.80
8269Pixtral 12B722±213K6.3%2.2%101 tps1.2s131K$0.08$0.08
9269Command R+738±151.6K5.6%2.8%36 tps0.7s128K$2.08$9.45
10269Mixtral 8x22B Instruct738±171.4K5.6%1.8%142 tps0.7s66K$0.45$0.45
11269Gemma 3 4B742±103.3K4.7%1.3%138 tps0.7s131K$0.02$0.04
12262Goliath 120B754±247455.7%2.7%21 tps2.2s6K$6.56$9.38
13262Hermes 4 405B Reasoning FP8759±112.7K12.8%3.6%32 tps0.8s131K$1.00$3.00
14262Mistral Small770±121.2K4.5%1.7%142 tps0.6s32K$0.43$1.30
15262Command R778±182.2K4.9%5.8%54 tps0.6s128K$0.30$0.99
16252Phi 4798±161.7K3.4%5.1%28 tps1.3s128K$0.10$0.32
17252Gemma 3 1B802±112K6.1%0.6%176 tps1.0s33K$0.06$0.10
18252Ministral 3B806±162.3K5.1%0.8%248 tps0.4s131K$0.08$0.08
19240Gemma 2 27B815±171.5K4.1%1.4%44 tps1.4s8K$0.80$0.80
20240C4AI Aya Expanse 32B821±73.8K4.0%1.5%43 tps0.5s128K$0.50$1.50
21240Ministral 8B825±172.2K5.5%1.4%177 tps0.4s128K$0.14$0.14
22240Sky T1 32B Preview829±142.4K4.5%7.8%73 tps0.6s16K$0.12$0.18
23240Qwen 2.5 7B831±172K5.1%3.7%40 tps1.9s131K$0.08$0.27
24240Mixtral-8x7B Instruct v0.1832±231.3K4.6%1.3%54 tps0.4s33K$0.60$0.60
25240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95
26234Command R 7B849±153.3K4.8%1.1%76 tps0.4s128K$0.04$0.15
27234Llama 3.3 70B Instruct Turbo851±191.2K6.0%2.0%78 tps1.0s131K$0.88$0.88
28234Gemma 3 27B IT853±102.3K3.9%2.0%60 tps0.8s128K$0.17$0.29
29210Mixtral 8x7B Instruct854±161.4K4.4%0.2%79 tps0.7s33K$0.23$0.31
30210Mixtral 8x7B855±181.3K5.1%2.2%142 tps0.6s33K$0.23$0.23
31210Gemma 3 27B856±271.1K6.9%1.8%35 tps1.1s66K$0.06$0.10
32210Mistral Small 24B Instruct864±161.5K4.1%1.5%84 tps0.4s33K$0.80$0.80
33210Hermes 2 Pro Llama 3 8B864±211.8K2.5%<0.1%76 tps1.0s131K$0.08$0.09
34210Qwen 2.5 7B Turbo870±256156.1%0.5%125 tps0.4s131K$0.30$0.30
35210Mistral Nemo875±159152.7%<0.1%112 tps0.4s131K$0.07$0.13
36210Mistral Small 3 24B Instruct880±101.7K3.6%2.6%77 tps0.6s33K$0.07$0.14
37201Llama 3.2 11B Instruct885±152.1K4.1%1.5%152 tps0.5s8K$0.16$0.16
38201GLM 4.6V Flash892±102.5K7.6%3.7%64 tps2.1s128K$0.04$0.40
39189Open Mistral Nemo918±211.7K4.5%1.5%171 tps0.5s131K$0.15$0.15
40189Mistral Small 3.1925±142.4K4.0%7.4%13 tps2.6s32K$0.17$0.28
41189Codestral931±198556.0%5.2%151 tps0.9s262K$0.15$0.45
42179DeepSeek Prover v2935±101.3K3.0%5.2%14 tps1.3s164K$0.40$1.56
43179DeepSeek-R1939±66.4K4.3%0.8%133 tps0.6s64K$0.91$3.07
44167Llama 3.1 8B Turbo958±142.2K2.0%2.1%650 tps0.5s128K$0.13$0.14
45167Qwen3 14B962±85.3K8.4%1.7%109 tps0.8s41K$0.04$0.15
46167Qwen3 VL 30B A3B Thinking967±111.9K8.9%4.5%84 tps2.9s127K$0.20$1.47
47167Pixtral Large969±143.5K3.9%2.5%57 tps1.3s128K$1.50$4.50
48167Llama 4 Maverick971±521K5.0%1.2%88 tps2.4s1M$0.23$0.83
49167Qwen 2.5 32B Instruct972±74.1K6.5%2.5%48 tps1.0s131K$0.21$0.25
50159DeepSeek-R1 0528979±65.5K3.5%1.3%93 tps0.5s64K$1.60$3.67
51159Mistral Small 3.1 24B Instruct980±112.9K4.3%7.5%15 tps2.4s131K$0.06$0.18
52148Qwen3 30B A3B994±86.3K6.9%5.1%163 tps1.0s41K$0.06$0.21
53148DeepSeek-R1 Turbo1003±92.5K5.6%2.6%29 tps1.8s64K$2.85$4.75
54148Nemotron 3 Nano (Thinking)1012±132K6.7%2.0%200 tps0.5s256K$0$0
55144Command A1024±422.4K4.8%2.2%42 tps0.8s256K$2.00$7.33
56135DeepSeek V31032±517.6K3.7%0.9%69 tps1.1s64K$0.59$1.49
57135Qwen3 VL 30B A3B Instruct1034±151K6.7%1.8%80 tps2.6s129K$0.18$0.67
58135QwQ 32B1035±411.6K6.4%5.4%41 tps2.1s16K$0.43$0.56
59119Qwen3 32B Fast1052±611.4K5.2%11.6%30 tps3.1s41K$0.10$0.25
60119DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
61112gpt-oss-20b1066±67.7K7.1%0.5%216 tps0.5s131K$0.06$0.26
62112Kimi K2 Fast1073±535K6.4%0.8%365 tps0.5s131K$1.00$3.00
6398DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
6498Qwen3 235B A22B1093±64.5K8.0%5.3%71 tps0.9s41K$0.23$0.63
6590Step 3.5 Flash1102±248103.6%2.2%109 tps0.6s256K$0.05$0.15
6690DeepSeek V3.2 Exp Chat1107±45.5K6.1%2.6%29 tps1.5s131K$0.27$0.39
6777Mistral Large 31131±85.4K5.8%2.1%51 tps1.0s256K$0.50$1.50
6869gpt-oss-120b1165±519.2K5.0%0.7%213 tps0.5s131K$0.11$0.50
6960DeepSeek V3.2 Thinking1178±923.3K4.0%9.0%30 tps2.6s131K$0.28$0.42
7049Kimi K2 Thinking Turbo1192±620.3K3.4%2.0%75 tps1.4s262K$1.15$8.00
7136Kimi K2.5 Instant1210±81.8K3.2%2.9%32 tps3.0s262K$0.50$3.00
7236Qwen3.5 27B1211±169104.7%3.7%55 tps2.6s256K$0.30$2.40
7336Qwen3.5 122B A17B1216±151.9K3.1%1.5%82 tps1.4s256K$0.40$3.20
7431MiniMax M2.5 Lightning1228±141.7K3.2%1.5%51 tps2.0s205K$0.60$2.40
7531Qwen3 Next 80B A3B Instruct1231±58.8K5.8%0.6%84 tps1.1s256K$0.20$1.42
7619Kimi K2.51291±1116.5K3.4%6.5%33 tps1.7s262K$0.34$2.57
Show Less