Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

447
Phi 4 Mini Reasoning
463
CodeLlama 7B Instruct Solidity
523
Qwen 2.5 VL 3B Instruct
573
Phi 4 Reasoning
588
Hunyuan A13B Instruct
599
Phi 4 Mini Instruct
600
MythoMax L2 13B
610
UI-TARS 1.5 7B
686
MiniMax M1
696
DeepSeek-R1 Distill Qwen 32B
702
Hermes 3 405B Instruct
706
DeepHermes 3 Mistral 24B Preview
719
Inflection 3 Pi
722
Pixtral 12B
737
Inflection 3 Productivity

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1286Phi 4 Mini Reasoning447±153.4K12.0%9.7%30 tps0.9s128K$0.07$0.30
2284CodeLlama 7B Instruct Solidity463±544858.5%3.6%33 tps0.7s16K$0.80$1.20
3284Qwen 2.5 VL 3B Instruct523±254.1K6.1%3.0%44 tps2.5s128K$0.21$0.63
4279Phi 4 Reasoning573±172.1K5.5%21.0%29 tps1.0s33K$0.06$0.25
5279Hunyuan A13B Instruct588±221.6K9.2%2.3%67 tps2.0s33K$0.01$0.01
6279Phi 4 Mini Instruct599±211K7.1%7.4%40 tps1.1s128K$0.07$0.30
7279MythoMax L2 13B600±212.3K5.8%1.2%22 tps1.1s4K$0.18$0.18
8279UI-TARS 1.5 7B610±4053011.7%4.0%75 tps0.9s128K$0.10$0.20
9276MiniMax M1686±133.8K5.3%<0.1%31 tps2.8s1M$0.55$2.20
10276DeepSeek-R1 Distill Qwen 32B696±202K5.5%6.2%22 tps1.8s131K$0.37$0.39
11276Hermes 3 405B Instruct702±201.4K4.1%2.3%20 tps1.1s131K$0.80$0.80
12269DeepHermes 3 Mistral 24B Preview706±307155.9%2.5%50 tps1.0s33K$0.06$0.25
13269Inflection 3 Pi719±181.5K4.1%1.1%33 tps3.4s8K$2.50$10.00
14269Pixtral 12B722±213K6.3%2.2%101 tps1.2s131K$0.08$0.08
15269Inflection 3 Productivity737±241.5K5.0%0.6%50 tps3.2s8K$2.50$10.00
16269Command R+738±151.6K5.6%2.8%36 tps0.7s128K$2.08$9.45
17269Mixtral 8x22B Instruct738±171.4K5.6%1.8%142 tps0.7s66K$0.45$0.45
18269Gemma 3 4B742±103.3K4.7%1.3%138 tps0.7s131K$0.02$0.04
19262Qwen 2.5 VL 72B Instruct746±202.1K6.0%5.3%25 tps3.7s128K$1.01$2.79
20262Goliath 120B754±247455.7%2.7%21 tps2.2s6K$6.56$9.38
21262Hermes 4 405B Reasoning FP8759±112.7K12.8%3.6%32 tps0.8s131K$1.00$3.00
22262Open Mistral 7B762±181.3K4.7%0.7%176 tps0.4s33K$0.25$0.25
23262Mistral Small770±121.2K4.5%1.7%142 tps0.6s32K$0.43$1.30
24262Baichuan-M2-32B770±3074010.8%<0.1%32 tps3.3s131K$0.07$0.07
25262Command R778±182.2K4.9%5.8%54 tps0.6s128K$0.30$0.99
26252Hermes 4 70B781±294608.9%1.1%67 tps0.6s131K$0.12$0.39
27252Mistral Large785±161.1K5.8%1.5%54 tps0.7s33K$2.00$6.00
28252GPT-3.5 Turbo Instruct787±92K2.7%<0.1%46 tps1.2s4K$1.50$2.00
29252Mercury Coder793±275103.8%<0.1%247 tps2.2s32K$0.25$1.00
30252Hermes 4 405B FP8797±218158.4%3.5%31 tps0.9s131K$0.52$1.73
31252Phi 4798±161.7K3.4%5.1%28 tps1.3s128K$0.10$0.32
32252WizardLM-2 8x22B801±121.9K3.1%11.6%11 tps2.5s66K$0.77$0.77
33252Gemma 3 1B802±112K6.1%0.6%176 tps1.0s33K$0.06$0.10
34252Magistral Small 2509802±181.8K7.5%2.7%116 tps0.6s131K$0.50$1.50
35252Ministral 3B806±162.3K5.1%0.8%248 tps0.4s131K$0.08$0.08
36240Gemma 2 27B815±171.5K4.1%1.4%44 tps1.4s8K$0.80$0.80
37240LFM2 8B A1B818±1882511.3%<0.1%142 tps0.3s33K$0.01$0.02
38240Moonshot V1 32k820±179503.1%1.4%53 tps1.4s33K$1.00$3.00
39240C4AI Aya Expanse 32B821±73.8K4.0%1.5%43 tps0.5s128K$0.50$1.50
40240Ministral 8B825±172.2K5.5%1.4%177 tps0.4s128K$0.14$0.14
View All (286 models)