Models

Last updated about 1 month ago

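| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 262 | Hermes 4 405B Reasoning FP8 | 759 | ±11 | 2.7K | 12.8% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 82 | 262 | Goliath 120B | 754 | ±24 | 745 | 5.7% | 2.7% | 21 tps | 2.2s | 6K | $6.56 | $9.38 |
| 83 | 269 | Gemma 3 4B | 742 | ±10 | 3.3K | 4.7% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 |
| 84 | 269 | Mixtral 8x22B Instruct | 738 | ±17 | 1.4K | 5.6% | 1.8% | 142 tps | 0.7s | 66K | $0.45 | $0.45 |
| 85 | 269 | Command R+ | 738 | ±15 | 1.6K | 5.6% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 |
| 86 | 361 | Command | 734 | ±18 | 765 | 4.4% | <0.1% | 25 tps | N/A | 4K | $0.83 | $1.33 |
| 87 | 269 | Pixtral 12B | 722 | ±21 | 3K | 6.3% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 88 | 374 | Mythalion 13B | 709 | ±10 | 1.1K | 1.3% | <0.1% | 63 tps | 0.5s | 4K | $0.56 | $1.13 |
| 89 | 276 | Hermes 3 405B Instruct | 702 | ±20 | 1.4K | 4.1% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 |
| 90 | 374 | Phi 4 Multimodal Instruct | 697 | ±16 | 2.1K | 6.8% | <0.1% | 17 tps | 1.4s | 128K | $0.03 | $0.05 |
| 91 | 386 | DeepSeek-R1 Distill Qwen 7B | 633 | ±19 | 565 | 5.0% | <0.1% | 0 tps | N/A | 131K | $0.05 | $0.10 |
| 92 | 390 | Llema 7B | 601 | ±21 | 850 | 4.5% | <0.1% | 1 tps | 15.0s | 4K | $0.80 | $1.20 |
| 93 | 279 | MythoMax L2 13B | 600 | ±21 | 2.3K | 5.8% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |
| 94 | 279 | Phi 4 Mini Instruct | 599 | ±21 | 1K | 7.1% | 7.4% | 40 tps | 1.1s | 128K | $0.07 | $0.30 |
| 95 | 279 | Phi 4 Reasoning | 573 | ±17 | 2.1K | 5.5% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 |
| 96 | 284 | Qwen 2.5 VL 3B Instruct | 523 | ±25 | 4.1K | 6.1% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |
| 97 | 399 | DeepSeek-R1 Distill Qwen 1.5B | 481 | ±19 | 730 | 5.2% | <0.1% | 20 tps | 0.0s | 131K | $0.18 | $0.18 |
| 98 | 284 | CodeLlama 7B Instruct Solidity | 463 | ±54 | 485 | 8.5% | 3.6% | 33 tps | 0.7s | 16K | $0.80 | $1.20 |
| 99 | 286 | Phi 4 Mini Reasoning | 447 | ±15 | 3.4K | 12.0% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |
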
Showing ranks 81 to 99 of 99 models.