More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.


Last updated about 1 month ago

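| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 288 | Qwen 2.5 VL 3B Instruct | 624 | ±17 | 2.2K | 7.8% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |
| 2 | 291 | Phi 4 Mini Reasoning | 626 | ±8 | 4.4K | 4.9% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |
| 3 | 287 | Phi 4 Reasoning | 707 | ±10 | 1K | 2.8% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 |
| 4 | 281 | Goliath 120B | 741 | ±9 | 2.6K | 2.1% | 2.7% | 21 tps | 2.2s | 6K | $6.56 | $9.38 |
| 5 | 274 | Pixtral 12B | 748 | ±15 | 2.3K | 5.1% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 6 | 285 | Phi 4 Mini Instruct | 774 | ±5 | 3.7K | 2.0% | 7.4% | 40 tps | 1.1s | 128K | $0.07 | $0.30 |
| 7 | 281 | Gemma 2 9B | 779 | ±7 | 1.3K | 3.6% | <0.1% | 100 tps | 0.4s | 8K | $0.09 | $0.09 |
| 8 | 281 | MythoMax L2 13B | 796 | ±4 | 9K | 1.6% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |
| 9 | 271 | Hermes 3 405B Instruct | 814 | ±4 | 5.5K | 1.2% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 |
| 10 | 260 | Hermes 4 405B Reasoning FP8 | 826 | ±4 | 5K | 3.5% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 11 | 240 | Llama 3.3 70B Instruct | 839 | ±18 | 675 | 2.2% | 5.3% | 28 tps | 1.3s | 128K | $0.38 | $0.55 |
| 12 | 256 | Mixtral 8x7B Instruct | 844 | ±5 | 5.7K | 1.3% | 0.2% | 79 tps | 0.7s | 33K | $0.23 | $0.31 |
| 13 | 265 | Mixtral-8x7B Instruct v0.1 | 849 | ±5 | 5K | 1.5% | 1.3% | 54 tps | 0.4s | 33K | $0.60 | $0.60 |
| 14 | 260 | Mistral Small | 859 | ±6 | 4.4K | 1.6% | 1.7% | 142 tps | 0.6s | 32K | $0.43 | $1.30 |
| 15 | 256 | Phi 4 | 867 | ±3 | 7.7K | 1.2% | 5.1% | 28 tps | 1.3s | 128K | $0.10 | $0.32 |
| 16 | 229 | Llama 3.1 8B | 868 | ±9 | 1.2K | 2.4% | 1.9% | 61 tps | 1.0s | 8K | $0.07 | $0.09 |
| 17 | 246 | Mixtral 8x22B Instruct | 871 | ±4 | 5.4K | 1.6% | 1.8% | 142 tps | 0.7s | 66K | $0.45 | $0.45 |
| 18 | 246 | Ministral 3B | 886 | ±5 | 7.5K | 1.4% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 |
| 19 | 253 | Gemma 2 27B | 889 | ±4 | 6.5K | 1.2% | 1.4% | 44 tps | 1.4s | 8K | $0.80 | $0.80 |
| 20 | 256 | Gemma 3 1B | 892 | ±5 | 6.1K | 1.9% | 0.6% | 176 tps | 1.0s | 33K | $0.06 | $0.10 |
| 21 | 235 | Mixtral 8x7B | 899 | ±6 | 4.7K | 1.3% | 2.2% | 142 tps | 0.6s | 33K | $0.23 | $0.23 |
| 22 | 235 | Command R+ | 904 | ±5 | 6.3K | 1.3% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 |
| 23 | 229 | Ministral 8B | 905 | ±4 | 6.8K | 1.4% | 1.4% | 177 tps | 0.4s | 128K | $0.14 | $0.14 |
| 24 | 222 | Sky T1 32B Preview | 905 | ±4 | 10.5K | 1.1% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 25 | 235 | Hermes 2 Pro Llama 3 8B | 908 | ±3 | 8.3K | 0.7% | <0.1% | 76 tps | 1.0s | 131K | $0.08 | $0.09 |
| 26 | 235 | Gemma 3 4B | 909 | ±4 | 11.3K | 1.0% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 |
| 27 | 240 | Mistral Nemo | 910 | ±5 | 3.8K | 0.5% | <0.1% | 112 tps | 0.4s | 131K | $0.07 | $0.13 |
| 28 | 225 | Open Mistral Nemo | 910 | ±5 | 6.7K | 1.1% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 |
| 29 | 225 | Command R | 913 | ±3 | 9.6K | 1.5% | 5.8% | 54 tps | 0.6s | 128K | $0.30 | $0.99 |
| 30 | 214 | Llama 3.3 70B Instruct Turbo | 919 | ±8 | 3.7K | 1.5% | 2.0% | 78 tps | 1.0s | 131K | $0.88 | $0.88 |
| 31 | 214 | Qwen 2.5 7B | 924 | ±4 | 6.9K | 1.4% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 |
| 32 | 201 | Gemma 3 27B IT | 927 | ±3 | 8.9K | 0.9% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 |
| 33 | 225 | Command R 7B | 928 | ±3 | 12.7K | 1.1% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 |
| 34 | 214 | C4AI Aya Expanse 32B | 931 | ±3 | 17.1K | 0.8% | 1.5% | 43 tps | 0.5s | 128K | $0.50 | $1.50 |
| 35 | 194 | Llama 3 70B | 934 | ±7 | 1.7K | 1.1% | 4.5% | 21 tps | 1.7s | 8K | $1.08 | $1.38 |
| 36 | 201 | Mistral Small 24B Instruct | 935 | ±4 | 6.3K | 1.2% | 1.5% | 84 tps | 0.4s | 33K | $0.80 | $0.80 |
| 37 | 186 | GLM 4.6V Flash | 937 | ±4 | 5.8K | 2.0% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 |
| 38 | 201 | Qwen 2.5 7B Turbo | 944 | ±9 | 2.4K | 1.5% | 0.5% | 125 tps | 0.4s | 131K | $0.30 | $0.30 |
| 39 | 194 | Llama 3.2 11B Instruct | 955 | ±4 | 9.2K | 1.0% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 |
| 40 | 194 | Mistral Small 3 24B Instruct | 955 | ±4 | 7.2K | 0.9% | 2.6% | 77 tps | 0.6s | 33K | $0.07 | $0.14 |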
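
Several adjacent rows have scores that differ by less than their ± values, so a quick overlap check can help when comparing two models. The sketch below is illustrative only: it assumes the ± value is a symmetric half-width around the VIBE Score (the page does not state how the interval is computed), and the `Entry` type and `intervals_overlap` helper are hypothetical names, not part of Yupp. The numbers are as read from the table rows above.

```python
from typing import NamedTuple, Tuple


class Entry(NamedTuple):
    name: str
    score: int       # VIBE Score column
    half_width: int  # the ± value from the Confidence Interval column


def interval(e: Entry) -> Tuple[int, int]:
    """Return (low, high), ASSUMING the ± value is a symmetric half-width."""
    return e.score - e.half_width, e.score + e.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """True when the two score intervals share at least one point."""
    a_lo, a_hi = interval(a)
    b_lo, b_hi = interval(b)
    return a_lo <= b_hi and b_lo <= a_hi


# Values as read from the leaderboard rows.
phi_4 = Entry("Phi 4", 867, 3)
llama_31_8b = Entry("Llama 3.1 8B", 868, 9)
pixtral_12b = Entry("Pixtral 12B", 748, 15)
gemma_2_9b = Entry("Gemma 2 9B", 779, 7)

if __name__ == "__main__":
    for a, b in [(phi_4, llama_31_8b), (pixtral_12b, gemma_2_9b)]:
        verdict = "overlap" if intervals_overlap(a, b) else "do not overlap"
        print(f"{a.name} {interval(a)} and {b.name} {interval(b)} {verdict}")
```

Overlapping intervals suggest that the order of two neighbouring models may not be meaningful. Non-overlapping intervals are only a rough signal, not a formal significance test.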
Showing 40 of 80 models.