Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

629
Qwen 2.5 VL 3B Instruct
635
Phi 4 Mini Reasoning
697
Phi 4 Reasoning
755
Pixtral 12B
778
Phi 4 Mini Instruct
794
Goliath 120B
814
MythoMax L2 13B
831
Gemma 2 9B
838
Hermes 3 405B Instruct
862
Hermes 4 405B Reasoning FP8
874
DeepSeek-R1 Distill Llama 70B
880
Mixtral-8x7B Instruct v0.1
881
Mistral Small
890
Phi 4
894
Mixtral 8x7B Instruct

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1288Qwen 2.5 VL 3B Instruct629±75.1K6.8%3.0%44 tps2.5s128K$0.21$0.63
2291Phi 4 Mini Reasoning635±47.9K7.9%9.7%30 tps0.9s128K$0.07$0.30
3287Phi 4 Reasoning697±84K3.5%21.0%29 tps1.0s33K$0.06$0.25
4274Pixtral 12B755±124.6K5.7%2.2%101 tps1.2s131K$0.08$0.08
5285Phi 4 Mini Instruct778±54.1K3.2%7.4%40 tps1.1s128K$0.07$0.30
6281Goliath 120B794±53.1K2.5%2.7%21 tps2.2s6K$6.56$9.38
7281MythoMax L2 13B814±49.8K2.5%1.2%22 tps1.1s4K$0.18$0.18
8281Gemma 2 9B831±91.6K3.7%<0.1%100 tps0.4s8K$0.09$0.09
9271Hermes 3 405B Instruct838±36K1.8%2.3%20 tps1.1s131K$0.80$0.80
10260Hermes 4 405B Reasoning FP8862±46.8K8.0%3.6%32 tps0.8s131K$1.00$3.00
11246DeepSeek-R1 Distill Llama 70B874±56.5K3.4%3.6%27 tps1.6s32K$0.73$0.95
12265Mixtral-8x7B Instruct v0.1880±55.5K2.4%1.3%54 tps0.4s33K$0.60$0.60
13260Mistral Small881±34.8K2.5%1.7%142 tps0.6s32K$0.43$1.30
14256Phi 4890±38.4K1.8%5.1%28 tps1.3s128K$0.10$0.32
15256Mixtral 8x7B Instruct894±66.2K2.0%0.2%79 tps0.7s33K$0.23$0.31
16246Mixtral 8x22B Instruct900±56K2.2%1.8%142 tps0.7s66K$0.45$0.45
17235Command R+902±56.6K2.2%2.8%36 tps0.7s128K$2.08$9.45
18253Gemma 2 27B906±36.9K1.8%1.4%44 tps1.4s8K$0.80$0.80
19256Gemma 3 1B909±47.1K3.0%0.6%176 tps1.0s33K$0.06$0.10
20235Gemma 3 4B909±212.6K1.9%1.3%138 tps0.7s131K$0.02$0.04
21229Llama 3.1 8B915±81.3K2.9%1.9%61 tps1.0s8K$0.07$0.09
22225Command R920±39.8K2.0%5.8%54 tps0.6s128K$0.30$0.99
23229Ministral 8B922±37.7K2.5%1.4%177 tps0.4s128K$0.14$0.14
24235Mixtral 8x7B923±45.3K2.2%2.2%142 tps0.6s33K$0.23$0.23
25222Sky T1 32B Preview923±311.2K1.6%7.8%73 tps0.6s16K$0.12$0.18
26214Llama 3.3 70B Instruct Turbo925±54.2K3.3%2.0%78 tps1.0s131K$0.88$0.88
27240Llama 3.3 70B Instruct927±119002.7%5.3%28 tps1.3s128K$0.38$0.55
28240Mistral Nemo928±34.2K1.2%<0.1%112 tps0.4s131K$0.07$0.13
29246Ministral 3B929±38.6K2.2%0.8%248 tps0.4s131K$0.08$0.08
30214C4AI Aya Expanse 32B930±217.9K1.6%1.5%43 tps0.5s128K$0.50$1.50
31186GLM 4.6V Flash933±47.9K3.7%3.7%64 tps2.1s128K$0.04$0.40
32214Qwen 2.5 7B934±47.5K2.3%3.7%40 tps1.9s131K$0.08$0.27
33201Gemma 3 27B IT938±39.7K1.7%2.0%60 tps0.8s128K$0.17$0.29
34225Command R 7B940±314K1.9%1.1%76 tps0.4s128K$0.04$0.15
35235Hermes 2 Pro Llama 3 8B945±28.8K1.0%<0.1%76 tps1.0s131K$0.08$0.09
36225Open Mistral Nemo946±37.3K2.1%1.5%171 tps0.5s131K$0.15$0.15
37201Mistral Small 24B Instruct958±46.8K2.1%1.5%84 tps0.4s33K$0.80$0.80
38161DeepSeek Prover v2961±63.3K1.8%5.2%14 tps1.3s164K$0.40$1.56
39194Llama 3.2 11B Instruct967±29.6K1.9%1.5%152 tps0.5s8K$0.16$0.16
40194Mistral Small 3 24B Instruct972±47.7K1.5%2.6%77 tps0.6s33K$0.07$0.14
View All (80 models)