Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

972
Mistral Small 3 24B Instruct
967
Llama 3.2 11B Instruct
961
DeepSeek Prover v2
958
Mistral Small 24B Instruct
946
Open Mistral Nemo
945
Hermes 2 Pro Llama 3 8B
940
Command R 7B
938
Gemma 3 27B IT
934
Qwen 2.5 7B
933
GLM 4.6V Flash
930
C4AI Aya Expanse 32B
929
Ministral 3B
928
Mistral Nemo
927
Llama 3.3 70B Instruct
925
Llama 3.3 70B Instruct Turbo

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
41194Mistral Small 3 24B Instruct972±47.7K1.5%2.6%77 tps0.6s33K$0.07$0.14
42194Llama 3.2 11B Instruct967±29.6K1.9%1.5%152 tps0.5s8K$0.16$0.16
43161DeepSeek Prover v2961±63.3K1.8%5.2%14 tps1.3s164K$0.40$1.56
44201Mistral Small 24B Instruct958±46.8K2.1%1.5%84 tps0.4s33K$0.80$0.80
45225Open Mistral Nemo946±37.3K2.1%1.5%171 tps0.5s131K$0.15$0.15
46235Hermes 2 Pro Llama 3 8B945±28.8K1.0%<0.1%76 tps1.0s131K$0.08$0.09
47225Command R 7B940±314K1.9%1.1%76 tps0.4s128K$0.04$0.15
48201Gemma 3 27B IT938±39.7K1.7%2.0%60 tps0.8s128K$0.17$0.29
49214Qwen 2.5 7B934±47.5K2.3%3.7%40 tps1.9s131K$0.08$0.27
50186GLM 4.6V Flash933±47.9K3.7%3.7%64 tps2.1s128K$0.04$0.40
51214C4AI Aya Expanse 32B930±217.9K1.6%1.5%43 tps0.5s128K$0.50$1.50
52246Ministral 3B929±38.6K2.2%0.8%248 tps0.4s131K$0.08$0.08
53240Mistral Nemo928±34.2K1.2%<0.1%112 tps0.4s131K$0.07$0.13
54240Llama 3.3 70B Instruct927±119002.7%5.3%28 tps1.3s128K$0.38$0.55
55214Llama 3.3 70B Instruct Turbo925±54.2K3.3%2.0%78 tps1.0s131K$0.88$0.88
56222Sky T1 32B Preview923±311.2K1.6%7.8%73 tps0.6s16K$0.12$0.18
57235Mixtral 8x7B923±45.3K2.2%2.2%142 tps0.6s33K$0.23$0.23
58229Ministral 8B922±37.7K2.5%1.4%177 tps0.4s128K$0.14$0.14
59225Command R920±39.8K2.0%5.8%54 tps0.6s128K$0.30$0.99
60229Llama 3.1 8B915±81.3K2.9%1.9%61 tps1.0s8K$0.07$0.09
61235Gemma 3 4B909±212.6K1.9%1.3%138 tps0.7s131K$0.02$0.04
62256Gemma 3 1B909±47.1K3.0%0.6%176 tps1.0s33K$0.06$0.10
63253Gemma 2 27B906±36.9K1.8%1.4%44 tps1.4s8K$0.80$0.80
64235Command R+902±56.6K2.2%2.8%36 tps0.7s128K$2.08$9.45
65246Mixtral 8x22B Instruct900±56K2.2%1.8%142 tps0.7s66K$0.45$0.45
66256Mixtral 8x7B Instruct894±66.2K2.0%0.2%79 tps0.7s33K$0.23$0.31
67256Phi 4890±38.4K1.8%5.1%28 tps1.3s128K$0.10$0.32
68260Mistral Small881±34.8K2.5%1.7%142 tps0.6s32K$0.43$1.30
69265Mixtral-8x7B Instruct v0.1880±55.5K2.4%1.3%54 tps0.4s33K$0.60$0.60
70246DeepSeek-R1 Distill Llama 70B874±56.5K3.4%3.6%27 tps1.6s32K$0.73$0.95
71260Hermes 4 405B Reasoning FP8862±46.8K8.0%3.6%32 tps0.8s131K$1.00$3.00
72271Hermes 3 405B Instruct838±36K1.8%2.3%20 tps1.1s131K$0.80$0.80
73281Gemma 2 9B831±91.6K3.7%<0.1%100 tps0.4s8K$0.09$0.09
74281MythoMax L2 13B814±49.8K2.5%1.2%22 tps1.1s4K$0.18$0.18
75281Goliath 120B794±53.1K2.5%2.7%21 tps2.2s6K$6.56$9.38
76285Phi 4 Mini Instruct778±54.1K3.2%7.4%40 tps1.1s128K$0.07$0.30
77274Pixtral 12B755±124.6K5.7%2.2%101 tps1.2s131K$0.08$0.08
78287Phi 4 Reasoning697±84K3.5%21.0%29 tps1.0s33K$0.06$0.25
79291Phi 4 Mini Reasoning635±47.9K7.9%9.7%30 tps0.9s128K$0.07$0.30
80288Qwen 2.5 VL 3B Instruct629±75.1K6.8%3.0%44 tps2.5s128K$0.21$0.63
View All (80 models)