Last updated about 1 month ago

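| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 161 | 186 | Gemma 3n E4B | 892 | ±10 | 2K | 2.6% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 |
| 162 | 165 | Pixtral Large | 890 | ±12 | 640 | 4.5% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 163 | 179 | Switchpoint Router | 876 | ±16 | 675 | 1.5% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 164 | 229 | ERNIE 4.5 21B A3B Thinking | 866 | ±16 | 685 | 2.8% | 1.8% | 87 tps | 1.5s | 120K | $0.07 | $0.28 |
| 165 | 209 | Llama 3.3 Swallow 70B Instruct | 854 | ±18 | 890 | 1.1% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 166 | 229 | Magistral Medium 2509 | 849 | ±16 | 990 | 3.9% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 167 | 194 | Magistral Small 2506 | 847 | ±14 | 1.3K | 1.9% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 |
| 168 | 161 | Mistral Small 3.1 | 835 | ±17 | 675 | 1.5% | 7.4% | 13 tps | 2.6s | 32K | $0.17 | $0.28 |
| 169 | 209 | Qwen 2.5 14B Instruct | 830 | ±24 | 570 | 1.7% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 |
| 170 | 265 | Magistral Small 2509 | 830 | ±23 | 825 | 5.7% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 171 | 179 | Inception Mercury | 829 | ±10 | 2K | 1.5% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 |
| 172 | 260 | Hermes 4 405B Reasoning FP8 | 828 | ±11 | 1.3K | 3.7% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 173 | 201 | Llama 3 8B | 826 | ±17 | 720 | 1.4% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 174 | 235 | Gemma 3 4B | 825 | ±14 | 755 | 3.2% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 |
| 175 | 222 | Jamba 1.5 Large | 819 | ±15 | 690 | 1.4% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 176 | 194 | Llama 3.2 11B Instruct | 816 | ±22 | 525 | 2.8% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 |
| 177 | 179 | Amazon Nova Pro 1.0 | 807 | ±19 | 1.4K | 1.7% | 0.9% | 96 tps | 0.7s | 300K | $0.80 | $1.70 |
| 178 | 201 | GPT-4o mini | 803 | ±24 | 545 | 4.4% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 179 | 222 | Sky T1 32B Preview | 797 | ±17 | 625 | 1.6% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 180 | 225 | Command R 7B | 775 | ±18 | 870 | 1.7% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 |
| 181 | 246 | Ministral 3B | 766 | ±21 | 575 | 1.7% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 |
| 182 | 229 | Ministral 8B | 763 | ±23 | 525 | 3.7% | 1.4% | 177 tps | 0.4s | 128K | $0.14 | $0.14 |
| 183 | 225 | Command R | 754 | ±22 | 540 | 2.7% | 5.8% | 54 tps | 0.6s | 128K | $0.30 | $0.99 |
| 184 | 235 | GLM 4 32B | 751 | ±19 | 740 | 2.0% | 2.6% | 40 tps | 1.6s | 33K | $0.14 | $0.14 |
| 185 | 246 | DeepSeek-R1 Distill Llama 70B | 733 | ±10 | 2.6K | 2.3% | 3.6% | 27 tps | 1.6s | 32K | $0.73 | $0.95 |
| 186 | 256 | Gemma 3 1B | 733 | ±21 | 670 | 2.9% | 0.6% | 176 tps | 1.0s | 33K | $0.06 | $0.10 |
| 187 | 214 | C4AI Aya Expanse 32B | 702 | ±22 | 825 | 1.8% | 1.5% | 43 tps | 0.5s | 128K | $0.50 | $1.50 |
| 188 | 225 | GPT-3.5 Turbo 16k | 699 | ±17 | 690 | 0.7% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |
| 189 | 274 | DeepSeek-R1 Distill Qwen 32B | 638 | ±9 | 1.8K | 2.7% | 6.2% | 22 tps | 1.8s | 131K | $0.37 | $0.39 |
| 190 | 287 | Phi 4 Reasoning | 632 | ±13 | 2K | 2.2% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 |
| 191 | 284 | MiniMax M1 | 570 | ±10 | 3K | 1.3% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 |
| 192 | 291 | Phi 4 Mini Reasoning | 545 | ±12 | 2.3K | 3.1% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |
| 193 | 281 | MythoMax L2 13B | 466 | ±30 | 520 | 4.6% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |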
Showing ranks 161 to 193 of 193 models.