Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

466
MythoMax L2 13B
545
Phi 4 Mini Reasoning
570
MiniMax M1
632
Phi 4 Reasoning
638
DeepSeek-R1 Distill Qwen 32B
699
GPT-3.5 Turbo 16k
702
C4AI Aya Expanse 32B
733
Gemma 3 1B
733
DeepSeek-R1 Distill Llama 70B
751
GLM 4 32B
754
Command R
763
Ministral 8B
766
Ministral 3B
775
Command R 7B
797
Sky T1 32B Preview

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1281MythoMax L2 13B466±305204.6%1.2%22 tps1.1s4K$0.18$0.18
2291Phi 4 Mini Reasoning545±122.3K3.1%9.7%30 tps0.9s128K$0.07$0.30
3284MiniMax M1570±103K1.3%<0.1%31 tps2.8s1M$0.55$2.20
4287Phi 4 Reasoning632±132K2.2%21.0%29 tps1.0s33K$0.06$0.25
5274DeepSeek-R1 Distill Qwen 32B638±91.8K2.7%6.2%22 tps1.8s131K$0.37$0.39
6225GPT-3.5 Turbo 16k699±176900.7%<0.1%22 tps0.6s16K$3.00$4.00
7214C4AI Aya Expanse 32B702±228251.8%1.5%43 tps0.5s128K$0.50$1.50
8256Gemma 3 1B733±216702.9%0.6%176 tps1.0s33K$0.06$0.10
9246DeepSeek-R1 Distill Llama 70B733±102.6K2.3%3.6%27 tps1.6s32K$0.73$0.95
10235GLM 4 32B751±197402.0%2.6%40 tps1.6s33K$0.14$0.14
11225Command R754±225402.7%5.8%54 tps0.6s128K$0.30$0.99
12229Ministral 8B763±235253.7%1.4%177 tps0.4s128K$0.14$0.14
13246Ministral 3B766±215751.7%0.8%248 tps0.4s131K$0.08$0.08
14225Command R 7B775±188701.7%1.1%76 tps0.4s128K$0.04$0.15
15222Sky T1 32B Preview797±176251.6%7.8%73 tps0.6s16K$0.12$0.18
16201GPT-4o mini803±245454.4%2.1%71 tps1.7s128K$0.15$0.60
17179Amazon Nova Pro 1.0807±191.4K1.7%0.9%96 tps0.7s300K$0.80$1.70
18194Llama 3.2 11B Instruct816±225252.8%1.5%152 tps0.5s8K$0.16$0.16
19222Jamba 1.5 Large819±156901.4%1.7%48 tps0.9s256K$1.50$6.00
20235Gemma 3 4B825±147553.2%1.3%138 tps0.7s131K$0.02$0.04
21201Llama 3 8B826±177201.4%6.0%85 tps0.7s8K$0.12$0.16
22260Hermes 4 405B Reasoning FP8828±111.3K3.7%3.6%32 tps0.8s131K$1.00$3.00
23179Inception Mercury829±102K1.5%0.4%257 tps1.1s32K$0.25$1.00
24265Magistral Small 2509830±238255.7%2.7%116 tps0.6s131K$0.50$1.50
25209Qwen 2.5 14B Instruct830±245701.7%2.4%40 tps1.6s1M$0.40$1.61
26161Mistral Small 3.1835±176751.5%7.4%13 tps2.6s32K$0.17$0.28
27194Magistral Small 2506847±141.3K1.9%1.6%156 tps0.5s40K$0.37$1.10
28229Magistral Medium 2509849±169903.9%4.0%58 tps0.9s131K$2.00$5.00
29209Llama 3.3 Swallow 70B Instruct854±188901.1%1.4%153 tps1.3s131K$0.13$0.39
30229ERNIE 4.5 21B A3B Thinking866±166852.8%1.8%87 tps1.5s120K$0.07$0.28
31179Switchpoint Router876±166751.5%1.7%71 tps4.9s131K$0.85$3.40
32165Pixtral Large890±126404.5%2.5%57 tps1.3s128K$1.50$4.50
33186Gemma 3n E4B892±102K2.6%2.0%30 tps0.5s8K$0.01$0.02
34186GLM 4.6V Flash899±151.3K2.3%3.7%64 tps2.1s128K$0.04$0.40
35170Devstral Medium899±109951.5%1.5%77 tps0.6s131K$0.40$2.00
36194Llama 3.3 70B900±121.1K3.0%0.3%500 tps0.5s8K$0.48$0.66
37214Qwen 2.5 7B901±194904.9%3.7%40 tps1.9s131K$0.08$0.27
38186Grok 3 Mini908±93.8K2.3%1.2%43 tps0.5s131K$0.30$0.50
39113GLM 4.5 AirX913±235401.8%3.3%75 tps1.2s131K$1.10$4.50
40186Jamba 1.6 Large915±186601.5%2.0%59 tps1.2s256K$1.33$5.33
View All (193 models)