Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

466
MythoMax L2 13B
545
Phi 4 Mini Reasoning
570
MiniMax M1
624
DeepSeek-R1 Distill Llama 8B
632
Phi 4 Reasoning
638
DeepSeek-R1 Distill Qwen 32B
648
DeepSeek-R1 Distill Qwen 1.5B
664
DeepSeek-R1 Distill Qwen 7B
664
DeepSeek-R1 Distill Qwen 14B
699
GPT-3.5 Turbo 16k
702
C4AI Aya Expanse 32B
733
Gemma 3 1B
733
DeepSeek-R1 Distill Llama 70B
748
Arcee AI Blitz
751
GLM 4 32B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1281MythoMax L2 13B466±305204.6%1.2%22 tps1.1s4K$0.18$0.18
2291Phi 4 Mini Reasoning545±122.3K3.1%9.7%30 tps0.9s128K$0.07$0.30
3284MiniMax M1570±103K1.3%<0.1%31 tps2.8s1M$0.55$2.20
4428DeepSeek-R1 Distill Llama 8B624±201.1K3.1%<0.1%17 tpsN/A32K$0.04$0.04
5287Phi 4 Reasoning632±132K2.2%21.0%29 tps1.0s33K$0.06$0.25
6274DeepSeek-R1 Distill Qwen 32B638±91.8K2.7%6.2%22 tps1.8s131K$0.37$0.39
7430DeepSeek-R1 Distill Qwen 1.5B648±179152.1%<0.1%20 tps0.0s131K$0.18$0.18
8424DeepSeek-R1 Distill Qwen 7B664±216401.5%<0.1%0 tpsN/A131K$0.05$0.10
9406DeepSeek-R1 Distill Qwen 14B664±141.7K2.5%<0.1%44 tps1.7s64K$0.63$0.63
10225GPT-3.5 Turbo 16k699±176900.7%<0.1%22 tps0.6s16K$3.00$4.00
11214C4AI Aya Expanse 32B702±228251.8%1.5%43 tps0.5s128K$0.50$1.50
12256Gemma 3 1B733±216702.9%0.6%176 tps1.0s33K$0.06$0.10
13246DeepSeek-R1 Distill Llama 70B733±102.6K2.3%3.6%27 tps1.6s32K$0.73$0.95
14241Arcee AI Blitz748±157101.4%<0.1%6 tpsN/A33K$0.45$0.75
15235GLM 4 32B751±197402.0%2.6%40 tps1.6s33K$0.14$0.14
16225Command R754±225402.7%5.8%54 tps0.6s128K$0.30$0.99
17229Ministral 8B763±235253.7%1.4%177 tps0.4s128K$0.14$0.14
18246Ministral 3B766±215751.7%0.8%248 tps0.4s131K$0.08$0.08
19339Refuel LLM 2 Small768±208002.4%<0.1%116 tps0.5s8K$0.20$0.20
20399Magistral Medium (Thinking)774±72.3K2.7%<0.1%67 tps0.8s41K$2.00$5.00
21225Command R 7B775±188701.7%1.1%76 tps0.4s128K$0.04$0.15
22314MAI-DS-R1778±121.7K3.4%<0.1%73 tps3.2s64K$0.10$0.40
23219Arcee AI Virtuoso-Large791±118401.8%<0.1%64 tps0.5s131K$0.75$1.20
24292Arcee AI Spotlight796±151.4K1.8%<0.1%121 tps0.4s131K$0.18$0.18
25222Sky T1 32B Preview797±176251.6%7.8%73 tps0.6s16K$0.12$0.18
26201GPT-4o mini803±245454.4%2.1%71 tps1.7s128K$0.15$0.60
27179Amazon Nova Pro 1.0807±191.4K1.7%0.9%96 tps0.7s300K$0.80$1.70
28270Arcee AI Virtuoso-Medium809±215400.9%<0.1%3 tpsN/A131K$0.50$0.80
29241Claude Haiku 3813±186452.3%0.4%62 tps0.5s200K$0.25$1.25
30194Llama 3.2 11B Instruct816±225252.8%1.5%152 tps0.5s8K$0.16$0.16
31222Jamba 1.5 Large819±156901.4%1.7%48 tps0.9s256K$1.50$6.00
32235Gemma 3 4B825±147553.2%1.3%138 tps0.7s131K$0.02$0.04
33201Llama 3 8B826±177201.4%6.0%85 tps0.7s8K$0.12$0.16
34260Hermes 4 405B Reasoning FP8828±111.3K3.7%3.6%32 tps0.8s131K$1.00$3.00
35179Inception Mercury829±102K1.5%0.4%257 tps1.1s32K$0.25$1.00
36265Magistral Small 2509830±238255.7%2.7%116 tps0.6s131K$0.50$1.50
37270AFM 4.5B Preview830±228752.2%<0.1%32 tps0.0s66K$0$0
38209Qwen 2.5 14B Instruct830±245701.7%2.4%40 tps1.6s1M$0.40$1.61
39161Mistral Small 3.1835±176751.5%7.4%13 tps2.6s32K$0.17$0.28
40374Cogito V2 671B839±131.3K3.0%<0.1%41 tps0.6s164K$1.25$1.25
View All (260 models)