Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

258
Phi 4 Mini Reasoning
486
Phi 4 Reasoning
654
GPT-4o mini
679
Qwen 2.5 14B Instruct
681
Hermes 4 405B Reasoning FP8
689
Gemma 3 12B
707
MiniMax M1
709
C4AI Aya Expanse 32B
710
Switchpoint Router
724
Gemma 3 4B
726
Llama 3.3 70B
730
Command R
736
DeepSeek-R1 Distill Qwen 32B
744
Jamba 1.5 Large
755
DeepSeek-R1 Distill Llama 70B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1291Phi 4 Mini Reasoning258±229357.0%9.7%30 tps0.9s128K$0.07$0.30
2287Phi 4 Reasoning486±325302.8%21.0%29 tps1.0s33K$0.06$0.25
3201GPT-4o mini654±305154.6%2.1%71 tps1.7s128K$0.15$0.60
4209Qwen 2.5 14B Instruct679±225053.8%2.4%40 tps1.6s1M$0.40$1.61
5260Hermes 4 405B Reasoning FP8681±317255.2%3.6%32 tps0.8s131K$1.00$3.00
6214Gemma 3 12B689±235504.3%4.2%73 tps0.8s131K$0.05$0.12
7284MiniMax M1707±141.5K1.0%<0.1%31 tps2.8s1M$0.55$2.20
8214C4AI Aya Expanse 32B709±229251.6%1.5%43 tps0.5s128K$0.50$1.50
9179Switchpoint Router710±266203.1%1.7%71 tps4.9s131K$0.85$3.40
10235Gemma 3 4B724±246603.6%1.3%138 tps0.7s131K$0.02$0.04
11194Llama 3.3 70B726±267355.2%0.3%500 tps0.5s8K$0.48$0.66
12225Command R730±236052.4%5.8%54 tps0.6s128K$0.30$0.99
13274DeepSeek-R1 Distill Qwen 32B736±225701.7%6.2%22 tps1.8s131K$0.37$0.39
14222Jamba 1.5 Large744±247152.1%1.7%48 tps0.9s256K$1.50$6.00
15246DeepSeek-R1 Distill Llama 70B755±199602.5%3.6%27 tps1.6s32K$0.73$0.95
16186Jamba 1.6 Large761±157801.9%2.0%59 tps1.2s256K$1.33$5.33
17229Magistral Medium 2509782±285507.6%4.0%58 tps0.9s131K$2.00$5.00
18225Command R 7B787±266602.9%1.1%76 tps0.4s128K$0.04$0.15
19179Amazon Nova Pro 1.0803±131.2K2.0%0.9%96 tps0.7s300K$0.80$1.70
20186Grok 3 Mini Fast807±142.4K1.8%1.6%44 tps0.5s131K$0.60$4.00
21186Gemma 3n E4B814±171.6K3.6%2.0%30 tps0.5s8K$0.01$0.02
22235GLM 4 32B820±167002.1%2.6%40 tps1.6s33K$0.14$0.14
23222Sky T1 32B Preview821±186251.6%7.8%73 tps0.6s16K$0.12$0.18
24214OpenAI o3-mini-high833±132.9K1.0%2.4%231 tps10.5s200K$1.10$4.40
25177Mistral Small 3.1 24B Instruct839±226953.5%7.5%15 tps2.4s131K$0.06$0.18
26179Inception Mercury847±131.4K1.0%0.4%257 tps1.1s32K$0.25$1.00
27177OpenAI o3-mini851±74.7K1.8%0.8%143 tps3.3s200K$1.10$4.40
28186Grok 3 Mini852±142.5K1.4%1.2%43 tps0.5s131K$0.30$0.50
29175OpenAI o3-mini-low852±84.4K1.8%0.7%139 tps1.5s200K$1.10$4.40
30186GLM 4.6V Flash858±237502.6%3.7%64 tps2.1s128K$0.04$0.40
31209Llama 3.3 Swallow 70B Instruct865±198201.8%1.4%153 tps1.3s131K$0.13$0.39
32170Devstral Medium865±198051.8%1.5%77 tps0.6s131K$0.40$2.00
33194Magistral Small 2506870±161K2.8%1.6%156 tps0.5s40K$0.37$1.10
34160Llama 4 Scout872±114.4K1.4%0.6%88 tps5.1s131K$0.18$0.46
35157Cogito v2.1 671B876±304903.9%0.8%85 tps0.5s128K$1.25$1.25
36201Gemma 3 27B IT879±215601.8%2.0%60 tps0.8s128K$0.17$0.29
37139GLM 4.6V880±338852.2%6.4%21 tps1.8s128K$0.38$0.90
38143Seed 1.6 250615888±235302.8%3.1%46 tps2.2s256K$0.25$2.00
39161Llama 4 Maverick900±115.1K1.9%1.2%88 tps2.4s1M$0.23$0.83
40148DeepSeek-R1904±161.7K2.8%0.8%133 tps0.6s64K$0.91$3.07
View All (173 models)