Last updated about 1 month ago

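| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 291 | LFM2.5 1.2B Thinking | 508 | ±26 | 545 | 5.2% | 2.6% | 258 tps | 0.4s | 33K | $0 | $0 |
| 2 | 288 | Qwen 2.5 VL 3B Instruct | 624 | ±17 | 2.2K | 7.8% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |
| 3 | 291 | Phi 4 Mini Reasoning | 626 | ±8 | 4.4K | 4.9% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |
| 4 | 289 | UI-TARS 1.5 7B | 646 | ±17 | 915 | 4.2% | 4.0% | 75 tps | 0.9s | 128K | $0.10 | $0.20 |
| 5 | 274 | Moonshot V1 128k Vision | 649 | ±31 | 515 | 6.4% | 3.1% | 44 tps | 3.8s | 131K | $2.00 | $5.00 |
| 6 | 287 | Phi 4 Reasoning | 707 | ±10 | 1K | 2.8% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 |
| 7 | 281 | Goliath 120B | 741 | ±9 | 2.6K | 2.1% | 2.7% | 21 tps | 2.2s | 6K | $6.56 | $9.38 |
| 8 | 274 | Pixtral 12B | 748 | ±15 | 2.3K | 5.1% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 9 | 274 | LFM2 8B A1B | 757 | ±9 | 1.9K | 4.8% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 |
| 10 | 274 | MiniMax M2-her | 767 | ±11 | 880 | 1.7% | <0.1% | 108 tps | 0.7s | 205K | $0.30 | $1.20 |
| 11 | 285 | Phi 4 Mini Instruct | 774 | ±5 | 3.7K | 2.0% | 7.4% | 40 tps | 1.1s | 128K | $0.07 | $0.30 |
| 12 | 285 | Hunyuan A13B Instruct | 777 | ±5 | 4.1K | 2.4% | 2.3% | 67 tps | 2.0s | 33K | $0.01 | $0.01 |
| 13 | 274 | C4AI Aya Expanse 8B | 778 | ±15 | 885 | 2.7% | 0.9% | 61 tps | 0.4s | 8K | $0.50 | $1.50 |
| 14 | 281 | Gemma 2 9B | 779 | ±7 | 1.3K | 3.6% | <0.1% | 100 tps | 0.4s | 8K | $0.09 | $0.09 |
| 15 | 265 | Magistral Small 2509 | 786 | ±11 | 1.7K | 3.7% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 16 | 281 | MythoMax L2 13B | 796 | ±4 | 9K | 1.6% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |
| 17 | 265 | Qwen 2.5 VL 72B Instruct | 808 | ±12 | 1.3K | 5.5% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 18 | 271 | Hermes 3 405B Instruct | 814 | ±4 | 5.5K | 1.2% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 |
| 19 | 265 | LFM2 2.6B | 816 | ±8 | 1.7K | 5.2% | 6.7% | 184 tps | 0.4s | 33K | $0.01 | $0.02 |
| 20 | 271 | Mistral Large | 818 | ±6 | 4.3K | 1.6% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 |
| 21 | 274 | DeepHermes 3 Mistral 24B Preview | 824 | ±9 | 995 | 3.4% | 2.5% | 50 tps | 1.0s | 33K | $0.06 | $0.25 |
| 22 | 260 | Hermes 4 405B Reasoning FP8 | 826 | ±4 | 5K | 3.5% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 23 | 271 | Inflection 3 Pi | 837 | ±4 | 6.8K | 0.9% | 1.1% | 33 tps | 3.4s | 8K | $2.50 | $10.00 |
| 24 | 240 | Llama 3.3 70B Instruct | 839 | ±18 | 675 | 2.2% | 5.3% | 28 tps | 1.3s | 128K | $0.38 | $0.55 |
| 25 | 214 | OpenAI o3-mini-high | 839 | ±6 | 3.6K | 1.6% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 26 | 260 | Apriel 1.6 15B Thinker | 840 | ±12 | 855 | 2.3% | 2.6% | 92 tps | 0.4s | 131K | $0 | $0 |
| 27 | 284 | MiniMax M1 | 843 | ±5 | 2.7K | 1.6% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 |
| 28 | 256 | Mixtral 8x7B Instruct | 844 | ±5 | 5.7K | 1.3% | 0.2% | 79 tps | 0.7s | 33K | $0.23 | $0.31 |
| 29 | 214 | Qwen 2.5 VL 32B Instruct | 847 | ±16 | 840 | 4.0% | 6.3% | 43 tps | 3.2s | 128K | $0.35 | $0.62 |
| 30 | 265 | Mixtral-8x7B Instruct v0.1 | 849 | ±5 | 5K | 1.5% | 1.3% | 54 tps | 0.4s | 33K | $0.60 | $0.60 |
| 31 | 260 | Open Mistral 7B | 849 | ±6 | 4.8K | 1.2% | 0.7% | 176 tps | 0.4s | 33K | $0.25 | $0.25 |
| 32 | 256 | Solar Mini 250422 | 856 | ±8 | 3.2K | 2.0% | 1.8% | 90 tps | 1.7s | 33K | $0.15 | $0.15 |
| 33 | 265 | Inflection 3 Productivity | 858 | ±4 | 6.5K | 0.8% | 0.6% | 50 tps | 3.2s | 8K | $2.50 | $10.00 |
| 34 | 260 | Mistral Small | 859 | ±6 | 4.4K | 1.6% | 1.7% | 142 tps | 0.6s | 32K | $0.43 | $1.30 |
| 35 | 256 | Phi 4 | 867 | ±3 | 7.7K | 1.2% | 5.1% | 28 tps | 1.3s | 128K | $0.10 | $0.32 |
| 36 | 229 | Llama 3.1 8B | 868 | ±9 | 1.2K | 2.4% | 1.9% | 61 tps | 1.0s | 8K | $0.07 | $0.09 |
| 37 | 246 | Mixtral 8x22B Instruct | 871 | ±4 | 5.4K | 1.6% | 1.8% | 142 tps | 0.7s | 66K | $0.45 | $0.45 |
| 38 | 240 | Moonshot V1 32k | 872 | ±4 | 3.8K | 0.8% | 1.4% | 53 tps | 1.4s | 33K | $1.00 | $3.00 |
| 39 | 265 | Ministral 3B 2512 | 876 | ±9 | 1.3K | 3.4% | 2.8% | 339 tps | 0.6s | 131K | $0.10 | $0.10 |
| 40 | 246 | Mixtral 8x22B | 886 | ±6 | 4.7K | 1.1% | 1.2% | 140 tps | 0.6s | 64K | $2.00 | $6.00 |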
Showing the first 40 of 283 models.