Last updated about 1 month ago

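| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 288 | Qwen 2.5 VL 3B Instruct | 629 | ±7 | 5.1K | 6.8% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |
| 2 | 291 | LFM2.5 1.2B Thinking | 630 | ±22 | 705 | 4.7% | 2.6% | 258 tps | 0.4s | 33K | $0 | $0 |
| 3 | 291 | Phi 4 Mini Reasoning | 635 | ±4 | 7.9K | 7.9% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |
| 4 | 289 | UI-TARS 1.5 7B | 667 | ±18 | 1.4K | 8.7% | 4.0% | 75 tps | 0.9s | 128K | $0.10 | $0.20 |
| 5 | 284 | MiniMax M1 | 688 | ±4 | 7.8K | 4.0% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 |
| 6 | 287 | Phi 4 Reasoning | 697 | ±8 | 4K | 3.5% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 |
| 7 | 285 | Hunyuan A13B Instruct | 739 | ±5 | 4.6K | 5.0% | 2.3% | 67 tps | 2.0s | 33K | $0.01 | $0.01 |
| 8 | 274 | Pixtral 12B | 755 | ±12 | 4.6K | 5.7% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 9 | 285 | Phi 4 Mini Instruct | 778 | ±5 | 4.1K | 3.2% | 7.4% | 40 tps | 1.1s | 128K | $0.07 | $0.30 |
| 10 | 274 | Moonshot V1 128k Vision | 779 | ±12 | 955 | 5.0% | 3.1% | 44 tps | 3.8s | 131K | $2.00 | $5.00 |
| 11 | 274 | MiniMax M2-her | 791 | ±11 | 1.1K | 2.2% | <0.1% | 108 tps | 0.7s | 205K | $0.30 | $1.20 |
| 12 | 281 | Goliath 120B | 794 | ±5 | 3.1K | 2.5% | 2.7% | 21 tps | 2.2s | 6K | $6.56 | $9.38 |
| 13 | 274 | C4AI Aya Expanse 8B | 798 | ±12 | 1.2K | 6.5% | 0.9% | 61 tps | 0.4s | 8K | $0.50 | $1.50 |
| 14 | 274 | DeepSeek-R1 Distill Qwen 32B | 807 | ±6 | 4K | 3.3% | 6.2% | 22 tps | 1.8s | 131K | $0.37 | $0.39 |
| 15 | 281 | MythoMax L2 13B | 814 | ±4 | 9.8K | 2.5% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |
| 16 | 274 | LFM2 8B A1B | 831 | ±7 | 2.3K | 6.7% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 |
| 17 | 281 | Gemma 2 9B | 831 | ±9 | 1.6K | 3.7% | <0.1% | 100 tps | 0.4s | 8K | $0.09 | $0.09 |
| 18 | 271 | Hermes 3 405B Instruct | 838 | ±3 | 6K | 1.8% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 |
| 19 | 265 | Qwen 2.5 VL 72B Instruct | 849 | ±10 | 2.6K | 6.3% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 20 | 274 | DeepHermes 3 Mistral 24B Preview | 850 | ±12 | 1.7K | 4.7% | 2.5% | 50 tps | 1.0s | 33K | $0.06 | $0.25 |
| 21 | 271 | Mistral Large | 851 | ±5 | 4.7K | 2.5% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 |
| 22 | 260 | Hermes 4 405B Reasoning FP8 | 862 | ±4 | 6.8K | 8.0% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 23 | 271 | Inflection 3 Pi | 862 | ±3 | 7.3K | 1.4% | 1.1% | 33 tps | 3.4s | 8K | $2.50 | $10.00 |
| 24 | 265 | LFM2 2.6B | 863 | ±5 | 2.2K | 6.3% | 6.7% | 184 tps | 0.4s | 33K | $0.01 | $0.02 |
| 25 | 246 | DeepSeek-R1 Distill Llama 70B | 874 | ±5 | 6.5K | 3.4% | 3.6% | 27 tps | 1.6s | 32K | $0.73 | $0.95 |
| 26 | 265 | Magistral Small 2509 | 876 | ±7 | 3.4K | 4.8% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 27 | 246 | Hermes 4 70B | 878 | ±9 | 1.3K | 4.3% | 1.1% | 67 tps | 0.6s | 131K | $0.12 | $0.39 |
| 28 | 260 | Open Mistral 7B | 879 | ±3 | 5.4K | 2.4% | 0.7% | 176 tps | 0.4s | 33K | $0.25 | $0.25 |
| 29 | 265 | Mixtral-8x7B Instruct v0.1 | 880 | ±5 | 5.5K | 2.4% | 1.3% | 54 tps | 0.4s | 33K | $0.60 | $0.60 |
| 30 | 260 | Mistral Small | 881 | ±3 | 4.8K | 2.5% | 1.7% | 142 tps | 0.6s | 32K | $0.43 | $1.30 |
| 31 | 265 | Inflection 3 Productivity | 883 | ±4 | 7K | 1.7% | 0.6% | 50 tps | 3.2s | 8K | $2.50 | $10.00 |
| 32 | 229 | Magistral Medium 2509 | 887 | ±5 | 6.2K | 6.2% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 33 | 256 | Phi 4 | 890 | ±3 | 8.4K | 1.8% | 5.1% | 28 tps | 1.3s | 128K | $0.10 | $0.32 |
| 34 | 240 | Moonshot V1 32k | 893 | ±5 | 4K | 1.2% | 1.4% | 53 tps | 1.4s | 33K | $1.00 | $3.00 |
| 35 | 256 | Mixtral 8x7B Instruct | 894 | ±6 | 6.2K | 2.0% | 0.2% | 79 tps | 0.7s | 33K | $0.23 | $0.31 |
| 36 | 240 | Hermes 4 405B FP8 | 897 | ±10 | 2.1K | 5.6% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 |
| 37 | 260 | Apriel 1.6 15B Thinker | 898 | ±11 | 1.1K | 2.1% | 2.6% | 92 tps | 0.4s | 131K | $0 | $0 |
| 38 | 265 | Ministral 3B 2512 | 900 | ±11 | 1.7K | 3.5% | 2.8% | 339 tps | 0.6s | 131K | $0.10 | $0.10 |
| 39 | 246 | Mixtral 8x22B Instruct | 900 | ±5 | 6K | 2.2% | 1.8% | 142 tps | 0.7s | 66K | $0.45 | $0.45 |
| 40 | 256 | Solar Mini 250422 | 901 | ±5 | 3.7K | 4.4% | 1.8% | 90 tps | 1.7s | 33K | $0.15 | $0.15 |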
View All (288 models)