Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

630
LFM2.5 1.2B Thinking
667
UI-TARS 1.5 7B
688
MiniMax M1
739
Hunyuan A13B Instruct
779
Moonshot V1 128k Vision
791
MiniMax M2-her
798
C4AI Aya Expanse 8B
807
DeepSeek-R1 Distill Qwen 32B
831
LFM2 8B A1B
849
Qwen 2.5 VL 72B Instruct
850
DeepHermes 3 Mistral 24B Preview
851
Mistral Large
862
Inflection 3 Pi
863
LFM2 2.6B
876
Magistral Small 2509

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1291LFM2.5 1.2B Thinking630±227054.7%2.6%258 tps0.4s33K$0$0
2289UI-TARS 1.5 7B667±181.4K8.7%4.0%75 tps0.9s128K$0.10$0.20
3284MiniMax M1688±47.8K4.0%<0.1%31 tps2.8s1M$0.55$2.20
4285Hunyuan A13B Instruct739±54.6K5.0%2.3%67 tps2.0s33K$0.01$0.01
5274Moonshot V1 128k Vision779±129555.0%3.1%44 tps3.8s131K$2.00$5.00
6274MiniMax M2-her791±111.1K2.2%<0.1%108 tps0.7s205K$0.30$1.20
7274C4AI Aya Expanse 8B798±121.2K6.5%0.9%61 tps0.4s8K$0.50$1.50
8274DeepSeek-R1 Distill Qwen 32B807±64K3.3%6.2%22 tps1.8s131K$0.37$0.39
9274LFM2 8B A1B831±72.3K6.7%<0.1%142 tps0.3s33K$0.01$0.02
10265Qwen 2.5 VL 72B Instruct849±102.6K6.3%5.3%25 tps3.7s128K$1.01$2.79
11274DeepHermes 3 Mistral 24B Preview850±121.7K4.7%2.5%50 tps1.0s33K$0.06$0.25
12271Mistral Large851±54.7K2.5%1.5%54 tps0.7s33K$2.00$6.00
13271Inflection 3 Pi862±37.3K1.4%1.1%33 tps3.4s8K$2.50$10.00
14265LFM2 2.6B863±52.2K6.3%6.7%184 tps0.4s33K$0.01$0.02
15265Magistral Small 2509876±73.4K4.8%2.7%116 tps0.6s131K$0.50$1.50
16246Hermes 4 70B878±91.3K4.3%1.1%67 tps0.6s131K$0.12$0.39
17260Open Mistral 7B879±35.4K2.4%0.7%176 tps0.4s33K$0.25$0.25
18265Inflection 3 Productivity883±47K1.7%0.6%50 tps3.2s8K$2.50$10.00
19229Magistral Medium 2509887±56.2K6.2%4.0%58 tps0.9s131K$2.00$5.00
20240Moonshot V1 32k893±54K1.2%1.4%53 tps1.4s33K$1.00$3.00
21240Hermes 4 405B FP8897±102.1K5.6%3.5%31 tps0.9s131K$0.52$1.73
22260Apriel 1.6 15B Thinker898±111.1K2.1%2.6%92 tps0.4s131K$0$0
23265Ministral 3B 2512900±111.7K3.5%2.8%339 tps0.6s131K$0.10$0.10
24256Solar Mini 250422901±53.7K4.4%1.8%90 tps1.7s33K$0.15$0.15
25214OpenAI o3-mini-high901±315.8K3.0%2.4%231 tps10.5s200K$1.10$4.40
26253GPT-4 Turbo903±137854.3%4.7%21 tps1.9s128K$10.00$30.00
27246Mixtral 8x22B903±55.2K2.1%1.2%140 tps0.6s64K$2.00$6.00
28235GLM 4 32B904±211.7K2.1%2.6%40 tps1.6s33K$0.14$0.14
29246WizardLM-2 8x22B911±29.8K1.1%11.6%11 tps2.5s66K$0.77$0.77
30240Moonshot V1 8k911±64K1.8%1.0%55 tps1.5s8K$0.20$2.00
31229Krutrim Spectre V2912±47.4K1.1%<0.1%33 tps3.1s4K$0.19$0.19
32214Moonshot V1 128k917±54.7K1.7%1.4%54 tps1.5s131K$2.00$5.00
33225GPT-3.5 Turbo 16k923±310.7K1.6%<0.1%22 tps0.6s16K$3.00$4.00
34240GPT-3.5 Turbo Instruct926±38K1.4%<0.1%46 tps1.2s4K$1.50$2.00
35229Moonshot V1 Auto928±64K1.6%1.2%54 tps1.5s8K$2.00$5.00
36229ERNIE 4.5 21B A3B Thinking930±62.7K3.6%1.8%87 tps1.5s120K$0.07$0.28
37201GPT-4o mini932±49K3.4%2.1%71 tps1.7s128K$0.15$0.60
38209Llama 3.3 Swallow 70B Instruct938±310.7K3.2%1.4%153 tps1.3s131K$0.13$0.39
39209Qwen 2.5 14B Instruct944±39K2.3%2.4%40 tps1.6s1M$0.40$1.61
40222Jamba 1.5 Large944±313.6K1.6%1.7%48 tps0.9s256K$1.50$6.00
View All (208 models)