Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

588
Hunyuan A13B Instruct
610
UI-TARS 1.5 7B
686
MiniMax M1
696
DeepSeek-R1 Distill Qwen 32B
706
DeepHermes 3 Mistral 24B Preview
719
Inflection 3 Pi
737
Inflection 3 Productivity
746
Qwen 2.5 VL 72B Instruct
762
Open Mistral 7B
770
Baichuan-M2-32B
781
Hermes 4 70B
785
Mistral Large
787
GPT-3.5 Turbo Instruct
793
Mercury Coder
797
Hermes 4 405B FP8

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1279Hunyuan A13B Instruct588±221.6K9.2%2.3%67 tps2.0s33K$0.01$0.01
2279UI-TARS 1.5 7B610±4053011.7%4.0%75 tps0.9s128K$0.10$0.20
3276MiniMax M1686±133.8K5.3%<0.1%31 tps2.8s1M$0.55$2.20
4276DeepSeek-R1 Distill Qwen 32B696±202K5.5%6.2%22 tps1.8s131K$0.37$0.39
5269DeepHermes 3 Mistral 24B Preview706±307155.9%2.5%50 tps1.0s33K$0.06$0.25
6269Inflection 3 Pi719±181.5K4.1%1.1%33 tps3.4s8K$2.50$10.00
7269Inflection 3 Productivity737±241.5K5.0%0.6%50 tps3.2s8K$2.50$10.00
8262Qwen 2.5 VL 72B Instruct746±202.1K6.0%5.3%25 tps3.7s128K$1.01$2.79
9262Open Mistral 7B762±181.3K4.7%0.7%176 tps0.4s33K$0.25$0.25
10262Baichuan-M2-32B770±3074010.8%<0.1%32 tps3.3s131K$0.07$0.07
11252Hermes 4 70B781±294608.9%1.1%67 tps0.6s131K$0.12$0.39
12252Mistral Large785±161.1K5.8%1.5%54 tps0.7s33K$2.00$6.00
13252GPT-3.5 Turbo Instruct787±92K2.7%<0.1%46 tps1.2s4K$1.50$2.00
14252Mercury Coder793±275103.8%<0.1%247 tps2.2s32K$0.25$1.00
15252Hermes 4 405B FP8797±218158.4%3.5%31 tps0.9s131K$0.52$1.73
16252WizardLM-2 8x22B801±121.9K3.1%11.6%11 tps2.5s66K$0.77$0.77
17252Magistral Small 2509802±181.8K7.5%2.7%116 tps0.6s131K$0.50$1.50
18240LFM2 8B A1B818±1882511.3%<0.1%142 tps0.3s33K$0.01$0.02
19240Moonshot V1 32k820±179503.1%1.4%53 tps1.4s33K$1.00$3.00
20240Krutrim 2825±102.3K2.3%12.5%33 tps2.1s128K$1.00$1.00
21240LFM2 2.6B826±2681010.0%6.7%184 tps0.4s33K$0.01$0.02
22240GLM 4.5 Flash834±375208.8%12.2%15 tps2.2s131K$0$0
23234ERNIE 4.5 21B A3B Thinking838±231.1K6.9%1.8%87 tps1.5s120K$0.07$0.28
24234GPT-3.5 Turbo 16k838±102.7K3.6%<0.1%22 tps0.6s16K$3.00$4.00
25234Jamba 1.5 Large851±92.9K4.0%1.7%48 tps0.9s256K$1.50$6.00
26210Ministral 3B 2512854±575158.0%2.8%339 tps0.6s131K$0.10$0.10
27210Qwen 2.5 14B Instruct861±162.4K5.7%2.4%40 tps1.6s1M$0.40$1.61
28210Moonshot V1 8k863±139155.2%1.0%55 tps1.5s8K$0.20$2.00
29210Gemma 3 12B867±112.5K4.9%4.2%73 tps0.8s131K$0.05$0.12
30210GLM 4 32B868±122.9K4.9%2.6%40 tps1.6s33K$0.14$0.14
31210Krutrim Spectre V2868±161.3K3.6%<0.1%33 tps3.1s4K$0.19$0.19
32210Mixtral 8x22B871±221.3K5.0%1.2%140 tps0.6s64K$2.00$6.00
33210GLM 4.7 Flash871±286104.7%5.8%61 tps2.8s128K$0.07$0.39
34210Solar Mini 250422874±171.3K5.9%1.8%90 tps1.7s33K$0.15$0.15
35210Mistral Medium 3875±234856.7%2.4%47 tps0.8s33K$0.40$2.00
36210DeepSeek R1T2 Chimera876±102.1K5.9%3.0%28 tps1.8s164K$0.13$0.45
37210Inception Mercury878±56.9K3.7%0.4%257 tps1.1s32K$0.25$1.00
38210Moonshot V1 128k879±191.1K4.6%1.4%54 tps1.5s131K$2.00$5.00
39210Qwen3 4B880±85.1K9.6%1.9%94 tps1.5s128K$0.01$0.01
40210Gemma 3n E4B882±76K4.5%2.0%30 tps0.5s8K$0.01$0.02
View All (210 models)