Models

Last updated about 1 month ago

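| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | 240 | GLM 4.5 Flash | 834 | ±37 | 520 | 8.8% | 12.2% | 15 tps | 2.2s | 131K | $0 | $0 |
| 242 | 240 | Mixtral-8x7B Instruct v0.1 | 832 | ±23 | 1.3K | 4.6% | 1.3% | 54 tps | 0.4s | 33K | $0.60 | $0.60 |
| 243 | 240 | Qwen 2.5 7B | 831 | ±17 | 2K | 5.1% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 |
| 244 | 240 | Sky T1 32B Preview | 829 | ±14 | 2.4K | 4.5% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 245 | 240 | LFM2 2.6B | 826 | ±26 | 810 | 10.0% | 6.7% | 184 tps | 0.4s | 33K | $0.01 | $0.02 |
| 246 | 240 | Krutrim 2 | 825 | ±10 | 2.3K | 2.3% | 12.5% | 33 tps | 2.1s | 128K | $1.00 | $1.00 |
| 247 | 240 | Ministral 8B | 825 | ±17 | 2.2K | 5.5% | 1.4% | 177 tps | 0.4s | 128K | $0.14 | $0.14 |
| 248 | 240 | C4AI Aya Expanse 32B | 821 | ±7 | 3.8K | 4.0% | 1.5% | 43 tps | 0.5s | 128K | $0.50 | $1.50 |
| 249 | 240 | Moonshot V1 32k | 820 | ±17 | 950 | 3.1% | 1.4% | 53 tps | 1.4s | 33K | $1.00 | $3.00 |
| 250 | 240 | LFM2 8B A1B | 818 | ±18 | 825 | 11.3% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 |
| 251 | 240 | Gemma 2 27B | 815 | ±17 | 1.5K | 4.1% | 1.4% | 44 tps | 1.4s | 8K | $0.80 | $0.80 |
| 252 | 252 | Ministral 3B | 806 | ±16 | 2.3K | 5.1% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 |
| 253 | 252 | Magistral Small 2509 | 802 | ±18 | 1.8K | 7.5% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 254 | 252 | Gemma 3 1B | 802 | ±11 | 2K | 6.1% | 0.6% | 176 tps | 1.0s | 33K | $0.06 | $0.10 |
| 255 | 252 | WizardLM-2 8x22B | 801 | ±12 | 1.9K | 3.1% | 11.6% | 11 tps | 2.5s | 66K | $0.77 | $0.77 |
| 256 | 252 | Phi 4 | 798 | ±16 | 1.7K | 3.4% | 5.1% | 28 tps | 1.3s | 128K | $0.10 | $0.32 |
| 257 | 252 | Hermes 4 405B FP8 | 797 | ±21 | 815 | 8.4% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 |
| 258 | 252 | Mercury Coder | 793 | ±27 | 510 | 3.8% | <0.1% | 247 tps | 2.2s | 32K | $0.25 | $1.00 |
| 259 | 252 | GPT-3.5 Turbo Instruct | 787 | ±9 | 2K | 2.7% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 |
| 260 | 252 | Mistral Large | 785 | ±16 | 1.1K | 5.8% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 |
| 261 | 252 | Hermes 4 70B | 781 | ±29 | 460 | 8.9% | 1.1% | 67 tps | 0.6s | 131K | $0.12 | $0.39 |
| 262 | 262 | Command R | 778 | ±18 | 2.2K | 4.9% | 5.8% | 54 tps | 0.6s | 128K | $0.30 | $0.99 |
| 263 | 262 | Baichuan-M2-32B | 770 | ±30 | 740 | 10.8% | <0.1% | 32 tps | 3.3s | 131K | $0.07 | $0.07 |
| 264 | 262 | Mistral Small | 770 | ±12 | 1.2K | 4.5% | 1.7% | 142 tps | 0.6s | 32K | $0.43 | $1.30 |
| 265 | 262 | Open Mistral 7B | 762 | ±18 | 1.3K | 4.7% | 0.7% | 176 tps | 0.4s | 33K | $0.25 | $0.25 |
| 266 | 262 | Hermes 4 405B Reasoning FP8 | 759 | ±11 | 2.7K | 12.8% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 267 | 262 | Goliath 120B | 754 | ±24 | 745 | 5.7% | 2.7% | 21 tps | 2.2s | 6K | $6.56 | $9.38 |
| 268 | 262 | Qwen 2.5 VL 72B Instruct | 746 | ±20 | 2.1K | 6.0% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 269 | 269 | Gemma 3 4B | 742 | ±10 | 3.3K | 4.7% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 |
| 270 | 269 | Mixtral 8x22B Instruct | 738 | ±17 | 1.4K | 5.6% | 1.8% | 142 tps | 0.7s | 66K | $0.45 | $0.45 |
| 271 | 269 | Command R+ | 738 | ±15 | 1.6K | 5.6% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 |
| 272 | 269 | Inflection 3 Productivity | 737 | ±24 | 1.5K | 5.0% | 0.6% | 50 tps | 3.2s | 8K | $2.50 | $10.00 |
| 273 | 269 | Pixtral 12B | 722 | ±21 | 3K | 6.3% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 274 | 269 | Inflection 3 Pi | 719 | ±18 | 1.5K | 4.1% | 1.1% | 33 tps | 3.4s | 8K | $2.50 | $10.00 |
| 275 | 269 | DeepHermes 3 Mistral 24B Preview | 706 | ±30 | 715 | 5.9% | 2.5% | 50 tps | 1.0s | 33K | $0.06 | $0.25 |
| 276 | 276 | Hermes 3 405B Instruct | 702 | ±20 | 1.4K | 4.1% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 |
| 277 | 276 | DeepSeek-R1 Distill Qwen 32B | 696 | ±20 | 2K | 5.5% | 6.2% | 22 tps | 1.8s | 131K | $0.37 | $0.39 |
| 278 | 276 | MiniMax M1 | 686 | ±13 | 3.8K | 5.3% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 |
| 279 | 279 | UI-TARS 1.5 7B | 610 | ±40 | 530 | 11.7% | 4.0% | 75 tps | 0.9s | 128K | $0.10 | $0.20 |
| 280 | 279 | MythoMax L2 13B | 600 | ±21 | 2.3K | 5.8% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |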
Rows 241 to 280 of 286 models are shown.