Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

851
Llama 3.3 70B Instruct Turbo
849
Command R 7B
846
Wikipedia
842
MAI-DS-R1
838
GPT-3.5 Turbo 16k
838
Cogito V2 671B
838
ERNIE 4.5 21B A3B Thinking
835
DeepSeek-R1 Distill Llama 70B
835
Typhoon 2 70B Instruct
834
GLM 4.5 Flash
833
OLMo 2 0425 1B Instruct
832
NVIDIA Llama 3.1 Nemotron Ultra 253B v1
832
Mixtral-8x7B Instruct v0.1
831
Qwen 2.5 7B
829
Sky T1 32B Preview

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
321234Llama 3.3 70B Instruct Turbo851±191.2K6.0%2.0%78 tps1.0s131K$0.88$0.88
322234Command R 7B849±153.3K4.8%1.1%76 tps0.4s128K$0.04$0.15
323312Wikipedia846±79.8K4.9%<0.1%47 tps2.1s32K$0$0
324324MAI-DS-R1842±73.5K11.7%<0.1%73 tps3.2s64K$0.10$0.40
325234GPT-3.5 Turbo 16k838±102.7K3.6%<0.1%22 tps0.6s16K$3.00$4.00
326324Cogito V2 671B838±171.6K5.9%<0.1%41 tps0.6s164K$1.25$1.25
327234ERNIE 4.5 21B A3B Thinking838±231.1K6.9%1.8%87 tps1.5s120K$0.07$0.28
328240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95
329324Typhoon 2 70B Instruct835±151.4K4.0%<0.1%19 tps0.1s8K$0.88$0.88
330240GLM 4.5 Flash834±375208.8%12.2%15 tps2.2s131K$0$0
331324OLMo 2 0425 1B Instruct833±215602.6%<0.1%68 tps0.0s4K$0$0
332324NVIDIA Llama 3.1 Nemotron Ultra 253B v1832±162.2K4.1%<0.1%40 tps0.8s128K$0.30$0.90
333240Mixtral-8x7B Instruct v0.1832±231.3K4.6%1.3%54 tps0.4s33K$0.60$0.60
334240Qwen 2.5 7B831±172K5.1%3.7%40 tps1.9s131K$0.08$0.27
335240Sky T1 32B Preview829±142.4K4.5%7.8%73 tps0.6s16K$0.12$0.18
336240LFM2 2.6B826±2681010.0%6.7%184 tps0.4s33K$0.01$0.02
337240Krutrim 2825±102.3K2.3%12.5%33 tps2.1s128K$1.00$1.00
338240Ministral 8B825±172.2K5.5%1.4%177 tps0.4s128K$0.14$0.14
339240C4AI Aya Expanse 32B821±73.8K4.0%1.5%43 tps0.5s128K$0.50$1.50
340240Moonshot V1 32k820±179503.1%1.4%53 tps1.4s33K$1.00$3.00
341240LFM2 8B A1B818±1882511.3%<0.1%142 tps0.3s33K$0.01$0.02
342240Gemma 2 27B815±171.5K4.1%1.4%44 tps1.4s8K$0.80$0.80
343337GLM 4.1V 9B Thinking813±161.1K4.2%<0.1%69 tps1.3s66K$0.04$0.14
344252Ministral 3B806±162.3K5.1%0.8%248 tps0.4s131K$0.08$0.08
345337Qwen 2 72B Instruct805±191K3.3%<0.1%3 tpsN/A33K$0.90$0.90
346346Magistral Medium (Thinking)804±102.2K5.7%<0.1%67 tps0.8s41K$2.00$5.00
347252Magistral Small 2509802±181.8K7.5%2.7%116 tps0.6s131K$0.50$1.50
348252Gemma 3 1B802±112K6.1%0.6%176 tps1.0s33K$0.06$0.10
349252WizardLM-2 8x22B801±121.9K3.1%11.6%11 tps2.5s66K$0.77$0.77
350252Phi 4798±161.7K3.4%5.1%28 tps1.3s128K$0.10$0.32
351252Hermes 4 405B FP8797±218158.4%3.5%31 tps0.9s131K$0.52$1.73
352346Magistral Medium 2507795±2566514.2%<0.1%86 tps0.7s41K$2.00$5.00
353252Mercury Coder793±275103.8%<0.1%247 tps2.2s32K$0.25$1.00
354252GPT-3.5 Turbo Instruct787±92K2.7%<0.1%46 tps1.2s4K$1.50$2.00
355252Mistral Large785±161.1K5.8%1.5%54 tps0.7s33K$2.00$6.00
356252Hermes 4 70B781±294608.9%1.1%67 tps0.6s131K$0.12$0.39
357262Command R778±182.2K4.9%5.8%54 tps0.6s128K$0.30$0.99
358262Baichuan-M2-32B770±3074010.8%<0.1%32 tps3.3s131K$0.07$0.07
359262Mistral Small770±121.2K4.5%1.7%142 tps0.6s32K$0.43$1.30
360354OLMo 3 7B Think763±217707.8%4.2%77 tps0.4s66K$0.12$0.20
View All (404 models)