Models

Last updated about 1 month ago

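| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41 | 210 | Magistral Medium 2509 | 883 | ±16 | 2.6K | 9.5% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 42 | 201 | Amazon Nova Pro 1.0 | 894 | ±10 | 5.7K | 4.0% | 0.9% | 96 tps | 0.7s | 300K | $0.80 | $1.70 |
| 43 | 201 | Moonshot V1 Auto | 899 | ±22 | 930 | 4.1% | 1.2% | 54 tps | 1.5s | 8K | $2.00 | $5.00 |
| 44 | 201 | GPT-4o mini | 900 | ±14 | 2.8K | 5.3% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 45 | 201 | Mistral Small 3.2 24B Instruct | 903 | ±20 | 850 | 8.6% | 1.9% | 113 tps | 1.1s | 131K | $0.02 | $0.08 |
| 46 | 201 | Llama 3 8B | 904 | ±10 | 3.2K | 3.5% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 47 | 201 | GPT-3.5 Turbo | 908 | ±15 | 1.2K | 2.5% | 1.3% | 74 tps | 0.9s | 16K | $0.75 | $1.75 |
| 48 | 201 | Magistral Small 2506 | 908 | ±11 | 4.3K | 3.1% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 |
| 49 | 189 | Inception Mercury Coder Small Beta | 912 | ±20 | 610 | 3.2% | 1.7% | 270 tps | 1.4s | 32K | $0.25 | $1.00 |
| 50 | 189 | Seed 1.6 Flash 250715 | 913 | ±16 | 1.2K | 5.7% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 51 | 189 | Rnj-1 Instruct | 915 | ±21 | 970 | 6.7% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 |
| 52 | 189 | Devstral Small | 916 | ±17 | 1.4K | 5.0% | 2.4% | 180 tps | 0.6s | 131K | $0.10 | $0.30 |
| 53 | 189 | Jamba 1.7 Large | 917 | ±16 | 975 | 8.0% | 1.3% | 58 tps | 1.0s | 256K | $1.33 | $5.33 |
| 54 | 189 | Llama 3.3 Swallow 70B Instruct | 919 | ±8 | 3.5K | 5.5% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 55 | 189 | Jamba 1.6 Large | 924 | ±9 | 3.3K | 3.8% | 2.0% | 59 tps | 1.2s | 256K | $1.33 | $5.33 |
| 56 | 189 | Grok 3 Mini | 927 | ±6 | 9.9K | 6.4% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 57 | 189 | Llama 3.3 70B | 933 | ±10 | 3K | 6.6% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 58 | 179 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 941 | ±19 | 1.4K | 6.9% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 59 | 179 | ERNIE 4.5 VL 424B A47B | 942 | ±18 | 725 | 6.5% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 |
| 60 | 179 | ERNIE 4.5 21B A3B | 943 | ±28 | 540 | 6.9% | 2.3% | 78 tps | 1.5s | 120K | $0.05 | $0.19 |
| 61 | 179 | Grok 3 Mini Fast | 943 | ±7 | 9K | 7.0% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 62 | 179 | Ministral 14B 3.0 | 948 | ±16 | 805 | 8.5% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 63 | 179 | Qwen3 8B | 948 | ±9 | 4.2K | 8.2% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 64 | 179 | Switchpoint Router | 949 | ±10 | 2.7K | 3.6% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 65 | 179 | Qwen3 30B A3B Thinking 2507 | 953 | ±10 | 3.5K | 4.7% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 66 | 167 | Qwen 2.5 72B | 960 | ±15 | 1.4K | 4.5% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 67 | 167 | Devstral Medium | 962 | ±11 | 3.5K | 5.2% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 68 | 167 | Llama 4 Scout | 965 | ±5 | 17.5K | 5.3% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 69 | 167 | Mistral Small 3.2 24B | 970 | ±13 | 4.6K | 4.9% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 70 | 167 | Nemotron 3 Nano | 974 | ±46 | 580 | 6.5% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 71 | 167 | DeepSeek V3.1 Thinking | 976 | ±9 | 5.2K | 9.5% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 72 | 159 | Seed 2.0 Mini (Medium) | 981 | ±35 | 570 | 5.8% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 73 | 159 | Kimi K2 0711 | 981 | ±6 | 7K | 4.5% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 74 | 159 | GLM 4.6V | 986 | ±8 | 3K | 5.5% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 75 | 159 | Grok Code Fast 1 | 987 | ±9 | 2.5K | 6.0% | 5.9% | 294 tps | 0.5s | 256K | $0.20 | $1.50 |
| 76 | 159 | OpenAI o3-mini-low | 988 | ±6 | 12.2K | 6.4% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 77 | 159 | GPT-5 Nano | 989 | ±6 | 4.6K | 8.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 78 | 148 | Seed 1.6 250615 | 995 | ±21 | 1.6K | 6.0% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 79 | 148 | OpenAI o4-mini-high | 995 | ±7 | 13.6K | 6.2% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 80 | 148 | OpenAI o3-mini | 999 | ±4 | 15K | 5.5% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |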
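
The score and confidence interval columns can be read together as a range. The sketch below is one rough way to compare two rows, assuming the interval is symmetric around the score (the page does not say how it is computed). The helper names `interval` and `overlaps` are hypothetical, and an overlap check is only an informal reading, not a significance test.

```python
def interval(score, half_width):
    """Return the (low, high) range implied by score ± half_width.

    Assumption: the interval is symmetric around the score.
    """
    return (score - half_width, score + half_width)


def overlaps(a, b):
    """True when two closed (low, high) ranges share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


# Score and ± values copied from the leaderboard rows above.
gpt_4o_mini = interval(900, 14)  # (886, 914)
llama_3_8b = interval(904, 10)   # (894, 914)
o3_mini = interval(999, 4)       # (995, 1003)

print(overlaps(gpt_4o_mini, llama_3_8b))  # True
print(overlaps(gpt_4o_mini, o3_mini))     # False
```

Under this reading, GPT-4o mini and Llama 3 8B are not clearly separated by score, while OpenAI o3-mini sits well above both.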
View All (210 models)