Models

Last updated about 1 month ago

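| Rank | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 81 | Moonshot V1 Auto | 899 | ±22 | 930 | 4.1% | 1.2% | 54 tps | 1.5s | 8K | $2.00 | $5.00 |
| 82 | GPT-4o mini | 900 | ±14 | 2.8K | 5.3% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 83 | Mistral Small 3.2 24B Instruct | 903 | ±20 | 850 | 8.6% | 1.9% | 113 tps | 1.1s | 131K | $0.02 | $0.08 |
| 84 | Llama 3 8B | 904 | ±10 | 3.2K | 3.5% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 85 | GPT-3.5 Turbo | 908 | ±15 | 1.2K | 2.5% | 1.3% | 74 tps | 0.9s | 16K | $0.75 | $1.75 |
| 86 | Magistral Small 2506 | 908 | ±11 | 4.3K | 3.1% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 |
| 87 | Inception Mercury Coder Small Beta | 912 | ±20 | 610 | 3.2% | 1.7% | 270 tps | 1.4s | 32K | $0.25 | $1.00 |
| 88 | Seed 1.6 Flash 250715 | 913 | ±16 | 1.2K | 5.7% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 89 | Rnj-1 Instruct | 915 | ±21 | 970 | 6.7% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 |
| 90 | Devstral Small | 916 | ±17 | 1.4K | 5.0% | 2.4% | 180 tps | 0.6s | 131K | $0.10 | $0.30 |
| 91 | Jamba 1.7 Large | 917 | ±16 | 975 | 8.0% | 1.3% | 58 tps | 1.0s | 256K | $1.33 | $5.33 |
| 92 | Open Mistral Nemo | 918 | ±21 | 1.7K | 4.5% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 |
| 93 | Llama 3.3 Swallow 70B Instruct | 919 | ±8 | 3.5K | 5.5% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 94 | Jamba 1.6 Large | 924 | ±9 | 3.3K | 3.8% | 2.0% | 59 tps | 1.2s | 256K | $1.33 | $5.33 |
| 95 | Mistral Small 3.1 | 925 | ±14 | 2.4K | 4.0% | 7.4% | 13 tps | 2.6s | 32K | $0.17 | $0.28 |
| 96 | Grok 3 Mini | 927 | ±6 | 9.9K | 6.4% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 97 | Codestral | 931 | ±19 | 855 | 6.0% | 5.2% | 151 tps | 0.9s | 262K | $0.15 | $0.45 |
| 98 | Llama 3.3 70B | 933 | ±10 | 3K | 6.6% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 99 | DeepSeek Prover v2 | 935 | ±10 | 1.3K | 3.0% | 5.2% | 14 tps | 1.3s | 164K | $0.40 | $1.56 |
| 100 | DeepSeek-R1 | 939 | ±6 | 6.4K | 4.3% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 101 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 941 | ±19 | 1.4K | 6.9% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 102 | ERNIE 4.5 VL 424B A47B | 942 | ±18 | 725 | 6.5% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 |
| 103 | ERNIE 4.5 21B A3B | 943 | ±28 | 540 | 6.9% | 2.3% | 78 tps | 1.5s | 120K | $0.05 | $0.19 |
| 104 | Grok 3 Mini Fast | 943 | ±7 | 9K | 7.0% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 105 | Ministral 14B 3.0 | 948 | ±16 | 805 | 8.5% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 106 | Qwen3 8B | 948 | ±9 | 4.2K | 8.2% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 107 | Switchpoint Router | 949 | ±10 | 2.7K | 3.6% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 108 | Qwen3 30B A3B Thinking 2507 | 953 | ±10 | 3.5K | 4.7% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 109 | Llama 3.1 8B Turbo | 958 | ±14 | 2.2K | 2.0% | 2.1% | 650 tps | 0.5s | 128K | $0.13 | $0.14 |
| 110 | Qwen 2.5 72B | 960 | ±15 | 1.4K | 4.5% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 111 | Qwen3 14B | 962 | ±8 | 5.3K | 8.4% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 112 | Devstral Medium | 962 | ±11 | 3.5K | 5.2% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 113 | Llama 4 Scout | 965 | ±5 | 17.5K | 5.3% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 114 | Qwen3 VL 30B A3B Thinking | 967 | ±11 | 1.9K | 8.9% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 115 | Pixtral Large | 969 | ±14 | 3.5K | 3.9% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 116 | Mistral Small 3.2 24B | 970 | ±13 | 4.6K | 4.9% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 117 | Llama 4 Maverick | 971 | ±5 | 21K | 5.0% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 118 | Qwen 2.5 32B Instruct | 972 | ±7 | 4.1K | 6.5% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 119 | Nemotron 3 Nano | 974 | ±46 | 580 | 6.5% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 120 | DeepSeek V3.1 Thinking | 976 | ±9 | 5.2K | 9.5% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |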
View All (286 models)
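
Many neighbouring rows have scores that differ by less than their confidence intervals, so a small score gap may not indicate a real difference. The sketch below shows one rough way to check this, using three rows from the leaderboard above. It assumes the Confidence Interval column is a symmetric plus-or-minus range around the VIBE Score, which the page does not state explicitly. The `Entry` class and `intervals_overlap` helper are hypothetical names introduced for illustration, and the overlap test is a rule of thumb, not Yupp's own ranking method.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row, reduced to the fields needed here."""
    name: str
    score: int  # VIBE Score column
    ci: int     # Confidence Interval column, assumed to mean score +/- ci

    @property
    def low(self) -> int:
        return self.score - self.ci

    @property
    def high(self) -> int:
        return self.score + self.ci


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score ranges share at least one point."""
    return a.low <= b.high and b.low <= a.high


# Values copied from the leaderboard rows above.
moonshot = Entry("Moonshot V1 Auto", 899, 22)
gpt4o_mini = Entry("GPT-4o mini", 900, 14)
maverick = Entry("Llama 4 Maverick", 971, 5)

for a, b in [(moonshot, gpt4o_mini), (gpt4o_mini, maverick)]:
    verdict = "overlap" if intervals_overlap(a, b) else "do not overlap"
    print(f"{a.name} [{a.low}, {a.high}] and {b.name} [{b.low}, {b.high}] {verdict}")
```

Under this reading, Moonshot V1 Auto and GPT-4o mini are not clearly separated, while GPT-4o mini and Llama 4 Maverick are.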