Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

908
Magistral Small 2506
908
GPT-3.5 Turbo
904
Llama 3 8B
903
Mistral Small 3.2 24B Instruct
900
GPT-4o mini
899
Moonshot V1 Auto
894
Amazon Nova Pro 1.0
892
GLM 4.6V Flash
885
Llama 3.2 11B Instruct
883
Magistral Medium 2509
882
Gemma 3n E4B
880
Qwen3 4B
880
Mistral Small 3 24B Instruct
879
Moonshot V1 128k
878
Inception Mercury

Last updated about 1 month ago

RankNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
201Magistral Small 2506908±114.3K3.1%1.6%156 tps0.5s40K$0.37$1.10
202GPT-3.5 Turbo908±151.2K2.5%1.3%74 tps0.9s16K$0.75$1.75
203Llama 3 8B904±103.2K3.5%6.0%85 tps0.7s8K$0.12$0.16
204Mistral Small 3.2 24B Instruct903±208508.6%1.9%113 tps1.1s131K$0.02$0.08
205GPT-4o mini900±142.8K5.3%2.1%71 tps1.7s128K$0.15$0.60
206Moonshot V1 Auto899±229304.1%1.2%54 tps1.5s8K$2.00$5.00
207Amazon Nova Pro 1.0894±105.7K4.0%0.9%96 tps0.7s300K$0.80$1.70
208GLM 4.6V Flash892±102.5K7.6%3.7%64 tps2.1s128K$0.04$0.40
209Llama 3.2 11B Instruct885±152.1K4.1%1.5%152 tps0.5s8K$0.16$0.16
210Magistral Medium 2509883±162.6K9.5%4.0%58 tps0.9s131K$2.00$5.00
211Gemma 3n E4B882±76K4.5%2.0%30 tps0.5s8K$0.01$0.02
212Qwen3 4B880±85.1K9.6%1.9%94 tps1.5s128K$0.01$0.01
213Mistral Small 3 24B Instruct880±101.7K3.6%2.6%77 tps0.6s33K$0.07$0.14
214Moonshot V1 128k879±191.1K4.6%1.4%54 tps1.5s131K$2.00$5.00
215Inception Mercury878±56.9K3.7%0.4%257 tps1.1s32K$0.25$1.00
216DeepSeek R1T2 Chimera876±102.1K5.9%3.0%28 tps1.8s164K$0.13$0.45
217Mistral Medium 3875±234856.7%2.4%47 tps0.8s33K$0.40$2.00
218Mistral Nemo875±159152.7%<0.1%112 tps0.4s131K$0.07$0.13
219Solar Mini 250422874±171.3K5.9%1.8%90 tps1.7s33K$0.15$0.15
220GLM 4.7 Flash871±286104.7%5.8%61 tps2.8s128K$0.07$0.39
221Mixtral 8x22B871±221.3K5.0%1.2%140 tps0.6s64K$2.00$6.00
222Qwen 2.5 7B Turbo870±256156.1%0.5%125 tps0.4s131K$0.30$0.30
223Krutrim Spectre V2868±161.3K3.6%<0.1%33 tps3.1s4K$0.19$0.19
224GLM 4 32B868±122.9K4.9%2.6%40 tps1.6s33K$0.14$0.14
225Gemma 3 12B867±112.5K4.9%4.2%73 tps0.8s131K$0.05$0.12
226Hermes 2 Pro Llama 3 8B864±211.8K2.5%<0.1%76 tps1.0s131K$0.08$0.09
227Mistral Small 24B Instruct864±161.5K4.1%1.5%84 tps0.4s33K$0.80$0.80
228Moonshot V1 8k863±139155.2%1.0%55 tps1.5s8K$0.20$2.00
229Qwen 2.5 14B Instruct861±162.4K5.7%2.4%40 tps1.6s1M$0.40$1.61
230Gemma 3 27B856±271.1K6.9%1.8%35 tps1.1s66K$0.06$0.10
231Mixtral 8x7B855±181.3K5.1%2.2%142 tps0.6s33K$0.23$0.23
232Ministral 3B 2512854±575158.0%2.8%339 tps0.6s131K$0.10$0.10
233Mixtral 8x7B Instruct854±161.4K4.4%0.2%79 tps0.7s33K$0.23$0.31
234Gemma 3 27B IT853±102.3K3.9%2.0%60 tps0.8s128K$0.17$0.29
235Jamba 1.5 Large851±92.9K4.0%1.7%48 tps0.9s256K$1.50$6.00
236Llama 3.3 70B Instruct Turbo851±191.2K6.0%2.0%78 tps1.0s131K$0.88$0.88
237Command R 7B849±153.3K4.8%1.1%76 tps0.4s128K$0.04$0.15
238GPT-3.5 Turbo 16k838±102.7K3.6%<0.1%22 tps0.6s16K$3.00$4.00
239ERNIE 4.5 21B A3B Thinking838±231.1K6.9%1.8%87 tps1.5s120K$0.07$0.28
240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95
View All (286 models)