[Chart: VIBE Score by model for the first 15 leaderboard entries, from Qwen 2.5 VL 3B Instruct (546) to DeepSeek V3.2 Speciale (830).]

Last updated about 1 month ago

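| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 288 | Qwen 2.5 VL 3B Instruct | 546 | ±31 | 1K | 9.1% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |
| 2 | 186 | GLM 4.6V Flash | 726 | ±29 | 575 | 2.5% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 |
| 3 | 186 | Grok 3 Mini | 739 | ±20 | 1.4K | 2.5% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 4 | 274 | Pixtral 12B | 744 | ±33 | 940 | 9.6% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 5 | 194 | Llama 3.3 70B | 745 | ±30 | 525 | 4.5% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 6 | 165 | Pixtral Large | 778 | ±18 | 1.1K | 7.2% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 7 | 186 | Gemma 3n E4B | 781 | ±27 | 535 | 4.5% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 |
| 8 | 265 | Magistral Small 2509 | 790 | ±29 | 530 | 6.2% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 9 | 229 | Magistral Medium 2509 | 797 | ±17 | 570 | 5.0% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 10 | 265 | Qwen 2.5 VL 72B Instruct | 804 | ±29 | 715 | 6.5% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 11 | 148 | Qwen3 30B A3B Thinking 2507 | 818 | ±18 | 795 | 3.0% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 12 | 86 | Nemotron 3 Nano (Thinking) | 821 | ±23 | 540 | 4.4% | 2.0% | 200 tps | 0.5s | 256K | $0 | $0 |
| 13 | 201 | GPT-4o mini | 826 | ±18 | 645 | 6.5% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 14 | 161 | Qwen3 8B | 827 | ±36 | 600 | 4.0% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 15 | 133 | DeepSeek V3.2 Speciale | 830 | ±28 | 540 | 3.6% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 16 | 186 | Grok 3 Mini Fast | 832 | ±23 | 1K | 3.3% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 17 | 84 | GPT-5 Mini Minimal | 835 | ±13 | 1.1K | 6.6% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 18 | 175 | OpenAI o3-mini-low | 838 | ±21 | 1.7K | 2.6% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 19 | 157 | GPT-5 Nano | 843 | ±14 | 2K | 6.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 20 | 157 | Qwen3 Next 80B A3B Thinking | 846 | ±15 | 1.3K | 3.9% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 21 | 121 | Qwen3 32B Fast | 853 | ±13 | 1.8K | 4.2% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 22 | 133 | Qwen3 14B | 856 | ±24 | 745 | 2.6% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 23 | 170 | Kimi K2 0711 | 858 | ±24 | 890 | 4.3% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 24 | 139 | GLM 4.6V | 865 | ±24 | 890 | 2.7% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 25 | 129 | Qwen3 Max Thinking | 866 | ±14 | 1.5K | 1.7% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 26 | 214 | OpenAI o3-mini-high | 868 | ±13 | 1.4K | 3.8% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 27 | 246 | DeepSeek-R1 Distill Llama 70B | 869 | ±28 | 590 | 4.8% | 3.6% | 27 tps | 1.6s | 32K | $0.73 | $0.95 |
| 28 | 179 | GLM 4.7 Flash | 874 | ±24 | 855 | 2.8% | 5.8% | 61 tps | 2.8s | 128K | $0.07 | $0.39 |
| 29 | 160 | Llama 4 Scout | 875 | ±15 | 2.3K | 2.9% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 30 | 165 | Qwen3 4B | 878 | ±23 | 735 | 3.9% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 |
| 31 | 161 | Llama 4 Maverick | 883 | ±12 | 3.6K | 4.4% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 32 | 148 | DeepSeek-R1 | 894 | ±12 | 1.1K | 3.8% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 33 | 86 | Qwen3 235B A22B | 898 | ±23 | 725 | 3.3% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 34 | 139 | Seed 2.0 Mini (Medium) | 900 | ±30 | 515 | 3.7% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 35 | 177 | OpenAI o3-mini | 901 | ±12 | 2.5K | 3.1% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 36 | 62 | MiniMax M2 | 905 | ±18 | 1.4K | 3.5% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 37 | 121 | QwQ 32B | 910 | ±13 | 1.8K | 3.1% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 38 | 143 | Gemini 2.0 Flash Lite | 917 | ±11 | 2.5K | 6.7% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 39 | 126 | Qwen3 30B A3B | 918 | ±15 | 865 | 3.4% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 40 | 119 | ERNIE 4.5 300B A47B | 918 | ±17 | 1.6K | 2.7% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |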
Showing the first 40 of 154 models.