Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

546
Qwen 2.5 VL 3B Instruct
726
GLM 4.6V Flash
744
Pixtral 12B
778
Pixtral Large
821
Nemotron 3 Nano (Thinking)
853
Qwen3 32B Fast
856
Qwen3 14B
869
DeepSeek-R1 Distill Llama 70B
883
Llama 4 Maverick
894
DeepSeek-R1
898
Qwen3 235B A22B
910
QwQ 32B
918
Qwen3 30B A3B
947
Mistral Large 3
950
gpt-oss-20b

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1288Qwen 2.5 VL 3B Instruct546±311K9.1%3.0%44 tps2.5s128K$0.21$0.63
2186GLM 4.6V Flash726±295752.5%3.7%64 tps2.1s128K$0.04$0.40
3274Pixtral 12B744±339409.6%2.2%101 tps1.2s131K$0.08$0.08
4165Pixtral Large778±181.1K7.2%2.5%57 tps1.3s128K$1.50$4.50
586Nemotron 3 Nano (Thinking)821±235404.4%2.0%200 tps0.5s256K$0$0
6121Qwen3 32B Fast853±131.8K4.2%11.6%30 tps3.1s41K$0.10$0.25
7133Qwen3 14B856±247452.6%1.7%109 tps0.8s41K$0.04$0.15
8246DeepSeek-R1 Distill Llama 70B869±285904.8%3.6%27 tps1.6s32K$0.73$0.95
9161Llama 4 Maverick883±123.6K4.4%1.2%88 tps2.4s1M$0.23$0.83
10148DeepSeek-R1894±121.1K3.8%0.8%133 tps0.6s64K$0.91$3.07
1186Qwen3 235B A22B898±237253.3%5.3%71 tps0.9s41K$0.23$0.63
12121QwQ 32B910±131.8K3.1%5.4%41 tps2.1s16K$0.43$0.56
13126Qwen3 30B A3B918±158653.4%5.1%163 tps1.0s41K$0.06$0.21
1465Mistral Large 3947±201.3K4.4%2.1%51 tps1.0s256K$0.50$1.50
15101gpt-oss-20b950±181.4K4.7%0.5%216 tps0.5s131K$0.06$0.26
16126DeepSeek V3960±73.4K2.3%0.9%69 tps1.1s64K$0.59$1.49
17129Command A965±83K2.9%2.2%42 tps0.8s256K$2.00$7.33
18113Kimi K2 Fast975±104.8K2.3%0.8%365 tps0.5s131K$1.00$3.00
19106DeepSeek V3.1 Terminus Thinking1000±147452.6%5.9%27 tps1.8s131K$0.56$1.68
20133DeepSeek-R1 05281001±151.1K4.1%1.3%93 tps0.5s64K$1.60$3.67
2195DeepSeek-R1 Turbo1009±206603.6%2.6%29 tps1.8s64K$2.85$4.75
2256DeepSeek V3.2 Thinking1021±131.9K1.8%9.0%30 tps2.6s131K$0.28$0.42
2379MiniMax M2.5 Lightning1031±208201.8%1.5%51 tps2.0s205K$0.60$2.40
2495DeepSeek V3.2 Exp Thinking1038±176553.7%7.2%26 tps3.0s131K$0.28$0.42
2581Qwen3.5 27B1056±176652.9%3.7%55 tps2.6s256K$0.30$2.40
2652Qwen3.5 122B A17B1063±149801.5%1.5%82 tps1.4s256K$0.40$3.20
2748Step 3.5 Flash1067±206302.3%2.2%109 tps0.6s256K$0.05$0.15
2865DeepSeek V3.2 Exp Chat1072±127552.6%2.6%29 tps1.5s131K$0.27$0.39
2944Kimi K2 Thinking Turbo1072±171.3K2.2%2.0%75 tps1.4s262K$1.15$8.00
3048gpt-oss-120b1074±63K2.6%0.7%213 tps0.5s131K$0.11$0.50
3133Qwen3 Next 80B A3B Instruct1083±161.5K2.6%0.6%84 tps1.1s256K$0.20$1.42
3233Kimi K2.51090±134.3K2.1%6.5%33 tps1.7s262K$0.34$2.57
3337Kimi K2.5 Instant1093±121.4K2.7%2.9%32 tps3.0s262K$0.50$3.00