Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

[Chart: VIBE Score for the top 15 models, from Kimi K2.5 Instant (1093) to DeepSeek V3.1 Terminus Thinking (1000)]

Last updated about 1 month ago

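| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 37 | Kimi K2.5 Instant | 1093 | ±12 | 1.4K | 2.7% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 2 | 33 | Kimi K2.5 | 1090 | ±13 | 4.3K | 2.1% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 3 | 33 | Qwen3 Next 80B A3B Instruct | 1083 | ±16 | 1.5K | 2.6% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 4 | 48 | gpt-oss-120b | 1074 | ±6 | 3K | 2.6% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 5 | 44 | Kimi K2 Thinking Turbo | 1072 | ±17 | 1.3K | 2.2% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 6 | 65 | DeepSeek V3.2 Exp Chat | 1072 | ±12 | 755 | 2.6% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 7 | 48 | Step 3.5 Flash | 1067 | ±20 | 630 | 2.3% | 2.2% | 109 tps | 0.6s | 256K | $0.05 | $0.15 |
| 8 | 52 | Qwen3.5 122B A17B | 1063 | ±14 | 980 | 1.5% | 1.5% | 82 tps | 1.4s | 256K | $0.40 | $3.20 |
| 9 | 81 | Qwen3.5 27B | 1056 | ±17 | 665 | 2.9% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 10 | 95 | DeepSeek V3.2 Exp Thinking | 1038 | ±17 | 655 | 3.7% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 11 | 79 | MiniMax M2.5 Lightning | 1031 | ±20 | 820 | 1.8% | 1.5% | 51 tps | 2.0s | 205K | $0.60 | $2.40 |
| 12 | 56 | DeepSeek V3.2 Thinking | 1021 | ±13 | 1.9K | 1.8% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 13 | 95 | DeepSeek-R1 Turbo | 1009 | ±20 | 660 | 3.6% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 14 | 133 | DeepSeek-R1 0528 | 1001 | ±15 | 1.1K | 4.1% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 15 | 106 | DeepSeek V3.1 Terminus Thinking | 1000 | ±14 | 745 | 2.6% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 16 | 113 | Kimi K2 Fast | 975 | ±10 | 4.8K | 2.3% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 17 | 129 | Command A | 965 | ±8 | 3K | 2.9% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 18 | 126 | DeepSeek V3 | 960 | ±7 | 3.4K | 2.3% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 19 | 101 | gpt-oss-20b | 950 | ±18 | 1.4K | 4.7% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 20 | 65 | Mistral Large 3 | 947 | ±20 | 1.3K | 4.4% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 21 | 126 | Qwen3 30B A3B | 918 | ±15 | 865 | 3.4% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 22 | 121 | QwQ 32B | 910 | ±13 | 1.8K | 3.1% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 23 | 86 | Qwen3 235B A22B | 898 | ±23 | 725 | 3.3% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 24 | 148 | DeepSeek-R1 | 894 | ±12 | 1.1K | 3.8% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 25 | 161 | Llama 4 Maverick | 883 | ±12 | 3.6K | 4.4% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 26 | 246 | DeepSeek-R1 Distill Llama 70B | 869 | ±28 | 590 | 4.8% | 3.6% | 27 tps | 1.6s | 32K | $0.73 | $0.95 |
| 27 | 133 | Qwen3 14B | 856 | ±24 | 745 | 2.6% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 28 | 121 | Qwen3 32B Fast | 853 | ±13 | 1.8K | 4.2% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 29 | 86 | Nemotron 3 Nano (Thinking) | 821 | ±23 | 540 | 4.4% | 2.0% | 200 tps | 0.5s | 256K | $0 | $0 |
| 30 | 165 | Pixtral Large | 778 | ±18 | 1.1K | 7.2% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 31 | 274 | Pixtral 12B | 744 | ±33 | 940 | 9.6% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 32 | 186 | GLM 4.6V Flash | 726 | ±29 | 575 | 2.5% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 |
| 33 | 288 | Qwen 2.5 VL 3B Instruct | 546 | ±31 | 1K | 9.1% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |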
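
Several adjacent models have scores that differ by less than their confidence intervals, so the rank order alone may overstate the gap between them. The sketch below is one rough way to check this in Python. It assumes the "±" value is a symmetric half-width around the VIBE Score, which the page does not state. The `Entry` class and the helper names are hypothetical, and the four rows are copied from the leaderboard for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    name: str
    score: int       # VIBE Score as listed on the leaderboard
    half_width: int  # the "±" value; ASSUMED to be a symmetric half-width

    @property
    def low(self) -> int:
        return self.score - self.half_width

    @property
    def high(self) -> int:
        return self.score + self.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """True when the two [low, high] ranges share at least one point."""
    return a.low <= b.high and b.low <= a.high


def tied_with_leader(entries: list[Entry]) -> list[str]:
    """Names of entries whose interval overlaps the top scorer's interval."""
    leader = max(entries, key=lambda e: e.score)
    return [
        e.name
        for e in entries
        if e is not leader and intervals_overlap(leader, e)
    ]


# A few rows copied from the leaderboard.
entries = [
    Entry("Kimi K2.5 Instant", 1093, 12),
    Entry("Kimi K2.5", 1090, 13),
    Entry("gpt-oss-120b", 1074, 6),
    Entry("Kimi K2 Fast", 975, 10),
]

print(tied_with_leader(entries))
```

Under that assumption, Kimi K2.5 (1077 to 1103) overlaps the leader's range (1081 to 1105), while gpt-oss-120b (1068 to 1080) does not. Overlapping intervals are only a quick heuristic and not a formal significance test.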