Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1249
gpt-oss-120b
1198
Qwen3 Next 80B A3B Instruct
1150
gpt-oss-20b
1147
Qwen3.5 122B A17B
1140
Kimi K2.5 Instant
1140
Step 3.5 Flash
1098
Qwen3 32B Fast
1096
DeepSeek V3.2 Thinking
1091
QwQ 32B
1090
Kimi K2.5
1089
Nemotron 3 Nano (Thinking)
1075
DeepSeek-R1 Turbo
1065
Kimi K2 Thinking Turbo
1064
Mistral Large 3
1051
Qwen3 30B A3B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
148gpt-oss-120b1249±47.3K1.3%0.7%213 tps0.5s131K$0.11$0.50
233Qwen3 Next 80B A3B Instruct1198±73.4K3.0%0.6%84 tps1.1s256K$0.20$1.42
3101gpt-oss-20b1150±74K1.7%0.5%216 tps0.5s131K$0.06$0.26
452Qwen3.5 122B A17B1147±137651.9%1.5%82 tps1.4s256K$0.40$3.20
537Kimi K2.5 Instant1140±101.1K1.4%2.9%32 tps3.0s262K$0.50$3.00
648Step 3.5 Flash1140±159650.5%2.2%109 tps0.6s256K$0.05$0.15
7121Qwen3 32B Fast1098±89K1.0%11.6%30 tps3.1s41K$0.10$0.25
856DeepSeek V3.2 Thinking1096±63.8K0.9%9.0%30 tps2.6s131K$0.28$0.42
9121QwQ 32B1091±59.9K0.9%5.4%41 tps2.1s16K$0.43$0.56
1033Kimi K2.51090±64.5K0.7%6.5%33 tps1.7s262K$0.34$2.57
1186Nemotron 3 Nano (Thinking)1089±91.5K0.7%2.0%200 tps0.5s256K$0$0
1295DeepSeek-R1 Turbo1075±61.9K2.4%2.6%29 tps1.8s64K$2.85$4.75
1344Kimi K2 Thinking Turbo1065±63K1.9%2.0%75 tps1.4s262K$1.15$8.00
1465Mistral Large 31064±71.8K2.2%2.1%51 tps1.0s256K$0.50$1.50
15126Qwen3 30B A3B1051±73.9K1.3%5.1%163 tps1.0s41K$0.06$0.21
1665DeepSeek V3.2 Exp Chat1047±92.2K3.1%2.6%29 tps1.5s131K$0.27$0.39
17106DeepSeek V3.1 Terminus Thinking1035±111.4K2.8%5.9%27 tps1.8s131K$0.56$1.68
1886Qwen3 235B A22B1030±93.1K1.6%5.3%71 tps0.9s41K$0.23$0.63
1995DeepSeek V3.2 Exp Thinking1029±111.4K0.7%7.2%26 tps3.0s131K$0.28$0.42
20153Qwen 2.5 32B Instruct1019±81.4K1.8%2.5%48 tps1.0s131K$0.21$0.25
21126DeepSeek V31013±68.8K1.3%0.9%69 tps1.1s64K$0.59$1.49
22129Command A1005±58.6K1.7%2.2%42 tps0.8s256K$2.00$7.33
23113Kimi K2 Fast1003±410K1.8%0.8%365 tps0.5s131K$1.00$3.00
24133Qwen3 14B1002±63.6K1.6%1.7%109 tps0.8s41K$0.04$0.15
25148DeepSeek-R11001±65K1.7%0.8%133 tps0.6s64K$0.91$3.07
26133DeepSeek-R1 0528998±44.9K1.5%1.3%93 tps0.5s64K$1.60$3.67
27161Llama 4 Maverick980±67.3K1.8%1.2%88 tps2.4s1M$0.23$0.83
2879MiniMax M2.5 Lightning972±179351.1%1.5%51 tps2.0s205K$0.60$2.40
29170Llama 3.1 8B Turbo954±206852.1%2.1%650 tps0.5s128K$0.13$0.14
30177Mistral Small 3.1 24B Instruct949±167452.0%7.5%15 tps2.4s131K$0.06$0.18
31161DeepSeek Prover v2933±178252.4%5.2%14 tps1.3s164K$0.40$1.56
32214Llama 3.3 70B Instruct Turbo931±275053.8%2.0%78 tps1.0s131K$0.88$0.88
33201Gemma 3 27B IT930±156552.2%2.0%60 tps0.8s128K$0.17$0.29
34214Qwen 2.5 7B901±194904.9%3.7%40 tps1.9s131K$0.08$0.27
35186GLM 4.6V Flash899±151.3K2.3%3.7%64 tps2.1s128K$0.04$0.40
36165Pixtral Large890±126404.5%2.5%57 tps1.3s128K$1.50$4.50
37161Mistral Small 3.1835±176751.5%7.4%13 tps2.6s32K$0.17$0.28
38260Hermes 4 405B Reasoning FP8828±111.3K3.7%3.6%32 tps0.8s131K$1.00$3.00
39235Gemma 3 4B825±147553.2%1.3%138 tps0.7s131K$0.02$0.04
40194Llama 3.2 11B Instruct816±225252.8%1.5%152 tps0.5s8K$0.16$0.16
View All (51 models)