Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

890
Llama 4 Maverick
911
Mistral Large 3
924
Command A
954
Kimi K2 Thinking Turbo
968
QwQ 32B
974
DeepSeek V3
992
Kimi K2 Fast
1050
DeepSeek V3.2 Thinking
1135
Qwen3 Next 80B A3B Instruct
1136
gpt-oss-120b
1186
Kimi K2.5

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1161Llama 4 Maverick890±142.2K1.8%1.2%88 tps2.4s1M$0.23$0.83
265Mistral Large 3911±305353.6%2.1%51 tps1.0s256K$0.50$1.50
3129Command A924±172K2.0%2.2%42 tps0.8s256K$2.00$7.33
444Kimi K2 Thinking Turbo954±285051.0%2.0%75 tps1.4s262K$1.15$8.00
5121QwQ 32B968±206501.5%5.4%41 tps2.1s16K$0.43$0.56
6126DeepSeek V3974±161.5K1.4%0.9%69 tps1.1s64K$0.59$1.49
7113Kimi K2 Fast992±103.5K2.8%0.8%365 tps0.5s131K$1.00$3.00
856DeepSeek V3.2 Thinking1050±236951.4%9.0%30 tps2.6s131K$0.28$0.42
933Qwen3 Next 80B A3B Instruct1135±256700.7%0.6%84 tps1.1s256K$0.20$1.42
1048gpt-oss-120b1136±197751.3%0.7%213 tps0.5s131K$0.11$0.50
1133Kimi K2.51186±387600.7%6.5%33 tps1.7s262K$0.34$2.57