Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1110
Kimi K2.5
1038
Qwen3 Next 80B A3B Instruct
1000
gpt-oss-120b
942
DeepSeek V3.2 Thinking
917
Kimi K2 Thinking Turbo
910
DeepSeek V3
887
Kimi K2 Fast
881
Mistral Large 3
836
Command A
816
gpt-oss-20b
719
Llama 4 Maverick

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
133Kimi K2.51110±267202.0%6.5%33 tps1.7s262K$0.34$2.57
233Qwen3 Next 80B A3B Instruct1038±159202.6%0.6%84 tps1.1s256K$0.20$1.42
348gpt-oss-120b1000±151.1K1.3%0.7%213 tps0.5s131K$0.11$0.50
456DeepSeek V3.2 Thinking942±267052.8%9.0%30 tps2.6s131K$0.28$0.42
544Kimi K2 Thinking Turbo917±275301.9%2.0%75 tps1.4s262K$1.15$8.00
6126DeepSeek V3910±385651.7%0.9%69 tps1.1s64K$0.59$1.49
7113Kimi K2 Fast887±141.6K1.0%0.8%365 tps0.5s131K$1.00$3.00
865Mistral Large 3881±274953.9%2.1%51 tps1.0s256K$0.50$1.50
9129Command A836±158551.2%2.2%42 tps0.8s256K$2.00$7.33
10101gpt-oss-20b816±205551.8%0.5%216 tps0.5s131K$0.06$0.26
11161Llama 4 Maverick719±271K2.9%1.2%88 tps2.4s1M$0.23$0.83