Models


Last updated about 1 month ago

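| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 47 | Kimi K2 Thinking Turbo | 1113 | ±16 | 14.3K | 2.6% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 2 | 59 | DeepSeek V3.2 Thinking | 1092 | ±14 | 16.2K | 3.8% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 3 | 103 | DeepSeek V3.2 Exp Thinking | 877 | ±19 | 3.7K | 2.1% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 4 | 51 | gpt-oss-120b | 873 | ±17 | 5K | 1.9% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |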
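
One rough way to read the scores and confidence intervals listed above is to check whether the intervals of neighboring models overlap. The sketch below assumes each interval is symmetric around the score (score minus the ± value to score plus the ± value), which the page does not state explicitly. The function names are hypothetical and not part of any Yupp API, and interval overlap is only an informal indicator, not a formal significance test.

```python
# Scores and interval half-widths copied from the leaderboard rows above.
# Assumption: each interval is symmetric, i.e. [score - half_width, score + half_width].
models = [
    ("Kimi K2 Thinking Turbo", 1113, 16),
    ("DeepSeek V3.2 Thinking", 1092, 14),
    ("DeepSeek V3.2 Exp Thinking", 877, 19),
    ("gpt-oss-120b", 873, 17),
]


def interval(score, half_width):
    """Return the (low, high) bounds of a symmetric interval."""
    return score - half_width, score + half_width


def overlaps(a, b):
    """Return True if the intervals of two (name, score, half_width) rows intersect."""
    lo_a, hi_a = interval(a[1], a[2])
    lo_b, hi_b = interval(b[1], b[2])
    return lo_a <= hi_b and lo_b <= hi_a


def adjacent_overlaps(rows):
    """Compare each row with the next one and report whether their intervals overlap."""
    return [(a[0], b[0], overlaps(a, b)) for a, b in zip(rows, rows[1:])]


if __name__ == "__main__":
    for upper, lower, overlap in adjacent_overlaps(models):
        verdict = "overlap" if overlap else "do not overlap"
        print(f"{upper} vs {lower}: intervals {verdict}")
```

Under this assumption, ranks 1 and 2 overlap, ranks 2 and 3 are clearly separated, and ranks 3 and 4 overlap, so the ordering within each of those overlapping pairs is less certain than the gap between them.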