Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1178
DeepSeek V3.2 Thinking
1107
DeepSeek V3.2 Exp Chat
1089
DeepSeek V3.2 Exp Thinking
1061
DeepSeek V3.1 Terminus Thinking
1032
DeepSeek V3
1003
DeepSeek-R1 Turbo
979
DeepSeek-R1 0528
939
DeepSeek-R1
935
DeepSeek Prover v2
835
DeepSeek-R1 Distill Llama 70B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
160DeepSeek V3.2 Thinking1178±923.3K4.0%9.0%30 tps2.6s131K$0.28$0.42
290DeepSeek V3.2 Exp Chat1107±45.5K6.1%2.6%29 tps1.5s131K$0.27$0.39
398DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
4119DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
5135DeepSeek V31032±517.6K3.7%0.9%69 tps1.1s64K$0.59$1.49
6148DeepSeek-R1 Turbo1003±92.5K5.6%2.6%29 tps1.8s64K$2.85$4.75
7159DeepSeek-R1 0528979±65.5K3.5%1.3%93 tps0.5s64K$1.60$3.67
8179DeepSeek-R1939±66.4K4.3%0.8%133 tps0.6s64K$0.91$3.07
9179DeepSeek Prover v2935±101.3K3.0%5.2%14 tps1.3s164K$0.40$1.56
10240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95