Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

971
Llama 4 Maverick
965
Llama 4 Scout
958
Llama 3.1 8B Turbo
941
NVIDIA Llama 3.3 Nemotron Super 49B v1.5
933
Llama 3.3 70B
919
Llama 3.3 Swallow 70B Instruct
904
Llama 3 8B
885
Llama 3.2 11B Instruct
864
Hermes 2 Pro Llama 3 8B
851
Llama 3.3 70B Instruct Turbo
835
DeepSeek-R1 Distill Llama 70B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1167Llama 4 Maverick971±521K5.0%1.2%88 tps2.4s1M$0.23$0.83
2167Llama 4 Scout965±517.5K5.3%0.6%88 tps5.1s131K$0.18$0.46
3167Llama 3.1 8B Turbo958±142.2K2.0%2.1%650 tps0.5s128K$0.13$0.14
4179NVIDIA Llama 3.3 Nemotron Super 49B v1.5941±191.4K6.9%2.0%50 tps0.6s131K$0.09$0.33
5189Llama 3.3 70B933±103K6.6%0.3%500 tps0.5s8K$0.48$0.66
6189Llama 3.3 Swallow 70B Instruct919±83.5K5.5%1.4%153 tps1.3s131K$0.13$0.39
7201Llama 3 8B904±103.2K3.5%6.0%85 tps0.7s8K$0.12$0.16
8201Llama 3.2 11B Instruct885±152.1K4.1%1.5%152 tps0.5s8K$0.16$0.16
9210Hermes 2 Pro Llama 3 8B864±211.8K2.5%<0.1%76 tps1.0s131K$0.08$0.09
10234Llama 3.3 70B Instruct Turbo851±191.2K6.0%2.0%78 tps1.0s131K$0.88$0.88
11240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95