Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1059
Llama 3 8B Turbo
1037
Llama 3 70B Turbo
971
Llama 4 Maverick
965
Llama 4 Scout
958
Llama 3.1 8B Turbo
942
NVIDIA Llama 3.3 Nemotron Super 49B v1
941
NVIDIA Llama 3.3 Nemotron Super 49B v1.5
935
Dobby Unhinged Llama 3.3 70B
933
Llama 3.3 70B
933
Llama 3.1 70B Instruct Turbo
928
NVIDIA Llama 3.1 Nemotron 70B
919
Llama 3.3 Swallow 70B Instruct
904
Llama 3 8B
896
Llama 3.1 405B Instruct Turbo
885
Llama 3.2 11B Instruct

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1151Llama 3 8B Turbo1059±246001.6%<0.1%97 tps0.1s8K$0.12$0.13
2164Llama 3 70B Turbo1037±64.3K1.0%<0.1%31 tps0.0s8K$0.73$0.83
3167Llama 4 Maverick971±521K5.0%1.2%88 tps2.4s1M$0.23$0.83
4167Llama 4 Scout965±517.5K5.3%0.6%88 tps5.1s131K$0.18$0.46
5167Llama 3.1 8B Turbo958±142.2K2.0%2.1%650 tps0.5s128K$0.13$0.14
6230NVIDIA Llama 3.3 Nemotron Super 49B v1942±93.6K2.3%<0.1%13 tpsN/A131K$0.07$0.20
7179NVIDIA Llama 3.3 Nemotron Super 49B v1.5941±191.4K6.9%2.0%50 tps0.6s131K$0.09$0.33
8230Dobby Unhinged Llama 3.3 70B935±198602.8%<0.1%41 tps0.4s128K$0.90$0.90
9189Llama 3.3 70B933±103K6.6%0.3%500 tps0.5s8K$0.48$0.66
10245Llama 3.1 70B Instruct Turbo933±114.1K3.8%<0.1%110 tps0.8s128K$0.88$0.88
11245NVIDIA Llama 3.1 Nemotron 70B928±75.3K2.0%<0.1%9 tps0.1s128K$0.33$0.39
12189Llama 3.3 Swallow 70B Instruct919±83.5K5.5%1.4%153 tps1.3s131K$0.13$0.39
13201Llama 3 8B904±103.2K3.5%6.0%85 tps0.7s8K$0.12$0.16
14264Llama 3.1 405B Instruct Turbo896±112K3.9%<0.1%26 tps0.8s131K$3.50$3.50
15201Llama 3.2 11B Instruct885±152.1K4.1%1.5%152 tps0.5s8K$0.16$0.16
16210Hermes 2 Pro Llama 3 8B864±211.8K2.5%<0.1%76 tps1.0s131K$0.08$0.09
17234Llama 3.3 70B Instruct Turbo851±191.2K6.0%2.0%78 tps1.0s131K$0.88$0.88
18240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95
19324NVIDIA Llama 3.1 Nemotron Ultra 253B v1832±162.2K4.1%<0.1%40 tps0.8s128K$0.30$0.90
20386Shisa V2 Llama 3.3 70B623±245859.3%<0.1%8 tps2.0s33K$0.03$0.09
21390DeepSeek-R1 Distill Llama 8B595±191.2K5.6%<0.1%17 tpsN/A32K$0.04$0.04