Models

Last updated about 1 month ago

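| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 161 | 179 | DeepSeek Prover v2 | 935 | ±10 | 1.3K | 3.0% | 5.2% | 14 tps | 1.3s | 164K | $0.40 | $1.56 |
| 162 | 230 | R1 1776 | 935 | ±9 | 3.3K | 4.2% | <0.1% | 61 tps | 1.0s | 128K | $2.00 | $8.00 |
| 163 | 230 | Dobby Unhinged Llama 3.3 70B | 935 | ±19 | 860 | 2.8% | <0.1% | 41 tps | 0.4s | 128K | $0.90 | $0.90 |
| 164 | 230 | Jamba 1.7 Mini | 936 | ±24 | 1K | 8.4% | <0.1% | 84 tps | 0.9s | 256K | $0.20 | $0.40 |
| 165 | 179 | DeepSeek-R1 | 939 | ±6 | 6.4K | 4.3% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 166 | 179 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 941 | ±19 | 1.4K | 6.9% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 167 | 179 | ERNIE 4.5 VL 424B A47B | 942 | ±18 | 725 | 6.5% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 |
| 168 | 230 | NVIDIA Llama 3.3 Nemotron Super 49B v1 | 942 | ±9 | 3.6K | 2.3% | <0.1% | 13 tps | N/A | 131K | $0.07 | $0.20 |
| 169 | 179 | ERNIE 4.5 21B A3B | 943 | ±28 | 540 | 6.9% | 2.3% | 78 tps | 1.5s | 120K | $0.05 | $0.19 |
| 170 | 179 | Grok 3 Mini Fast | 943 | ±7 | 9K | 7.0% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 171 | 179 | Ministral 14B 3.0 | 948 | ±16 | 805 | 8.5% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 172 | 179 | Qwen3 8B | 948 | ±9 | 4.2K | 8.2% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 173 | 179 | Switchpoint Router | 949 | ±10 | 2.7K | 3.6% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 174 | 230 | Magistral Medium | 952 | ±16 | 1.3K | 10.8% | <0.1% | 95 tps | 0.5s | 41K | $2.00 | $5.00 |
| 175 | 179 | Qwen3 30B A3B Thinking 2507 | 953 | ±10 | 3.5K | 4.7% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 176 | 167 | Llama 3.1 8B Turbo | 958 | ±14 | 2.2K | 2.0% | 2.1% | 650 tps | 0.5s | 128K | $0.13 | $0.14 |
| 177 | 167 | Qwen 2.5 72B | 960 | ±15 | 1.4K | 4.5% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 178 | 167 | Qwen3 14B | 962 | ±8 | 5.3K | 8.4% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 179 | 167 | Devstral Medium | 962 | ±11 | 3.5K | 5.2% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 180 | 167 | Llama 4 Scout | 965 | ±5 | 17.5K | 5.3% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 181 | 167 | Qwen3 VL 30B A3B Thinking | 967 | ±11 | 1.9K | 8.9% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 182 | 167 | Pixtral Large | 969 | ±14 | 3.5K | 3.9% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 183 | 167 | Mistral Small 3.2 24B | 970 | ±13 | 4.6K | 4.9% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 184 | 167 | Llama 4 Maverick | 971 | ±5 | 21K | 5.0% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 185 | 219 | Arcee Coder Large | 971 | ±7 | 3.6K | 2.6% | <0.1% | 54 tps | 1.3s | 33K | $0.50 | $0.80 |
| 186 | 167 | Qwen 2.5 32B Instruct | 972 | ±7 | 4.1K | 6.5% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 187 | 211 | Arcee AI Coder-Large | 972 | ±15 | 985 | 4.4% | <0.1% | 60 tps | 1.6s | 33K | $0.50 | $0.80 |
| 188 | 167 | Nemotron 3 Nano | 974 | ±46 | 580 | 6.5% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 189 | 211 | Grok 4 (Low Reasoning) | 975 | ±21 | 520 | 2.8% | <0.1% | 18 tps | 9.5s | 256K | $0 | $0 |
| 190 | 167 | DeepSeek V3.1 Thinking | 976 | ±9 | 5.2K | 9.5% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 191 | 159 | DeepSeek-R1 0528 | 979 | ±6 | 5.5K | 3.5% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 192 | 159 | Mistral Small 3.1 24B Instruct | 980 | ±11 | 2.9K | 4.3% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 |
| 193 | 159 | Seed 2.0 Mini (Medium) | 981 | ±35 | 570 | 5.8% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 194 | 159 | Kimi K2 0711 | 981 | ±6 | 7K | 4.5% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 195 | 195 | GLM 4.6 FP8 | 982 | ±17 | 1.2K | 11.7% | <0.1% | 56 tps | 1.8s | 200K | $0.40 | $1.75 |
| 196 | 195 | Cypher Alpha | 985 | ±20 | 735 | 8.7% | <0.1% | 4 tps | N/A | 1M | $0 | $0 |
| 197 | 159 | GLM 4.6V | 986 | ±8 | 3K | 5.5% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 198 | 159 | Grok Code Fast 1 | 987 | ±9 | 2.5K | 6.0% | 5.9% | 294 tps | 0.5s | 256K | $0.20 | $1.50 |
| 199 | 159 | OpenAI o3-mini-low | 988 | ±6 | 12.2K | 6.4% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 200 | 159 | GPT-5 Nano | 989 | ±6 | 4.6K | 8.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |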
Showing ranks 161 to 200 of 404 models.
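
When reading the VIBE Score and Confidence Interval columns together, it can help to check whether two models' score ranges overlap before treating a small score gap as meaningful. The following is a minimal sketch, assuming the ± value is a symmetric half-width around the score; the page does not say how the interval is computed. The `Entry` class and `intervals_overlap` function are hypothetical helpers written for illustration, and the three sample values are copied from the leaderboard rows.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row, reduced to the fields needed here (hypothetical helper)."""
    name: str
    score: float       # value from the "VIBE Score" column
    half_width: float  # the "±" value; assumed to be a symmetric half-width

    @property
    def low(self) -> float:
        return self.score - self.half_width

    @property
    def high(self) -> float:
        return self.score + self.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score ranges share at least one point."""
    return a.low <= b.high and b.low <= a.high


# Sample values copied from the leaderboard rows.
prover = Entry("DeepSeek Prover v2", 935, 10)  # range 925..945
r1 = Entry("DeepSeek-R1", 939, 6)              # range 933..945
nano = Entry("GPT-5 Nano", 989, 6)             # range 983..995

print(intervals_overlap(prover, r1))    # True
print(intervals_overlap(prover, nano))  # False
```

Under this assumption, the 4-point gap between DeepSeek Prover v2 and DeepSeek-R1 falls inside both ranges, while the gap to GPT-5 Nano does not. Overlap is only a rough screen, not a significance test.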