Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

938
Nemotron 3 Nano (Thinking)
938
GLM 4.6V
937
OpenAI o4-mini-high
937
OpenAI o3-mini-low
935
Llama 3 70B Turbo
933
Qwen3 14B
932
Arcee AI Virtuoso-Medium
932
Qwen3 32B Fast
929
Qwen 2.5 VL 72B Instruct
928
Qwen 2.5 14B Instruct
926
Grok 3 Mini Beta
924
DeepSeek V3.2 Speciale
924
Devstral Small
922
Gemini 1.5 Pro
920
Magistral Small 2506

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
20186Nemotron 3 Nano (Thinking)938±141.3K7.6%2.0%200 tps0.5s256K$0$0
202139GLM 4.6V938±112.5K6.1%6.4%21 tps1.8s128K$0.38$0.90
203148OpenAI o4-mini-high937±66K14.9%1.9%117 tps15.9s200K$1.10$4.40
204175OpenAI o3-mini-low937±46.1K13.6%0.7%139 tps1.5s200K$1.10$4.40
205177Llama 3 70B Turbo935±91.9K2.1%<0.1%31 tps0.0s8K$0.73$0.83
206133Qwen3 14B933±112.7K17.1%1.7%109 tps0.8s41K$0.04$0.15
207270Arcee AI Virtuoso-Medium932±154906.7%<0.1%3 tpsN/A131K$0.50$0.80
208121Qwen3 32B Fast932±54.5K12.9%11.6%30 tps3.1s41K$0.10$0.25
209265Qwen 2.5 VL 72B Instruct929±121.2K7.9%5.3%25 tps3.7s128K$1.01$2.79
210209Qwen 2.5 14B Instruct928±1391011.7%2.4%40 tps1.6s1M$0.40$1.61
211219Grok 3 Mini Beta926±111.1K1.3%<0.1%75 tps0.5s131K$0.45$2.25
212133DeepSeek V3.2 Speciale924±121.6K6.3%6.0%43 tps1.4s131K$0.84$1.52
213201Devstral Small924±1657012.3%2.4%180 tps0.6s131K$0.10$0.30
214211Gemini 1.5 Pro922±81.8K3.0%<0.1%15 tps0.0s2M$0.78$3.13
215194Magistral Small 2506920±152K6.9%1.6%156 tps0.5s40K$0.37$1.10
216179Inception Mercury919±102.8K11.5%0.4%257 tps1.1s32K$0.25$1.00
217148Qwen3 30B A3B Thinking 2507919±101.3K3.6%0.5%124 tps1.2s131K$0.16$1.70
218229Magistral Medium 2509918±82.1K11.3%4.0%58 tps0.9s131K$2.00$5.00
219153Qwen 2.5 32B Instruct916±91.9K18.0%2.5%48 tps1.0s131K$0.21$0.25
220179Amazon Nova Pro 1.0916±162.1K10.3%0.9%96 tps0.7s300K$0.80$1.70
221186Mistral Small 3.2 24B Instruct915±225259.5%1.9%113 tps1.1s131K$0.02$0.08
222214Llama 3.3 70B Instruct Turbo914±2464011.7%2.0%78 tps1.0s131K$0.88$0.88
223160Llama 4 Scout911±68K9.6%0.6%88 tps5.1s131K$0.18$0.46
224179Baichuan-M2-32B911±2550513.7%<0.1%32 tps3.3s131K$0.07$0.07
225170Kimi K2 0711911±83.2K9.2%1.6%29 tps1.3s131K$0.72$2.60
226170Mistral Small 3.2 24B911±102K12.4%2.8%141 tps0.7s33K$0.02$0.08
227182Fauna Fox909±112.5K10.1%<0.1%194 tps0.3s128K$0.04$0.15
228253R1 1776908±1787014.3%<0.1%61 tps1.0s128K$2.00$8.00
229214OpenAI o3-mini-high908±141.1K6.5%2.4%231 tps10.5s200K$1.10$4.40
230246DeepSeek-R1 Distill Llama 70B907±235357.0%3.6%27 tps1.6s32K$0.73$0.95
231186Gemma 3n E4B905±102.6K8.4%2.0%30 tps0.5s8K$0.01$0.02
232201ERNIE 4.5 VL 424B A47B905±128057.5%4.9%36 tps3.5s123K$0.42$1.25
233292NVIDIA Llama 3.1 Nemotron Ultra 253B v1905±1478511.3%<0.1%40 tps0.8s128K$0.30$0.90
234209Llama 3.3 Swallow 70B Instruct904±81.6K15.2%1.4%153 tps1.3s131K$0.13$0.39
235186Grok 3 Mini903±56K12.8%1.2%43 tps0.5s131K$0.30$0.50
236157Qwen3 Next 80B A3B Thinking903±74.9K11.2%0.6%175 tps1.3s256K$0.21$2.26
237161Qwen3 8B902±122K17.8%2.4%61 tps1.4s41K$0.02$0.07
238186Gemma 3 27B902±1861513.4%1.8%35 tps1.1s66K$0.06$0.10
239186Grok 3 Mini Fast897±75.2K14.9%1.6%44 tps0.5s131K$0.60$4.00
240277Dobby Unhinged Llama 3.3 70B896±227751.9%<0.1%41 tps0.4s128K$0.90$0.90
View All (312 models)