
Last updated about 1 month ago

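| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 281 | 179 | GLM 4.7 Flash | 956 | ±8 | 4.8K | 1.8% | 5.8% | 61 tps | 2.8s | 128K | $0.07 | $0.39 |
| 282 | 292 | NVIDIA Llama 3.1 Nemotron Ultra 253B v1 | 956 | ±3 | 9.4K | 1.6% | <0.1% | 40 tps | 0.8s | 128K | $0.30 | $0.90 |
| 283 | 222 | Rnj-1 Instruct | 954 | ±6 | 3K | 3.7% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 |
| 284 | 201 | Devstral Small | 954 | ±6 | 5.4K | 2.4% | 2.4% | 180 tps | 0.6s | 131K | $0.10 | $0.30 |
| 285 | 292 | Exaone 3.5 32B Instruct | 954 | ±5 | 3.1K | 1.3% | <0.1% | 17 tps | N/A | 33K | $0 | $0 |
| 286 | 270 | Arcee AI Virtuoso-Medium | 954 | ±2 | 10.8K | 0.9% | <0.1% | 3 tps | N/A | 131K | $0.50 | $0.80 |
| 287 | 292 | Arcee AI Spotlight | 953 | ±2 | 19.7K | 2.1% | <0.1% | 121 tps | 0.4s | 131K | $0.18 | $0.18 |
| 288 | 209 | Qwen3.5 9B FP8 | 952 | ±19 | 495 | 3.9% | 5.8% | 64 tps | 0.7s | 256K | $0.10 | $0.15 |
| 289 | 270 | Solar Pro 2 250710 (Reasoning) | 952 | ±5 | 4K | 2.5% | <0.1% | 9 tps | N/A | 66K | $0.50 | $0.50 |
| 290 | 209 | GPT-3.5 Turbo | 950 | ±2 | 6K | 1.0% | 1.3% | 74 tps | 0.9s | 16K | $0.75 | $1.75 |
| 291 | 314 | Weather | 948 | ±4 | 5.3K | 2.0% | <0.1% | 36 tps | 1.1s | 32K | $0 | $0 |
| 292 | 186 | Grok 3 Mini | 948 | ±2 | 28.5K | 3.9% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 293 | 277 | Cypher Alpha | 947 | ±3 | 3.5K | 2.5% | <0.1% | 4 tps | N/A | 1M | $0 | $0 |
| 294 | 214 | Krutrim 2 | 947 | ±2 | 11.8K | 0.6% | 12.5% | 33 tps | 2.1s | 128K | $1.00 | $1.00 |
| 295 | 225 | Open Mistral Nemo | 946 | ±3 | 7.3K | 2.1% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 |
| 296 | 302 | OLMo 2 0425 1B Instruct | 946 | ±6 | 3K | 1.3% | <0.1% | 68 tps | 0.0s | 4K | $0 | $0 |
| 297 | 235 | Hermes 2 Pro Llama 3 8B | 945 | ±2 | 8.8K | 1.0% | <0.1% | 76 tps | 1.0s | 131K | $0.08 | $0.09 |
| 298 | 222 | Jamba 1.5 Large | 944 | ±3 | 13.6K | 1.6% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 299 | 209 | Qwen 2.5 14B Instruct | 944 | ±3 | 9K | 2.3% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 |
| 300 | 277 | Dobby Unhinged Llama 3.3 70B | 943 | ±5 | 4.4K | 0.8% | <0.1% | 41 tps | 0.4s | 128K | $0.90 | $0.90 |
| 301 | 292 | GPT-5 Nano Minimal | 943 | ±4 | 3.2K | 8.1% | <0.1% | 88 tps | 0.8s | 400K | $0.05 | $0.40 |
| 302 | 253 | R1 1776 | 943 | ±4 | 7.1K | 3.5% | <0.1% | 61 tps | 1.0s | 128K | $2.00 | $8.00 |
| 303 | 277 | Grok 2 | 943 | ±3 | 9.7K | 1.2% | <0.1% | 55 tps | 1.1s | 131K | $2.00 | $10.00 |
| 304 | 302 | Yi Large | 941 | ±3 | 8.8K | 0.3% | <0.1% | 34 tps | N/A | 33K | $1.50 | $1.50 |
| 305 | 225 | Command R 7B | 940 | ±3 | 14K | 1.9% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 |
| 306 | 302 | YouTube | 939 | ±5 | 8.1K | 3.1% | <0.1% | 34 tps | 2.7s | 32K | $0.99 | $0.99 |
| 307 | 209 | Llama 3.3 Swallow 70B Instruct | 938 | ±3 | 10.7K | 3.2% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 308 | 324 | Qwen 2 72B Instruct | 938 | ±3 | 4.8K | 1.4% | <0.1% | 3 tps | N/A | 33K | $0.90 | $0.90 |
| 309 | 201 | Gemma 3 27B IT | 938 | ±3 | 9.7K | 1.7% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 |
| 310 | 361 | Venice Uncensored | 934 | ±11 | 1.4K | 4.9% | <0.1% | 59 tps | 3.9s | 33K | $0 | $0 |
| 311 | 214 | Qwen 2.5 7B | 934 | ±4 | 7.5K | 2.3% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 |
| 312 | 186 | GLM 4.6V Flash | 933 | ±4 | 7.9K | 3.7% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 |
| 313 | 201 | GPT-4o mini | 932 | ±4 | 9K | 3.4% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 314 | 214 | C4AI Aya Expanse 32B | 930 | ±2 | 17.9K | 1.6% | 1.5% | 43 tps | 0.5s | 128K | $0.50 | $1.50 |
| 315 | 229 | ERNIE 4.5 21B A3B Thinking | 930 | ±6 | 2.7K | 3.6% | 1.8% | 87 tps | 1.5s | 120K | $0.07 | $0.28 |
| 316 | 277 | Claude Sonnet 3 | 930 | ±3 | 4.4K | 1.1% | <0.1% | 35 tps | 1.0s | 200K | $3.00 | $15.00 |
| 317 | 246 | Ministral 3B | 929 | ±3 | 8.6K | 2.2% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 |
| 318 | 229 | Moonshot V1 Auto | 928 | ±6 | 4K | 1.6% | 1.2% | 54 tps | 1.5s | 8K | $2.00 | $5.00 |
| 319 | 240 | Mistral Nemo | 928 | ±3 | 4.2K | 1.2% | <0.1% | 112 tps | 0.4s | 131K | $0.07 | $0.13 |
| 320 | 240 | Llama 3.3 70B Instruct | 927 | ±11 | 900 | 2.7% | 5.3% | 28 tps | 1.3s | 128K | $0.38 | $0.55 |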
Showing ranks 281 to 320 of 432 models.