
Last updated about 1 month ago

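| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 281 | 214 | C4AI Aya Expanse 32B | 931 | ±3 | 17.1K | 0.8% | 1.5% | 43 tps | 0.5s | 128K | $0.50 | $1.50 |
| 282 | 292 | NVIDIA Llama 3.1 Nemotron Ultra 253B v1 | 930 | ±5 | 8.5K | 0.9% | <0.1% | 40 tps | 0.8s | 128K | $0.30 | $0.90 |
| 283 | 225 | GPT-3.5 Turbo 16k | 929 | ±3 | 9.2K | 0.6% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |
| 284 | 225 | Command R 7B | 928 | ±3 | 12.7K | 1.1% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 |
| 285 | 209 | Llama 3.3 Swallow 70B Instruct | 928 | ±3 | 9.7K | 1.1% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 286 | 209 | GPT-3.5 Turbo | 928 | ±4 | 5.8K | 0.6% | 1.3% | 74 tps | 0.9s | 16K | $0.75 | $1.75 |
| 287 | 186 | Grok 3 Mini Fast | 927 | ±2 | 21.1K | 1.5% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 288 | 201 | Gemma 3 27B IT | 927 | ±3 | 8.9K | 0.9% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 |
| 289 | 314 | Cogito V2 Preview Llama 405B | 927 | ±8 | 800 | 2.4% | <0.1% | 23 tps | 2.1s | 33K | $1.17 | $1.17 |
| 290 | 175 | OpenAI o3-mini-low | 925 | ±3 | 18K | 2.6% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 291 | 214 | Qwen 2.5 7B | 924 | ±4 | 6.9K | 1.4% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 |
| 292 | 292 | Exaone 3.5 32B Instruct | 924 | ±4 | 2.8K | 1.7% | <0.1% | 17 tps | N/A | 33K | $0 | $0 |
| 293 | 222 | Jamba 1.5 Large | 924 | ±4 | 13K | 1.0% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 294 | 214 | Moonshot V1 128k | 922 | ±5 | 4.4K | 0.9% | 1.4% | 54 tps | 1.5s | 131K | $2.00 | $5.00 |
| 295 | 209 | Seed 1.6 Flash 250715 | 921 | ±5 | 2.5K | 1.9% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 296 | 177 | OpenAI o3-mini | 921 | ±3 | 19.4K | 2.3% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 297 | 324 | Solar Pro 3 | 921 | ±10 | 1.7K | 2.6% | 2.0% | 99 tps | 1.3s | 131K | $0.15 | $0.60 |
| 298 | 214 | Gemma 3 12B | 920 | ±4 | 9K | 1.3% | 4.2% | 73 tps | 0.8s | 131K | $0.05 | $0.12 |
| 299 | 277 | Claude Sonnet 3 | 920 | ±4 | 3.9K | 0.8% | <0.1% | 35 tps | 1.0s | 200K | $3.00 | $15.00 |
| 300 | 214 | Llama 3.3 70B Instruct Turbo | 919 | ±8 | 3.7K | 1.5% | 2.0% | 78 tps | 1.0s | 131K | $0.88 | $0.88 |
| 301 | 277 | Wikipedia | 917 | ±2 | 67K | 1.7% | <0.1% | 47 tps | 2.1s | 32K | $0 | $0 |
| 302 | 302 | Yi Large | 917 | ±3 | 8.6K | 0.3% | <0.1% | 34 tps | N/A | 33K | $1.50 | $1.50 |
| 303 | 201 | Devstral Small | 917 | ±4 | 5.1K | 1.3% | 2.4% | 180 tps | 0.6s | 131K | $0.10 | $0.30 |
| 304 | 233 | Cogito V2 Preview Llama 70B | 915 | ±14 | 940 | 3.1% | <0.1% | 44 tps | 1.6s | 33K | $0.44 | $0.44 |
| 305 | 324 | Qwen 2 72B Instruct | 914 | ±3 | 4.7K | 0.7% | <0.1% | 3 tps | N/A | 33K | $0.90 | $0.90 |
| 306 | 277 | Jamba 1.7 Mini | 913 | ±8 | 2.5K | 2.6% | <0.1% | 84 tps | 0.9s | 256K | $0.20 | $0.40 |
| 307 | 292 | AFM 4.5B | 913 | ±3 | 9.1K | 2.6% | <0.1% | 81 tps | 0.3s | 66K | $0.05 | $0.20 |
| 308 | 225 | Command R | 913 | ±3 | 9.6K | 1.5% | 5.8% | 54 tps | 0.6s | 128K | $0.30 | $0.99 |
| 309 | 324 | Typhoon 2 70B Instruct | 911 | ±4 | 8.1K | 0.9% | <0.1% | 19 tps | 0.1s | 8K | $0.88 | $0.88 |
| 310 | 225 | Open Mistral Nemo | 910 | ±5 | 6.7K | 1.1% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 |
| 311 | 240 | Mistral Nemo | 910 | ±5 | 3.8K | 0.5% | <0.1% | 112 tps | 0.4s | 131K | $0.07 | $0.13 |
| 312 | 235 | Gemma 3 4B | 909 | ±4 | 11.3K | 1.0% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 |
| 313 | 194 | INTELLECT-3 | 909 | ±14 | 570 | 2.6% | 1.5% | 114 tps | 0.6s | 131K | $0.20 | $1.10 |
| 314 | 235 | Hermes 2 Pro Llama 3 8B | 908 | ±3 | 8.3K | 0.7% | <0.1% | 76 tps | 1.0s | 131K | $0.08 | $0.09 |
| 315 | 339 | Refuel LLM 2 Small | 907 | ±3 | 17.9K | 1.0% | <0.1% | 116 tps | 0.5s | 8K | $0.20 | $0.20 |
| 316 | 222 | Sky T1 32B Preview | 905 | ±4 | 10.5K | 1.1% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 317 | 229 | Ministral 8B | 905 | ±4 | 6.8K | 1.4% | 1.4% | 177 tps | 0.4s | 128K | $0.14 | $0.14 |
| 318 | 235 | Command R+ | 904 | ±5 | 6.3K | 1.3% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 |
| 319 | 240 | GPT-3.5 Turbo Instruct | 903 | ±3 | 7K | 0.6% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 |
| 320 | 331 | Marin 8B Instruct | 902 | ±9 | 935 | 2.6% | <0.1% | 170 tps | 0.2s | 131K | $0.18 | $0.18 |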
View All (410 models)