Last updated about 1 month ago

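| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 161 | 200 | K2 Think | 999 | ±14 | 1.1K | 6.2% | <0.1% | 418 tps | 2.8s | N/A | $0 | $0 |
| 162 | 170 | Llama 3.1 8B Turbo | 998 | ±12 | 1.1K | 2.8% | 2.1% | 650 tps | 0.5s | 128K | $0.13 | $0.14 |
| 163 | 111 | LongCat Flash Chat | 996 | ±14 | 930 | 7.0% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 164 | 219 | Arcee AI Virtuoso-Large | 992 | ±10 | 1.4K | 15.2% | <0.1% | 64 tps | 0.5s | 131K | $0.75 | $1.20 |
| 165 | 133 | Kimi K2 0905 | 991 | ±6 | 7.5K | 5.6% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 166 | 129 | Qwen3 Max Thinking | 990 | ±13 | 1.7K | 2.3% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 167 | 159 | Qwen Turbo | 987 | ±4 | 4.8K | 12.9% | <0.1% | 53 tps | 1.1s | 1M | $0.05 | $0.20 |
| 168 | 157 | Cogito v2.1 671B | 984 | ±17 | 715 | 5.9% | 0.8% | 85 tps | 0.5s | 128K | $1.25 | $1.25 |
| 169 | 133 | DeepSeek-R1 0528 | 983 | ±12 | 1.3K | 4.6% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 170 | 201 | Gemma 3 27B IT | 983 | ±15 | 905 | 10.4% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 |
| 171 | 147 | GLM 4.5 Air | 981 | ±6 | 4.6K | 15.0% | <0.1% | 22 tps | 1.4s | 131K | $0.10 | $0.38 |
| 172 | 147 | Arcee AI Maestro Reasoning | 980 | ±9 | 1.8K | 12.0% | <0.1% | 85 tps | 0.3s | 131K | $0.90 | $3.30 |
| 173 | 241 | Arcee AI Blitz | 979 | ±13 | 610 | 5.4% | <0.1% | 6 tps | N/A | 33K | $0.45 | $0.75 |
| 174 | 126 | Qwen3 VL 235B A22B Thinking | 979 | ±6 | 3.5K | 11.5% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 175 | 165 | DeepSeek R1T2 Chimera | 978 | ±10 | 1.1K | 11.0% | 3.0% | 28 tps | 1.8s | 164K | $0.13 | $0.45 |
| 176 | 214 | Qwen 2.5 VL 32B Instruct | 977 | ±20 | 850 | 7.6% | 6.3% | 43 tps | 3.2s | 128K | $0.35 | $0.62 |
| 177 | 209 | Seed 1.6 Flash 250715 | 974 | ±16 | 980 | 6.2% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 178 | 265 | Llama 3.1 405B Instruct Turbo | 973 | ±18 | 625 | 10.1% | <0.1% | 26 tps | 0.8s | 131K | $3.50 | $3.50 |
| 179 | 222 | Sky T1 32B Preview | 972 | ±16 | 805 | 10.6% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 180 | 302 | YouTube | 968 | ±12 | 3.7K | 3.4% | <0.1% | 34 tps | 2.7s | 32K | $0.99 | $0.99 |
| 181 | 177 | Mistral Small 3.1 24B Instruct | 966 | ±12 | 1K | 10.6% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 |
| 182 | 161 | Mistral Small 3.1 | 960 | ±16 | 915 | 11.2% | 7.4% | 13 tps | 2.6s | 32K | $0.17 | $0.28 |
| 183 | 302 | OLMo 2 0425 1B Instruct | 956 | ±19 | 570 | 1.7% | <0.1% | 68 tps | 0.0s | 4K | $0 | $0 |
| 184 | 161 | Llama 4 Maverick | 956 | ±4 | 11.2K | 8.2% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 185 | 121 | QwQ 32B | 955 | ±7 | 5K | 15.3% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 186 | 253 | Magistral Medium | 955 | ±14 | 795 | 18.0% | <0.1% | 95 tps | 0.5s | 41K | $2.00 | $5.00 |
| 187 | 101 | gpt-oss-20b | 954 | ±5 | 6.1K | 10.8% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 188 | 233 | Llama 3.1 70B Instruct Turbo | 951 | ±10 | 1.9K | 10.3% | <0.1% | 110 tps | 0.8s | 128K | $0.88 | $0.88 |
| 189 | 165 | Qwen3 VL 30B A3B Thinking | 949 | ±8 | 1.5K | 11.2% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 190 | 213 | Claude Haiku 3.5 | 949 | ±8 | 3.4K | 9.7% | 0.8% | 40 tps | 2.8s | 200K | $0.80 | $4.00 |
| 191 | 177 | OpenAI o3-mini | 946 | ±6 | 6.7K | 12.3% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 192 | 153 | Ministral 14B 3.0 | 945 | ±28 | 490 | 11.7% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 193 | 170 | Devstral Medium | 945 | ±11 | 1.6K | 14.7% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 194 | 292 | GPT-5 Nano Minimal | 945 | ±11 | 1.3K | 12.9% | <0.1% | 88 tps | 0.8s | 400K | $0.05 | $0.40 |
| 195 | 241 | Claude Haiku 3 | 944 | ±11 | 880 | 10.7% | 0.4% | 62 tps | 0.5s | 200K | $0.25 | $1.25 |
| 196 | 194 | Llama 3.2 11B Instruct | 943 | ±14 | 745 | 14.4% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 |
| 197 | 219 | EXAONE Deep 32B | 941 | ±17 | 525 | 4.5% | <0.1% | 24 tps | N/A | 33K | $0 | $0 |
| 198 | 126 | Qwen3 30B A3B | 939 | ±5 | 3.7K | 12.1% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 199 | 201 | GPT-4o mini | 939 | ±9 | 1.4K | 9.2% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 200 | 148 | DeepSeek-R1 | 939 | ±12 | 1.6K | 5.5% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |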
Showing ranks 161 to 200 of 312 models.
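Many neighboring entries on the leaderboard differ by only a few VIBE Score points while carrying confidence intervals of ±10 or more, so the rank order alone can overstate the gap between models. The sketch below shows one rough way to read the two columns together: it checks whether two score intervals overlap. The `Entry` type and `intervals_overlap` helper are hypothetical names introduced for illustration, and the page does not state how the intervals are computed or what confidence level they use, so treat overlap as a heuristic rather than a significance test.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    """One leaderboard row, reduced to the fields needed here (hypothetical type)."""
    name: str
    score: float       # VIBE Score column
    half_width: float  # the "±" value from the Confidence Interval column

    @property
    def low(self) -> float:
        return self.score - self.half_width

    @property
    def high(self) -> float:
        return self.score + self.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score intervals share at least one point."""
    return a.low <= b.high and b.low <= a.high


# Values copied from the leaderboard rows for ranks 161, 162 and 200.
k2_think = Entry("K2 Think", 999, 14)
llama_8b = Entry("Llama 3.1 8B Turbo", 998, 12)
deepseek_r1 = Entry("DeepSeek-R1", 939, 12)

print(intervals_overlap(k2_think, llama_8b))     # True: 985..1013 vs 986..1010
print(intervals_overlap(k2_think, deepseek_r1))  # False: 985..1013 vs 927..951
```

Under this reading, ranks 161 and 162 are not clearly separated by the data, while ranks 161 and 200 are.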