Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

995
Seed 1.6 250615
994
Arcee AI Virtuoso-Large
994
Qwen3 30B A3B
992
Claude Haiku 3
989
GPT-5 Nano
988
OpenAI o3-mini-low
987
Grok Code Fast 1
986
GLM 4.6V
985
Cypher Alpha
982
GLM 4.6 FP8
981
Kimi K2 0711
981
Seed 2.0 Mini (Medium)
980
Mistral Small 3.1 24B Instruct
979
DeepSeek-R1 0528
976
DeepSeek V3.1 Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
201148Seed 1.6 250615995±211.6K6.0%3.1%46 tps2.2s256K$0.25$2.00
202195Arcee AI Virtuoso-Large994±83K5.7%<0.1%64 tps0.5s131K$0.75$1.20
203148Qwen3 30B A3B994±86.3K6.9%5.1%163 tps1.0s41K$0.06$0.21
204195Claude Haiku 3992±112.8K3.0%0.4%62 tps0.5s200K$0.25$1.25
205159GPT-5 Nano989±64.6K8.0%3.2%113 tps20.9s400K$0.05$0.40
206159OpenAI o3-mini-low988±612.2K6.4%0.7%139 tps1.5s200K$1.10$4.40
207159Grok Code Fast 1987±92.5K6.0%5.9%294 tps0.5s256K$0.20$1.50
208159GLM 4.6V986±83K5.5%6.4%21 tps1.8s128K$0.38$0.90
209195Cypher Alpha985±207358.7%<0.1%4 tpsN/A1M$0$0
210195GLM 4.6 FP8982±171.2K11.7%<0.1%56 tps1.8s200K$0.40$1.75
211159Kimi K2 0711981±67K4.5%1.6%29 tps1.3s131K$0.72$2.60
212159Seed 2.0 Mini (Medium)981±355705.8%11.9%33 tps1.7s256K$0.15$0.60
213159Mistral Small 3.1 24B Instruct980±112.9K4.3%7.5%15 tps2.4s131K$0.06$0.18
214159DeepSeek-R1 0528979±65.5K3.5%1.3%93 tps0.5s64K$1.60$3.67
215167DeepSeek V3.1 Thinking976±95.2K9.5%7.1%18 tps1.8s131K$0.23$0.75
216211Grok 4 (Low Reasoning)975±215202.8%<0.1%18 tps9.5s256K$0$0
217167Nemotron 3 Nano974±465806.5%1.3%216 tps0.8s256K$0.05$4.94
218211Arcee AI Coder-Large972±159854.4%<0.1%60 tps1.6s33K$0.50$0.80
219167Qwen 2.5 32B Instruct972±74.1K6.5%2.5%48 tps1.0s131K$0.21$0.25
220219Arcee Coder Large971±73.6K2.6%<0.1%54 tps1.3s33K$0.50$0.80
221167Llama 4 Maverick971±521K5.0%1.2%88 tps2.4s1M$0.23$0.83
222167Mistral Small 3.2 24B970±134.6K4.9%2.8%141 tps0.7s33K$0.02$0.08
223167Pixtral Large969±143.5K3.9%2.5%57 tps1.3s128K$1.50$4.50
224167Qwen3 VL 30B A3B Thinking967±111.9K8.9%4.5%84 tps2.9s127K$0.20$1.47
225167Llama 4 Scout965±517.5K5.3%0.6%88 tps5.1s131K$0.18$0.46
226167Devstral Medium962±113.5K5.2%1.5%77 tps0.6s131K$0.40$2.00
227167Qwen3 14B962±85.3K8.4%1.7%109 tps0.8s41K$0.04$0.15
228167Qwen 2.5 72B960±151.4K4.5%1.2%96 tps1.2s131K$0.14$0.26
229167Llama 3.1 8B Turbo958±142.2K2.0%2.1%650 tps0.5s128K$0.13$0.14
230179Qwen3 30B A3B Thinking 2507953±103.5K4.7%0.5%124 tps1.2s131K$0.16$1.70
231230Magistral Medium952±161.3K10.8%<0.1%95 tps0.5s41K$2.00$5.00
232179Switchpoint Router949±102.7K3.6%1.7%71 tps4.9s131K$0.85$3.40
233179Qwen3 8B948±94.2K8.2%2.4%61 tps1.4s41K$0.02$0.07
234179Ministral 14B 3.0948±168058.5%2.0%119 tps0.5s128K$0.20$0.20
235179Grok 3 Mini Fast943±79K7.0%1.6%44 tps0.5s131K$0.60$4.00
236179ERNIE 4.5 21B A3B943±285406.9%2.3%78 tps1.5s120K$0.05$0.19
237230NVIDIA Llama 3.3 Nemotron Super 49B v1942±93.6K2.3%<0.1%13 tpsN/A131K$0.07$0.20
238179ERNIE 4.5 VL 424B A47B942±187256.5%4.9%36 tps3.5s123K$0.42$1.25
239179NVIDIA Llama 3.3 Nemotron Super 49B v1.5941±191.4K6.9%2.0%50 tps0.6s131K$0.09$0.33
240179DeepSeek-R1939±66.4K4.3%0.8%133 tps0.6s64K$0.91$3.07
View All (404 models)