Models

Last updated about 1 month ago

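| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 161 | 144 | OpenAI o3 | 1020 | ±7 | 5.9K | 4.0% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 162 | 144 | Gemini 2.0 Flash | 1018 | ±7 | 8.2K | 3.8% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 163 | 189 | GLM 4.5 Air | 1016 | ±6 | 7.1K | 6.9% | <0.1% | 22 tps | 1.4s | 131K | $0.10 | $0.38 |
| 164 | 148 | Qwen3 VL 235B A22B Thinking | 1009 | ±6 | 4.6K | 8.3% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 165 | 148 | Qwen3 Coder Plus | 1007 | ±22 | 610 | 4.7% | 5.1% | 56 tps | 2.3s | 128K | $1.80 | $9.80 |
| 166 | 195 | GPT-5 Mini High | 1002 | ±9 | 3K | 7.7% | <0.1% | 33 tps | 3.9s | 400K | $0.25 | $2.00 |
| 167 | 148 | Qwen 2.5 VL 32B Instruct | 1001 | ±21 | 865 | 4.9% | 6.3% | 43 tps | 3.2s | 128K | $0.35 | $0.62 |
| 168 | 148 | Qwen3 235B A22B Thinking 2507 | 1000 | ±10 | 2.8K | 4.4% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 169 | 148 | OpenAI o3-mini-high | 999 | ±5 | 8.3K | 4.1% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 170 | 148 | OpenAI o3-mini | 999 | ±4 | 15K | 5.5% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 171 | 148 | OpenAI o4-mini-high | 995 | ±7 | 13.6K | 6.2% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 172 | 148 | Seed 1.6 250615 | 995 | ±21 | 1.6K | 6.0% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 173 | 195 | Arcee AI Virtuoso-Large | 994 | ±8 | 3K | 5.7% | <0.1% | 64 tps | 0.5s | 131K | $0.75 | $1.20 |
| 174 | 195 | Claude Haiku 3 | 992 | ±11 | 2.8K | 3.0% | 0.4% | 62 tps | 0.5s | 200K | $0.25 | $1.25 |
| 175 | 159 | GPT-5 Nano | 989 | ±6 | 4.6K | 8.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 176 | 159 | OpenAI o3-mini-low | 988 | ±6 | 12.2K | 6.4% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 177 | 159 | Grok Code Fast 1 | 987 | ±9 | 2.5K | 6.0% | 5.9% | 294 tps | 0.5s | 256K | $0.20 | $1.50 |
| 178 | 159 | GLM 4.6V | 986 | ±8 | 3K | 5.5% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 179 | 195 | Cypher Alpha | 985 | ±20 | 735 | 8.7% | <0.1% | 4 tps | N/A | 1M | $0 | $0 |
| 180 | 195 | GLM 4.6 FP8 | 982 | ±17 | 1.2K | 11.7% | <0.1% | 56 tps | 1.8s | 200K | $0.40 | $1.75 |
| 181 | 159 | Kimi K2 0711 | 981 | ±6 | 7K | 4.5% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 182 | 159 | Seed 2.0 Mini (Medium) | 981 | ±35 | 570 | 5.8% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 183 | 167 | DeepSeek V3.1 Thinking | 976 | ±9 | 5.2K | 9.5% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 184 | 211 | Grok 4 (Low Reasoning) | 975 | ±21 | 520 | 2.8% | <0.1% | 18 tps | 9.5s | 256K | $0 | $0 |
| 185 | 167 | Nemotron 3 Nano | 974 | ±46 | 580 | 6.5% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 186 | 211 | Arcee AI Coder-Large | 972 | ±15 | 985 | 4.4% | <0.1% | 60 tps | 1.6s | 33K | $0.50 | $0.80 |
| 187 | 219 | Arcee Coder Large | 971 | ±7 | 3.6K | 2.6% | <0.1% | 54 tps | 1.3s | 33K | $0.50 | $0.80 |
| 188 | 167 | Mistral Small 3.2 24B | 970 | ±13 | 4.6K | 4.9% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 189 | 167 | Llama 4 Scout | 965 | ±5 | 17.5K | 5.3% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 190 | 167 | Devstral Medium | 962 | ±11 | 3.5K | 5.2% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 191 | 167 | Qwen 2.5 72B | 960 | ±15 | 1.4K | 4.5% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 192 | 179 | Qwen3 30B A3B Thinking 2507 | 953 | ±10 | 3.5K | 4.7% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 193 | 230 | Magistral Medium | 952 | ±16 | 1.3K | 10.8% | <0.1% | 95 tps | 0.5s | 41K | $2.00 | $5.00 |
| 194 | 179 | Switchpoint Router | 949 | ±10 | 2.7K | 3.6% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 195 | 179 | Qwen3 8B | 948 | ±9 | 4.2K | 8.2% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 196 | 179 | Ministral 14B 3.0 | 948 | ±16 | 805 | 8.5% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 197 | 179 | Grok 3 Mini Fast | 943 | ±7 | 9K | 7.0% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 198 | 179 | ERNIE 4.5 21B A3B | 943 | ±28 | 540 | 6.9% | 2.3% | 78 tps | 1.5s | 120K | $0.05 | $0.19 |
| 199 | 179 | ERNIE 4.5 VL 424B A47B | 942 | ±18 | 725 | 6.5% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 |
| 200 | 179 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 941 | ±19 | 1.4K | 6.9% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |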
Showing ranks 161 to 200 of 305 models.
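
The VIBE Score and Confidence Interval columns are easiest to read together. The sketch below assumes the ± value is a symmetric half-width around the score; the page does not say how the interval is computed, so this is an illustration rather than Yupp's method. The `Entry` type and the `interval` and `overlaps` helpers are hypothetical names introduced here, and the three entries use figures taken from the leaderboard rows.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    name: str
    score: int       # VIBE Score column
    half_width: int  # Confidence Interval column (the ± value), assumed symmetric


def interval(e: Entry) -> tuple[int, int]:
    """Return the (low, high) bounds implied by score ± half_width."""
    return (e.score - e.half_width, e.score + e.half_width)


def overlaps(a: Entry, b: Entry) -> bool:
    """True when the two score intervals share at least one point."""
    lo_a, hi_a = interval(a)
    lo_b, hi_b = interval(b)
    return lo_a <= hi_b and lo_b <= hi_a


# Figures copied from the leaderboard rows.
o3 = Entry("OpenAI o3", 1020, 7)
flash = Entry("Gemini 2.0 Flash", 1018, 7)
nano = Entry("GPT-5 Nano", 989, 6)

print(interval(o3))         # (1013, 1027)
print(overlaps(o3, flash))  # True
print(overlaps(o3, nano))   # False
```

Under that assumption, adjacent entries such as OpenAI o3 and Gemini 2.0 Flash have overlapping intervals, so their relative order is less certain than the gap between OpenAI o3 and GPT-5 Nano.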