Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

992
Claude Haiku 3
994
Qwen3 30B A3B
994
Arcee AI Virtuoso-Large
995
Seed 1.6 250615
995
OpenAI o4-mini-high
999
OpenAI o3-mini
999
OpenAI o3-mini-high
1000
Qwen3 235B A22B Thinking 2507
1001
Qwen 2.5 VL 32B Instruct
1002
GPT-5 Mini High
1003
DeepSeek-R1 Turbo
1005
K2 Think
1007
Qwen3 Coder Plus
1009
Qwen3 VL 235B A22B Thinking
1012
Nemotron 3 Nano (Thinking)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
201195Claude Haiku 3992±112.8K3.0%0.4%62 tps0.5s200K$0.25$1.25
202148Qwen3 30B A3B994±86.3K6.9%5.1%163 tps1.0s41K$0.06$0.21
203195Arcee AI Virtuoso-Large994±83K5.7%<0.1%64 tps0.5s131K$0.75$1.20
204148Seed 1.6 250615995±211.6K6.0%3.1%46 tps2.2s256K$0.25$2.00
205148OpenAI o4-mini-high995±713.6K6.2%1.9%117 tps15.9s200K$1.10$4.40
206148OpenAI o3-mini999±415K5.5%0.8%143 tps3.3s200K$1.10$4.40
207148OpenAI o3-mini-high999±58.3K4.1%2.4%231 tps10.5s200K$1.10$4.40
208148Qwen3 235B A22B Thinking 25071000±102.8K4.4%2.5%53 tps1.6s131K$0.59$5.70
209148Qwen 2.5 VL 32B Instruct1001±218654.9%6.3%43 tps3.2s128K$0.35$0.62
210195GPT-5 Mini High1002±93K7.7%<0.1%33 tps3.9s400K$0.25$2.00
211148DeepSeek-R1 Turbo1003±92.5K5.6%2.6%29 tps1.8s64K$2.85$4.75
212189K2 Think1005±161.4K5.6%<0.1%418 tps2.8sN/A$0$0
213148Qwen3 Coder Plus1007±226104.7%5.1%56 tps2.3s128K$1.80$9.80
214148Qwen3 VL 235B A22B Thinking1009±64.6K8.3%4.3%47 tps3.0s127K$0.47$3.31
215148Nemotron 3 Nano (Thinking)1012±132K6.7%2.0%200 tps0.5s256K$0$0
216189GLM 4.5 Air1016±67.1K6.9%<0.1%22 tps1.4s131K$0.10$0.38
217144Gemini 2.0 Flash1018±78.2K3.8%<0.1%76 tps0.5s1M$0.14$0.56
218144OpenAI o31020±75.9K4.0%0.9%85 tps6.8s128K$7.33$29.33
219144DeepSeek V3.1 Nex N11021±195655.0%3.4%24 tps7.2s131K$0.14$0.50
220144Command A1024±422.4K4.8%2.2%42 tps0.8s256K$2.00$7.33
221135Amazon Nova 2 Lite1026±103.6K6.0%1.0%137 tps0.6s300K$0.35$2.95
222174Claude Haiku 3.51028±66.4K4.9%0.8%40 tps2.8s200K$0.80$4.00
223135Gemini 2.0 Flash Lite1029±514.7K9.5%<0.1%42 tps0.5s1M$0.08$0.30
224135DeepSeek V3.2 Speciale1030±102.3K6.1%6.0%43 tps1.4s131K$0.84$1.52
225135DeepSeek V31032±517.6K3.7%0.9%69 tps1.1s64K$0.59$1.49
226135Qwen3 VL 30B A3B Instruct1034±151K6.7%1.8%80 tps2.6s129K$0.18$0.67
227135Gemini 3.1 Flash Lite Preview1034±219804.4%1.0%8 tps1.2s1M$0.25$1.50
228135Gemini 2.5 Flash Lite Thinking Preview 09251035±75.8K6.8%1.5%152 tps3.0s1M$0.10$0.40
229135Qwen3 Next 80B A3B Thinking1035±56.2K7.4%0.6%175 tps1.3s256K$0.21$2.26
230174Qwen 2.5 72B Turbo1035±226705.0%<0.1%84 tps0.8s131K$0.60$0.60
231135QwQ 32B1035±411.6K6.4%5.4%41 tps2.1s16K$0.43$0.56
232164Llama 3 70B Turbo1037±64.3K1.0%<0.1%31 tps0.0s8K$0.73$0.83
233128Gemini 3.1 Flash Lite Preview Thinking1039±161.4K4.2%1.7%75 tps4.7s1M$0.25$1.50
234164EXAONE Deep 32B1040±148801.7%<0.1%24 tpsN/A33K$0$0
235128OpenAI o4-mini1042±58.5K6.4%1.4%97 tps7.0s128K$1.10$4.40
236128Kimi K2 Thinking1042±103.3K5.1%4.2%61 tps5.9s262K$0.24$1.03
237128GLM 4.5 AirX1042±151.1K6.9%3.3%75 tps1.2s131K$1.10$4.50
238164Grok 4 0709 EU1043±111.3K5.7%<0.1%33 tps8.2s128K$3.00$15.00
239128Qwen3 32B1044±198506.6%3.9%30 tps3.1s41K$0.12$0.42
240128Cogito v2.1 671B1044±191.2K4.6%0.8%85 tps0.5s128K$1.25$1.25
View All (404 models)