Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

958
OpenAI o4-mini-high
956
OpenAI o4-mini
955
DeepSeek V3.1 Thinking
952
Qwen3 Max Thinking Preview
951
OpenAI o3-pro
950
gpt-oss-20b
949
Qwen3.5 35B A3B
947
Mistral Large 3
928
Seed 1.6 250615
926
GPT-5 Mini High
926
Kimi K2 Thinking
925
Kimi K2 0905 Turbo
925
Gemini 1.5 Pro
922
Kimi K2 0905
922
Qwen3 VL 235B A22B Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121148OpenAI o4-mini-high958±112.2K3.1%1.9%117 tps15.9s200K$1.10$4.40
122139OpenAI o4-mini956±161.4K2.8%1.4%97 tps7.0s128K$1.10$4.40
123129DeepSeek V3.1 Thinking955±141.1K2.2%7.1%18 tps1.8s131K$0.23$0.75
12479Qwen3 Max Thinking Preview952±201.1K2.2%3.1%40 tps2.1s256K$1.20$6.00
12581OpenAI o3-pro951±191.6K3.4%5.2%22 tps70.8s200K$20.00$80.00
126101gpt-oss-20b950±181.4K4.7%0.5%216 tps0.5s131K$0.06$0.26
127101Qwen3.5 35B A3B949±275302.8%2.1%116 tps2.1s256K$0.63$1.13
12865Mistral Large 3947±201.3K4.4%2.1%51 tps1.0s256K$0.50$1.50
129143Seed 1.6 250615928±216355.2%3.1%46 tps2.2s256K$0.25$2.00
130241GPT-5 Mini High926±158804.9%<0.1%33 tps3.9s400K$0.25$2.00
13195Kimi K2 Thinking926±177402.0%4.2%61 tps5.9s262K$0.24$1.03
132124Kimi K2 0905 Turbo925±131.5K4.7%0.7%373 tps0.5s262K$1.70$6.50
133211Gemini 1.5 Pro925±207504.5%<0.1%15 tps0.0s2M$0.78$3.13
134133Kimi K2 0905922±218054.2%4.0%30 tps1.4s262K$0.63$2.39
135126Qwen3 VL 235B A22B Thinking922±187454.5%4.3%47 tps3.0s127K$0.47$3.31
136133Solar Pro 2 250710918±141.4K3.4%<0.1%9 tpsN/A66K$0.50$0.50
137119ERNIE 4.5 300B A47B918±171.6K2.7%4.7%23 tps2.3s123K$0.28$1.10
138126Qwen3 30B A3B918±158653.4%5.1%163 tps1.0s41K$0.06$0.21
139143Gemini 2.0 Flash Lite917±112.5K6.7%<0.1%42 tps0.5s1M$0.08$0.30
140121QwQ 32B910±131.8K3.1%5.4%41 tps2.1s16K$0.43$0.56
14162MiniMax M2905±181.4K3.5%2.2%39 tps2.3s205K$0.21$0.85
142213Claude Haiku 3.5901±102.7K5.9%0.8%40 tps2.8s200K$0.80$4.00
143177OpenAI o3-mini901±122.5K3.1%0.8%143 tps3.3s200K$1.10$4.40
144139Seed 2.0 Mini (Medium)900±305153.7%11.9%33 tps1.7s256K$0.15$0.60
14586Qwen3 235B A22B898±237253.3%5.3%71 tps0.9s41K$0.23$0.63
146302YouTube896±162.4K5.6%<0.1%34 tps2.7s32K$0.99$0.99
147148DeepSeek-R1894±121.1K3.8%0.8%133 tps0.6s64K$0.91$3.07
148161Llama 4 Maverick883±123.6K4.4%1.2%88 tps2.4s1M$0.23$0.83
149165Qwen3 4B878±237353.9%1.9%94 tps1.5s128K$0.01$0.01
150270Solar Pro 2 250710 (Reasoning)878±255053.8%<0.1%9 tpsN/A66K$0.50$0.50
151160Llama 4 Scout875±152.3K2.9%0.6%88 tps5.1s131K$0.18$0.46
152179GLM 4.7 Flash874±248552.8%5.8%61 tps2.8s128K$0.07$0.39
153246DeepSeek-R1 Distill Llama 70B869±285904.8%3.6%27 tps1.6s32K$0.73$0.95
154214OpenAI o3-mini-high868±131.4K3.8%2.4%231 tps10.5s200K$1.10$4.40
155129Qwen3 Max Thinking866±141.5K1.7%13.5%32 tps2.3s256K$1.20$6.00
156139GLM 4.6V865±248902.7%6.4%21 tps1.8s128K$0.38$0.90
157170Kimi K2 0711858±248904.3%1.6%29 tps1.3s131K$0.72$2.60
158133Qwen3 14B856±247452.6%1.7%109 tps0.8s41K$0.04$0.15
159121Qwen3 32B Fast853±131.8K4.2%11.6%30 tps3.1s41K$0.10$0.25
160157Qwen3 Next 80B A3B Thinking846±151.3K3.9%0.6%175 tps1.3s256K$0.21$2.26
View All (188 models)