Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1054
GPT-5.1 Codex Mini (High)
1057
GPT-5.1 Codex Mini (Medium)
1058
LongCat Flash Chat
1058
Seed 2.0 Lite (Medium)
1061
Gemini 2.5 Flash Lite Thinking
1061
OpenAI o1-pro
1061
DeepSeek V3.1 Terminus Thinking
1062
OpenAI o1
1063
Grok 4.20 Beta Non-reasoning
1066
gpt-oss-20b
1070
Kimi K2 0905 Turbo
1070
GPT-5 (Low)
1073
Kimi K2 Fast
1074
Kimi K2 0905
1075
GLM 4.5

Last updated about 1 month ago

RankNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
161GPT-5.1 Codex Mini (High)1054±152.2K3.9%5.9%70 tps4.6s400K$0.25$2.00
162GPT-5.1 Codex Mini (Medium)1057±151.9K4.9%4.6%69 tps4.1s400K$0.25$2.00
163LongCat Flash Chat1058±122.7K5.9%0.8%85 tps0.9s131K$0.14$0.68
164Seed 2.0 Lite (Medium)1058±205253.7%6.6%33 tps1.6s256K$0.25$2.00
165Gemini 2.5 Flash Lite Thinking1061±49.8K6.2%1.0%118 tps4.4s1M$0.03$0.13
166OpenAI o1-pro1061±206807.5%5.2%33 tps72.8s200K$150.00$600.00
167DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
168OpenAI o11062±69.9K3.3%4.2%92 tps5.5s200K$15.00$60.00
169Grok 4.20 Beta Non-reasoning1063±365004.8%1.1%151 tps0.6s2M$2.00$6.00
170gpt-oss-20b1066±67.7K7.1%0.5%216 tps0.5s131K$0.06$0.26
171Kimi K2 0905 Turbo1070±67.5K9.1%0.7%373 tps0.5s262K$1.70$6.50
172GPT-5 (Low)1070±146903.5%1.8%75 tps8.2s400K$1.25$10.00
173Kimi K2 Fast1073±535K6.4%0.8%365 tps0.5s131K$1.00$3.00
174Kimi K2 09051074±78.7K4.3%4.0%30 tps1.4s262K$0.63$2.39
175GLM 4.51075±56K7.0%3.7%46 tps1.4s131K$0.43$1.63
176Qwen3 Max Thinking1080±181.5K2.0%13.5%32 tps2.3s256K$1.20$6.00
177Mistral Medium1080±49.6K5.6%1.8%48 tps0.6s33K$1.48$4.55
178Seed 1.8 2512281081±103.2K3.1%3.7%41 tps2.1s256K$0.25$2.00
179DeepSeek V3 (Turbo)1082±201.5K5.1%1.5%32 tps1.5s64K$0.40$1.30
180Qwen3 Omni 30B A3B Instruct1085±137754.3%3.9%65 tps1.2s66K$0.35$0.97
181GPT-4.1 nano1085±517K5.0%0.6%175 tps0.5s1M$0.10$0.40
182GPT-4.1 mini1087±519.7K4.2%1.1%67 tps0.9s1M$0.34$1.60
183DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
184DeepSeek V3.11089±122.3K4.7%0.8%197 tps0.4s164K$0.55$1.60
185OpenAI o3-pro1090±85.4K4.3%5.2%22 tps70.8s200K$20.00$80.00
186Qwen3 235B A22B1093±64.5K8.0%5.3%71 tps0.9s41K$0.23$0.63
187DeepSeek V3 0324 Turbo1093±515.5K5.7%6.3%12 tps2.4s164K$0.73$1.79
188Grok 31098±419.1K5.5%1.5%53 tps0.6s1M$3.67$18.33
189Gemini 2.5 Flash1098±435.9K3.2%1.3%2 tps3.7s1M$0.30$2.50
190Qwen3 Coder 480B A35B Instruct1099±83.1K4.5%3.3%61 tps2.0s262K$0.71$1.34
191DeepSeek V3 03241100±415.1K4.3%5.8%12 tps2.7s164K$0.38$0.93
192Step 3.5 Flash1102±248103.6%2.2%109 tps0.6s256K$0.05$0.15
193GPT-4o1102±58.5K3.7%1.0%49 tps2.4s128K$3.71$12.57
194Grok 3 Fast1102±142.5K4.7%1.7%52 tps2.4s131K$5.00$25.00
195Gemini 2.5 Flash Lite1103±521.3K6.2%1.3%210 tps0.7s1M$0.10$0.40
196Qwen Max1107±418.3K4.2%1.5%49 tps1.5s33K$1.60$6.40
197DeepSeek V3.2 Exp Chat1107±45.5K6.1%2.6%29 tps1.5s131K$0.27$0.39
198Qwen3 Omni 30B A3B Thinking1110±102.3K6.0%3.7%67 tps1.2s66K$0.97$1.79
199DeepSeek V3.1 Chat1110±74.9K6.6%2.8%21 tps1.6s131K$0.38$1.00
200GPT-5.2 Codex (Low)1113±191.2K3.2%4.5%41 tps5.0s400K$1.75$14.00
View All (286 models)