Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1127
Qwen3 Max Thinking Preview
1125
Grok 4
1123
GPT-4.1
1122
Gemini 2.5 Flash Lite Preview 0925
1118
Gemini 2.5 Flash Thinking
1114
GPT-5 Mini Minimal
1113
GPT-5.2 Codex (Low)
1110
DeepSeek V3.1 Chat
1110
Qwen3 Omni 30B A3B Thinking
1107
DeepSeek V3.2 Exp Chat
1107
Qwen Max
1103
Gemini 2.5 Flash Lite
1102
Grok 3 Fast
1102
GPT-4o
1102
Step 3.5 Flash

Last updated about 1 month ago

RankNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81Qwen3 Max Thinking Preview1127±106.3K5.7%3.1%40 tps2.1s256K$1.20$6.00
82Grok 41125±339.6K4.4%3.9%29 tps11.1s256K$3.00$15.00
83GPT-4.11123±532.8K5.2%3.7%112 tps1.3s1M$2.00$8.00
84Gemini 2.5 Flash Lite Preview 09251122±78.5K6.6%1.2%209 tps0.7s1M$0.25$0.35
85Gemini 2.5 Flash Thinking1118±413.7K3.6%2.2%88 tps6.4s1M$0.30$2.50
86GPT-5 Mini Minimal1114±123.2K8.5%1.2%63 tps1.4s400K$0.25$2.00
87GPT-5.2 Codex (Low)1113±191.2K3.2%4.5%41 tps5.0s400K$1.75$14.00
88DeepSeek V3.1 Chat1110±74.9K6.6%2.8%21 tps1.6s131K$0.38$1.00
89Qwen3 Omni 30B A3B Thinking1110±102.3K6.0%3.7%67 tps1.2s66K$0.97$1.79
90DeepSeek V3.2 Exp Chat1107±45.5K6.1%2.6%29 tps1.5s131K$0.27$0.39
91Qwen Max1107±418.3K4.2%1.5%49 tps1.5s33K$1.60$6.40
92Gemini 2.5 Flash Lite1103±521.3K6.2%1.3%210 tps0.7s1M$0.10$0.40
93Grok 3 Fast1102±142.5K4.7%1.7%52 tps2.4s131K$5.00$25.00
94GPT-4o1102±58.5K3.7%1.0%49 tps2.4s128K$3.71$12.57
95Step 3.5 Flash1102±248103.6%2.2%109 tps0.6s256K$0.05$0.15
96DeepSeek V3 03241100±415.1K4.3%5.8%12 tps2.7s164K$0.38$0.93
97Qwen3 Coder 480B A35B Instruct1099±83.1K4.5%3.3%61 tps2.0s262K$0.71$1.34
98Gemini 2.5 Flash1098±435.9K3.2%1.3%2 tps3.7s1M$0.30$2.50
99Grok 31098±419.1K5.5%1.5%53 tps0.6s1M$3.67$18.33
100DeepSeek V3 0324 Turbo1093±515.5K5.7%6.3%12 tps2.4s164K$0.73$1.79
101Qwen3 235B A22B1093±64.5K8.0%5.3%71 tps0.9s41K$0.23$0.63
102OpenAI o3-pro1090±85.4K4.3%5.2%22 tps70.8s200K$20.00$80.00
103DeepSeek V3.11089±122.3K4.7%0.8%197 tps0.4s164K$0.55$1.60
104DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
105GPT-4.1 mini1087±519.7K4.2%1.1%67 tps0.9s1M$0.34$1.60
106GPT-4.1 nano1085±517K5.0%0.6%175 tps0.5s1M$0.10$0.40
107Qwen3 Omni 30B A3B Instruct1085±137754.3%3.9%65 tps1.2s66K$0.35$0.97
108DeepSeek V3 (Turbo)1082±201.5K5.1%1.5%32 tps1.5s64K$0.40$1.30
109Seed 1.8 2512281081±103.2K3.1%3.7%41 tps2.1s256K$0.25$2.00
110Mistral Medium1080±49.6K5.6%1.8%48 tps0.6s33K$1.48$4.55
111Qwen3 Max Thinking1080±181.5K2.0%13.5%32 tps2.3s256K$1.20$6.00
112GLM 4.51075±56K7.0%3.7%46 tps1.4s131K$0.43$1.63
113Kimi K2 09051074±78.7K4.3%4.0%30 tps1.4s262K$0.63$2.39
114Kimi K2 Fast1073±535K6.4%0.8%365 tps0.5s131K$1.00$3.00
115GPT-5 (Low)1070±146903.5%1.8%75 tps8.2s400K$1.25$10.00
116Kimi K2 0905 Turbo1070±67.5K9.1%0.7%373 tps0.5s262K$1.70$6.50
117gpt-oss-20b1066±67.7K7.1%0.5%216 tps0.5s131K$0.06$0.26
118Grok 4.20 Beta Non-reasoning1063±365004.8%1.1%151 tps0.6s2M$2.00$6.00
119OpenAI o11062±69.9K3.3%4.2%92 tps5.5s200K$15.00$60.00
120DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
View All (286 models)