Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1226
Grok 4.1 Fast Non-Reasoning
1226
Qwen3 Max Instruct Preview
1224
GPT-5 Codex (Medium)
1220
Qwen3 30B A3B Instruct 2507
1218
GPT-5.2 (Extra High)
1211
GPT-5.1 Instant
1209
GPT-5
1204
MiniMax M2.1
1202
GPT-5.1 Codex (Medium)
1197
Qwen3 VL 235B A22B Instruct
1194
GPT-5 (High)
1194
Grok 4 Fast Reasoning
1191
Kimi K2.5 Instant
1188
GPT-5 Codex (Low)
1187
MiniMax M2.1 Lightning

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
4131Grok 4.1 Fast Non-Reasoning1226±135.8K6.3%0.9%101 tps0.5s2M$0.20$0.50
4243Qwen3 Max Instruct Preview1226±94.7K7.7%1.1%31 tps1.7s256K$1.43$6.61
4336GPT-5 Codex (Medium)1224±106.2K3.9%4.1%122 tps5.2s400K$1.25$10.00
4449Qwen3 30B A3B Instruct 25071220±75.7K7.3%1.2%55 tps1.3s131K$0.13$0.72
4536GPT-5.2 (Extra High) 1218±135.4K3.6%13.2%17 tps20.5s400K$1.75$14.00
4660GPT-5.1 Instant1211±85.8K4.2%1.3%50 tps1.9s400K$1.25$10.00
4749GPT-51209±612.4K6.7%3.1%78 tps23.1s400K$1.25$9.67
4849MiniMax M2.11204±106.9K5.1%2.1%66 tps2.6s205K$0.30$1.20
4960GPT-5.1 Codex (Medium)1202±202.5K2.9%4.6%71 tps3.7s400K$1.25$10.00
5036Qwen3 VL 235B A22B Instruct1197±92.9K7.7%3.1%75 tps1.9s129K$0.37$1.81
5127GPT-5 (High)1194±88.9K4.0%4.5%81 tps35.9s400K$1.25$10.00
5260Grok 4 Fast Reasoning1194±87.7K6.5%2.1%102 tps3.1s2M$0.30$0.75
5336Kimi K2.5 Instant1191±141.4K3.4%2.9%32 tps3.0s262K$0.50$3.00
5469GPT-5 Codex (Low)1188±103.3K4.2%2.7%112 tps3.5s400K$1.25$10.00
5543MiniMax M2.1 Lightning1187±316553.0%1.7%52 tps2.1s205K$0.30$2.40
5660Claude Sonnet 3.5 v21186±84.9K3.4%<0.1%46 tps1.4s200K$3.00$15.00
5760Qwen3 235B A22B Instruct 25071181±76.8K7.8%6.8%13 tps1.9s262K$0.13$0.52
5849GLM 4.61179±104.4K8.3%5.4%39 tps1.5s200K$0.42$1.66
5990Grok 3 Fast1179±221.1K1.7%1.7%52 tps2.4s131K$5.00$25.00
6049Kimi K2 Thinking Turbo1177±135.3K4.5%2.0%75 tps1.4s262K$1.15$8.00
6149DeepSeek V3.21173±133K5.9%1.4%83 tps5.1s131K$0.43$1.09
6269DeepSeek V3.1 Terminus Chat1171±92.6K10.5%3.4%27 tps1.5s131K$0.86$1.80
6331MiniMax M2.5 Lightning1171±271.1K2.3%1.5%51 tps2.0s205K$0.60$2.40
6460Gemini 2.5 Pro1167±523K5.7%2.3%45 tps2.6s1M$1.25$10.00
6549MiniMax M21166±85.4K7.5%2.2%39 tps2.3s205K$0.21$0.85
6643Gemini 2.5 Pro High1164±610.4K7.1%1.5%48 tps2.3s1M$1.25$10.00
6760Grok 4.20 Beta Reasoning1157±209304.1%1.1%77 tps4.5s2M$2.00$5.50
6860DeepSeek V3.2 Thinking1153±96.7K4.6%9.0%30 tps2.6s131K$0.28$0.42
6969gpt-oss-120b1152±78.5K8.0%0.7%213 tps0.5s131K$0.11$0.50
7060Grok 4.1 Fast Reasoning1151±612.8K5.4%1.5%58 tps7.3s2M$0.20$0.50
7174Qwen Plus (Aug'24)1134±78.3K4.9%1.4%53 tps1.3s30K$0.40$1.20
7269GLM 4.71134±95.8K5.5%5.8%40 tps1.5s200K$0.77$1.73
7377GPT-5 Mini1133±66.5K6.0%2.6%66 tps14.2s400K$0.25$2.00
7485GPT-5.2 Codex (Low)1131±271K3.3%4.5%41 tps5.0s400K$1.75$14.00
7577Grok 41129±521.3K5.4%3.9%29 tps11.1s256K$3.00$15.00
7677GPT-4.11127±516.2K1.8%3.7%112 tps1.3s1M$2.00$8.00
7785Gemini 2.5 Flash Thinking1123±710.7K3.7%2.2%88 tps6.4s1M$0.30$2.50
7877Qwen3 Max Thinking Preview1122±143K7.8%3.1%40 tps2.1s256K$1.20$6.00
7998Grok 31120±89.2K4.8%1.5%53 tps0.6s1M$3.67$18.33
8090Qwen Max1119±69K4.7%1.5%49 tps1.5s33K$1.60$6.40
View All (273 models)