Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1223
GPT-4.5 Preview
1221
Nova Experimental Chat 10-20
1221
GPT-5.2 (Extra High)
1220
Qwen3 VL 235B A22B Instruct
1214
GPT-5 Codex (Medium)
1211
GPT-5.2 Codex (Medium)
1210
Claude Sonnet 3.7 (Thinking)
1206
Mistral Medium 3.1
1205
Claude Sonnet 4
1205
Gemini 3 Flash Preview
1204
Gemini 2.5 Pro High
1203
Qwen3 Max Instruct Preview
1201
Claude Sonnet 3.7
1200
GPT-5.1 Codex Max
1197
MiniMax M2.1 Lightning

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
4144GPT-4.5 Preview1223±72.5K1.8%<0.1%36 tps3.0s200K$75.00$150.00
4244Nova Experimental Chat 10-201221±54.4K8.1%<0.1%30 tps0.5s98K$0$0
4336GPT-5.2 (Extra High) 1221±98K3.5%13.2%17 tps20.5s400K$1.75$14.00
4436Qwen3 VL 235B A22B Instruct1220±75.6K6.7%3.1%75 tps1.9s129K$0.37$1.81
4536GPT-5 Codex (Medium)1214±68.8K3.9%4.1%122 tps5.2s400K$1.25$10.00
4636GPT-5.2 Codex (Medium)1211±122.4K3.0%5.7%37 tps6.3s400K$1.75$14.00
4753Claude Sonnet 3.7 (Thinking)1210±313.6K3.1%<0.1%41 tps2.6s200K$3.00$15.00
4853Mistral Medium 3.11206±516.4K5.1%<0.1%77 tps0.7s128K$0.40$2.00
4943Claude Sonnet 41205±343.2K3.7%1.8%49 tps1.3s200K$3.00$15.00
5043Gemini 3 Flash Preview1205±117.2K3.7%1.3%138 tps1.4s1M$0.50$3.00
5143Gemini 2.5 Pro High1204±321.1K5.7%1.5%48 tps2.3s1M$1.25$10.00
5243Qwen3 Max Instruct Preview1203±616.1K4.6%1.1%31 tps1.7s256K$1.43$6.61
5358Claude Sonnet 3.71201±412.1K3.2%<0.1%39 tps1.6s200K$3.00$15.00
5443GPT-5.1 Codex Max1200±126.4K3.9%3.0%118 tps4.1s400K$1.25$10.00
5543MiniMax M2.1 Lightning1197±238753.3%1.7%52 tps2.1s205K$0.30$2.40
5649Qwen3 30B A3B Instruct 25071194±512.7K5.7%1.2%55 tps1.3s131K$0.13$0.72
5762OpenAI o1-mini1192±415K4.6%<0.1%118 tpsN/A128K$1.13$4.51
5849MiniMax M2.11192±819.4K3.6%2.1%66 tps2.6s205K$0.30$1.20
5949DeepSeek V3.21189±85.1K4.7%1.4%83 tps5.1s131K$0.43$1.09
6062Qwen Plus 07281189±82.1K7.5%<0.1%55 tps0.9s1M$0.40$1.20
6149MiniMax M2.5 FP81185±176103.2%3.6%33 tps1.7s205K$0.45$1.75
6249GPT-51185±421.3K5.3%3.1%78 tps23.1s400K$1.25$9.67
6349Grok 4 Fast Non-Reasoning1185±58.1K7.1%1.5%93 tps0.6s2M$0.27$0.67
6449MiniMax M21183±519.7K4.2%2.2%39 tps2.3s205K$0.21$0.85
6549Nova Experimental Chat 12-101182±92.9K3.8%2.4%84 tps12.9s98K$0$0
6649GLM 4.61182±717.2K4.4%5.4%39 tps1.5s200K$0.42$1.66
6749GPT-5.3 Codex (Low)1178±285101.0%1.8%61 tps4.3s400K$1.75$14.00
6860Grok 4.1 Fast Reasoning1178±739.5K4.4%1.5%58 tps7.3s2M$0.20$0.50
6960Grok 4 Fast Reasoning1177±314.5K5.0%2.1%102 tps3.1s2M$0.30$0.75
7060Gemini 2.5 Pro1176±337.9K4.8%2.3%45 tps2.6s1M$1.25$10.00
7175Gemini 2.5 Flash Thinking Preview 09251173±79.2K6.8%<0.1%111 tps4.7s1M$0.30$2.50
7260Qwen3 235B A22B Instruct 25071172±412.6K6.4%6.8%13 tps1.9s262K$0.13$0.52
7360Claude Sonnet 3.5 v21171±65.5K3.4%<0.1%46 tps1.4s200K$3.00$15.00
7460GPT-5.1 Codex (Medium)1171±143K3.2%4.6%71 tps3.7s400K$1.25$10.00
7560GPT-5.1 Instant1171±88.3K4.1%1.3%50 tps1.9s400K$1.25$10.00
7675Gemini 2.5 Pro Low1170±49.6K8.1%<0.1%89 tps2.4s1M$1.25$10.00
7760Grok 4.20 Beta Reasoning1167±221.2K4.1%1.1%77 tps4.5s2M$2.00$5.50
7869Qwen3.5 35B A3B1164±258653.9%2.1%116 tps2.1s256K$0.63$1.13
7969GPT-5 Codex (Low)1163±105K4.1%2.7%112 tps3.5s400K$1.25$10.00
8069GLM 4.71161±716.8K3.7%5.8%40 tps1.5s200K$0.77$1.73
View All (305 models)