Models

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

Last updated about 1 month ago

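| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 321 | 75 | Gemini 2.5 Pro Low | 1170 | ±4 | 9.6K | 8.1% | <0.1% | 89 tps | 2.4s | 1M | $1.25 | $10.00 |
| 322 | 60 | GPT-5.1 Instant | 1171 | ±8 | 8.3K | 4.1% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 323 | 60 | GPT-5.1 Codex (Medium) | 1171 | ±14 | 3K | 3.2% | 4.6% | 71 tps | 3.7s | 400K | $1.25 | $10.00 |
| 324 | 60 | Claude Sonnet 3.5 v2 | 1171 | ±6 | 5.5K | 3.4% | <0.1% | 46 tps | 1.4s | 200K | $3.00 | $15.00 |
| 325 | 60 | Qwen3 235B A22B Instruct 2507 | 1172 | ±4 | 12.6K | 6.4% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 326 | 75 | Gemini 2.5 Flash Thinking Preview 0925 | 1173 | ±7 | 9.2K | 6.8% | <0.1% | 111 tps | 4.7s | 1M | $0.30 | $2.50 |
| 327 | 60 | Gemini 2.5 Pro | 1176 | ±3 | 37.9K | 4.8% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 328 | 60 | Grok 4 Fast Reasoning | 1177 | ±3 | 14.5K | 5.0% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 329 | 60 | DeepSeek V3.2 Thinking | 1178 | ±9 | 23.3K | 4.0% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 330 | 60 | Grok 4.1 Fast Reasoning | 1178 | ±7 | 39.5K | 4.4% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 |
| 331 | 49 | GPT-5.3 Codex (Low) | 1178 | ±28 | 510 | 1.0% | 1.8% | 61 tps | 4.3s | 400K | $1.75 | $14.00 |
| 332 | 49 | GLM 4.6 | 1182 | ±7 | 17.2K | 4.4% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 333 | 49 | Nova Experimental Chat 12-10 | 1182 | ±9 | 2.9K | 3.8% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0 |
| 334 | 49 | MiniMax M2 | 1183 | ±5 | 19.7K | 4.2% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 335 | 49 | Grok 4 Fast Non-Reasoning | 1185 | ±5 | 8.1K | 7.1% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 |
| 336 | 49 | GPT-5 | 1185 | ±4 | 21.3K | 5.3% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 337 | 49 | MiniMax M2.5 FP8 | 1185 | ±17 | 610 | 3.2% | 3.6% | 33 tps | 1.7s | 205K | $0.45 | $1.75 |
| 338 | 62 | Qwen Plus 0728 | 1189 | ±8 | 2.1K | 7.5% | <0.1% | 55 tps | 0.9s | 1M | $0.40 | $1.20 |
| 339 | 49 | DeepSeek V3.2 | 1189 | ±8 | 5.1K | 4.7% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09 |
| 340 | 49 | MiniMax M2.1 | 1192 | ±8 | 19.4K | 3.6% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 341 | 62 | OpenAI o1-mini | 1192 | ±4 | 15K | 4.6% | <0.1% | 118 tps | N/A | 128K | $1.13 | $4.51 |
| 342 | 49 | Kimi K2 Thinking Turbo | 1192 | ±6 | 20.3K | 3.4% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 343 | 49 | Qwen3 30B A3B Instruct 2507 | 1194 | ±5 | 12.7K | 5.7% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 344 | 43 | MiniMax M2.1 Lightning | 1197 | ±23 | 875 | 3.3% | 1.7% | 52 tps | 2.1s | 205K | $0.30 | $2.40 |
| 345 | 43 | GPT-5.1 Codex Max | 1200 | ±12 | 6.4K | 3.9% | 3.0% | 118 tps | 4.1s | 400K | $1.25 | $10.00 |
| 346 | 58 | Claude Sonnet 3.7 | 1201 | ±4 | 12.1K | 3.2% | <0.1% | 39 tps | 1.6s | 200K | $3.00 | $15.00 |
| 347 | 43 | Qwen3 Max Instruct Preview | 1203 | ±6 | 16.1K | 4.6% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |
| 348 | 43 | Gemini 2.5 Pro High | 1204 | ±3 | 21.1K | 5.7% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 349 | 43 | Gemini 3 Flash Preview | 1205 | ±11 | 7.2K | 3.7% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 350 | 43 | Claude Sonnet 4 | 1205 | ±3 | 43.2K | 3.7% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 351 | 53 | Mistral Medium 3.1 | 1206 | ±5 | 16.4K | 5.1% | <0.1% | 77 tps | 0.7s | 128K | $0.40 | $2.00 |
| 352 | 53 | Claude Sonnet 3.7 (Thinking) | 1210 | ±3 | 13.6K | 3.1% | <0.1% | 41 tps | 2.6s | 200K | $3.00 | $15.00 |
| 353 | 36 | Kimi K2.5 Instant | 1210 | ±8 | 1.8K | 3.2% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 354 | 36 | Qwen3.5 27B | 1211 | ±16 | 910 | 4.7% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 355 | 36 | GPT-5.2 Codex (Medium) | 1211 | ±12 | 2.4K | 3.0% | 5.7% | 37 tps | 6.3s | 400K | $1.75 | $14.00 |
| 356 | 36 | GPT-5 Codex (Medium) | 1214 | ±6 | 8.8K | 3.9% | 4.1% | 122 tps | 5.2s | 400K | $1.25 | $10.00 |
| 357 | 36 | Qwen3.5 122B A17B | 1216 | ±15 | 1.9K | 3.1% | 1.5% | 82 tps | 1.4s | 256K | $0.40 | $3.20 |
| 358 | 36 | Qwen3 VL 235B A22B Instruct | 1220 | ±7 | 5.6K | 6.7% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 359 | 36 | GPT-5.2 (Extra High) | 1221 | ±9 | 8K | 3.5% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 360 | 44 | Nova Experimental Chat 10-20 | 1221 | ±5 | 4.4K | 8.1% | <0.1% | 30 tps | 0.5s | 98K | $0 | $0 |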
View All (404 models)