Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

870
Qwen3 Max Thinking Preview
873
gpt-oss-120b
877
DeepSeek V3.2 Exp Thinking
909
Mistral Medium 3.1
936
Kimi K2 0905
1001
Grok 4 Fast Reasoning
1007
Qwen3 Max Instruct Preview
1035
GPT-5 (High)
1069
GPT-5 Codex (High)
1090
Gemini 2.5 Pro High
1092
DeepSeek V3.2 Thinking
1098
Grok 4.1 Fast Reasoning
1103
GPT-5.1 Codex (High)
1113
Kimi K2 Thinking Turbo
1124
MiniMax M2.1

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
184Qwen3 Max Thinking Preview870±241.6K2.2%3.1%40 tps2.1s256K$1.20$6.00
251gpt-oss-120b873±175K1.9%0.7%213 tps0.5s131K$0.11$0.50
3103DeepSeek V3.2 Exp Thinking877±193.7K2.1%7.2%26 tps3.0s131K$0.28$0.42
419Mistral Medium 3.1909±205.1K1.8%<0.1%77 tps0.7s128K$0.40$2.00
5146Kimi K2 0905936±264.8K2.2%4.0%30 tps1.4s262K$0.63$2.39
651Grok 4 Fast Reasoning1001±215.2K2.2%2.1%102 tps3.1s2M$0.30$0.75
745Qwen3 Max Instruct Preview1007±205.5K1.4%1.1%31 tps1.7s256K$1.43$6.61
827GPT-5 (High)1035±193.4K1.9%4.5%81 tps35.9s400K$1.25$10.00
934GPT-5 Codex (High)1069±184.9K1.9%3.2%122 tps7.1s400K$1.25$10.00
1033Gemini 2.5 Pro High1090±214.6K1.4%1.5%48 tps2.3s1M$1.25$10.00
1159DeepSeek V3.2 Thinking1092±1416.2K3.8%9.0%30 tps2.6s131K$0.28$0.42
1247Grok 4.1 Fast Reasoning1098±1327.4K4.1%1.5%58 tps7.3s2M$0.20$0.50
1359GPT-5.1 Codex (High)1103±1324.3K3.4%3.2%96 tps3.9s400K$1.25$10.00
1447Kimi K2 Thinking Turbo1113±1614.3K2.6%2.0%75 tps1.4s262K$1.15$8.00
1564MiniMax M2.11124±1411.4K2.7%2.1%66 tps2.6s205K$0.30$1.20
1666MiniMax M21133±1612.4K2.4%2.2%39 tps2.3s205K$0.21$0.85
178GPT-5.1 (High)1133±176.4K2.2%3.2%76 tps6.9s400K$1.25$10.00
1873GLM 4.71133±159.7K2.6%5.8%40 tps1.5s200K$0.77$1.73
1969GLM 4.61147±1311.1K1.9%5.4%39 tps1.5s200K$0.42$1.66
2017GPT-5.2 (High)1159±1312.3K2.8%6.7%18 tps16.3s400K$1.75$14.00
215Claude Sonnet 4.6 (Thinking)1170±235.6K6.1%4.7%57 tps1.1s200K$3.00$15.00
2214Gemini 3 Flash Preview Thinking1178±1320.9K3.4%1.6%3 tps6.2s1M$0.50$3.00
2322GLM 51200±178.3K3.5%3.4%36 tps2.7s200K$0.72$2.55
2410Claude Sonnet 4.5 (Thinking)1239±1013.8K2.1%1.9%44 tps1.1s200K$3.00$15.00
2510Gemini 3 Pro1269±1123.4K2.3%2.1%50 tps3.6s1M$2.00$12.00
264GPT-5.4 (High)1284±172.5K5.3%4.6%68 tps7.9s1M$2.50$15.00
277Claude Opus 4.5 (Thinking)1293±1316.9K1.9%1.8%49 tps1.4s200K$5.00$25.00
286Gemini 3.1 Pro1312±147.6K3.9%3.5%35 tps4.1s1M$2.00$12.00
291Claude Opus 4.6 (Thinking)1468±154.4K3.1%2.5%56 tps1.6s200K$5.00$25.00