Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1468
Claude Opus 4.6 (Thinking)
1312
Gemini 3.1 Pro
1293
Claude Opus 4.5 (Thinking)
1269
Gemini 3 Pro
1239
Claude Sonnet 4.5 (Thinking)
1200
GLM 5
1178
Gemini 3 Flash Preview Thinking
1170
Claude Sonnet 4.6 (Thinking)
1159
GPT-5.2 (High)
1147
GLM 4.6
1133
GLM 4.7
1133
GPT-5.1 (High)
1133
MiniMax M2
1124
MiniMax M2.1
1103
GPT-5.1 Codex (High)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1468±154.4K3.1%2.5%56 tps1.6s200K$5.00$25.00
26Gemini 3.1 Pro1312±147.6K3.9%3.5%35 tps4.1s1M$2.00$12.00
37Claude Opus 4.5 (Thinking)1293±1316.9K1.9%1.8%49 tps1.4s200K$5.00$25.00
410Gemini 3 Pro1269±1123.4K2.3%2.1%50 tps3.6s1M$2.00$12.00
510Claude Sonnet 4.5 (Thinking)1239±1013.8K2.1%1.9%44 tps1.1s200K$3.00$15.00
622GLM 51200±178.3K3.5%3.4%36 tps2.7s200K$0.72$2.55
714Gemini 3 Flash Preview Thinking1178±1320.9K3.4%1.6%3 tps6.2s1M$0.50$3.00
85Claude Sonnet 4.6 (Thinking)1170±235.6K6.1%4.7%57 tps1.1s200K$3.00$15.00
917GPT-5.2 (High)1159±1312.3K2.8%6.7%18 tps16.3s400K$1.75$14.00
1069GLM 4.61147±1311.1K1.9%5.4%39 tps1.5s200K$0.42$1.66
1173GLM 4.71133±159.7K2.6%5.8%40 tps1.5s200K$0.77$1.73
128GPT-5.1 (High)1133±176.4K2.2%3.2%76 tps6.9s400K$1.25$10.00
1366MiniMax M21133±1612.4K2.4%2.2%39 tps2.3s205K$0.21$0.85
1464MiniMax M2.11124±1411.4K2.7%2.1%66 tps2.6s205K$0.30$1.20
1559GPT-5.1 Codex (High)1103±1324.3K3.4%3.2%96 tps3.9s400K$1.25$10.00
1647Grok 4.1 Fast Reasoning1098±1327.4K4.1%1.5%58 tps7.3s2M$0.20$0.50
1733Gemini 2.5 Pro High1090±214.6K1.4%1.5%48 tps2.3s1M$1.25$10.00
1834GPT-5 Codex (High)1069±184.9K1.9%3.2%122 tps7.1s400K$1.25$10.00
1927GPT-5 (High)1035±193.4K1.9%4.5%81 tps35.9s400K$1.25$10.00
2045Qwen3 Max Instruct Preview1007±205.5K1.4%1.1%31 tps1.7s256K$1.43$6.61
2151Grok 4 Fast Reasoning1001±215.2K2.2%2.1%102 tps3.1s2M$0.30$0.75
22146Kimi K2 0905936±264.8K2.2%4.0%30 tps1.4s262K$0.63$2.39
2384Qwen3 Max Thinking Preview870±241.6K2.2%3.1%40 tps2.1s256K$1.20$6.00