

Last updated about 1 month ago

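| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 121 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1132 | ±12 | 1.7K | 3.9% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 122 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1134 | ±9 | 3.5K | 3.4% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 123 | 86 | Claude Sonnet 4 | 1138 | ±7 | 7.4K | 2.3% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 124 | 42 | GPT-5.2 (Extra High) | 1146 | ±10 | 3.5K | 2.3% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 125 | 33 | Grok 4.20 Multi Agent Beta | 1158 | ±23 | 665 | 1.5% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 |
| 126 | 60 | MiniMax M2.1 | 1159 | ±11 | 2.1K | 2.1% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 127 | 71 | Gemini 2.5 Flash Thinking | 1161 | ±4 | 6.8K | 3.0% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 128 | 52 | Claude Haiku 4.5 | 1163 | ±8 | 5.3K | 4.1% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 129 | 62 | GPT-5.1 Instant | 1168 | ±7 | 4.3K | 2.5% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 130 | 44 | Gemini 2.5 Pro | 1184 | ±5 | 13.8K | 2.9% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 131 | 32 | Gemini 2.5 Pro High | 1191 | ±6 | 6.1K | 3.2% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 132 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1196 | ±6 | 3K | 2.6% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 133 | 17 | Grok 4.20 Beta Reasoning | 1209 | ±21 | 915 | 2.1% | 1.1% | 77 tps | 4.5s | 2M | $2.00 | $5.50 |
| 134 | 37 | Claude Sonnet 4.5 | 1230 | ±6 | 4.7K | 4.1% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 135 | 17 | GPT-5.2 (High) | 1236 | ±8 | 11.2K | 1.8% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 136 | 13 | GPT-5.3 Instant | 1240 | ±14 | 4.5K | 1.9% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 137 | 22 | GPT-5 Chat | 1243 | ±7 | 11.4K | 2.5% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 138 | 17 | Gemini 3 Flash Preview | 1248 | ±12 | 3.9K | 2.2% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 139 | 8 | GPT-5.1 (High) | 1252 | ±8 | 6.4K | 1.9% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |
| 140 | 16 | GPT-5.2 | 1254 | ±11 | 4.5K | 1.8% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 141 | 17 | Claude Opus 4.5 | 1259 | ±7 | 4.1K | 2.9% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 142 | 10 | GPT-5.2 Instant | 1262 | ±7 | 6.9K | 1.8% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 143 | 14 | Gemini 3 Pro (Low) | 1262 | ±6 | 6.1K | 2.2% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 144 | 14 | Gemini 3 Flash Preview Thinking | 1272 | ±9 | 7.9K | 1.8% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 145 | 7 | Claude Opus 4.5 (Thinking) | 1272 | ±7 | 11.3K | 2.0% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 |
| 146 | 10 | Gemini 3 Pro | 1285 | ±9 | 17.6K | 1.5% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 147 | 8 | GPT-5.1 | 1295 | ±7 | 4.3K | 2.2% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 148 | 6 | Gemini 3.1 Pro | 1317 | ±8 | 7.9K | 1.6% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |
| 149 | 10 | Claude Sonnet 4.5 (Thinking) | 1319 | ±4 | 6.7K | 2.4% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 150 | 4 | Claude Sonnet 4.6 | 1345 | ±11 | 4.7K | 1.3% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 |
| 151 | 5 | Claude Sonnet 4.6 (Thinking) | 1377 | ±9 | 4.9K | 1.3% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00 |
| 152 | 2 | Claude Opus 4.6 | 1420 | ±11 | 6.5K | 1.1% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 |
| 153 | 1 | Claude Opus 4.6 (Thinking) | 1440 | ±9 | 5.1K | 1.2% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00 |
| 154 | 2 | GPT-5.4 | 1446 | ±14 | 1.7K | 1.7% | 2.6% | 55 tps | 0.8s | 1M | $2.50 | $15.00 |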
View All (154 models)