Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1362
Claude Sonnet 4.5 (Thinking)
1381
GPT-5.3 Codex (High)
1409
Claude Opus 4.5
1418
Gemini 3.1 Pro
1446
Claude Opus 4.5 (Thinking)
1506
Claude Sonnet 4.6 (Thinking)
1566
Claude Opus 4.6 (Thinking)
1594
GPT-5.4
1594
Claude Sonnet 4.6
1596
Claude Opus 4.6

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
20110Claude Sonnet 4.5 (Thinking)1362±458.2K3.3%1.9%44 tps1.1s200K$3.00$15.00
2029GPT-5.3 Codex (High)1381±93.2K1.2%2.0%61 tps17.8s400K$1.75$14.00
2037Claude Opus 4.51409±515.1K2.2%1.5%45 tps1.5s200K$5.00$25.00
2047Gemini 3.1 Pro1418±922K2.5%3.5%35 tps4.1s1M$2.00$12.00
2056Claude Opus 4.5 (Thinking)1446±460.8K1.9%1.8%49 tps1.4s200K$5.00$25.00
2065Claude Sonnet 4.6 (Thinking)1506±816.2K3.5%4.7%57 tps1.1s200K$3.00$15.00
2074Claude Opus 4.6 (Thinking)1566±816.5K1.6%2.5%56 tps1.6s200K$5.00$25.00
2081GPT-5.41594±144.4K1.6%2.6%55 tps0.8s1M$2.50$15.00
2091Claude Sonnet 4.61594±1015.7K1.4%1.6%47 tps1.2s200K$3.00$15.00
2101Claude Opus 4.61596±621.6K1.1%2.1%48 tps1.7s200K$5.00$25.00
View All (210 models)