Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1316
Grok 4.20 Beta Non-reasoning
1342
Claude Opus 4.6 (Thinking)
1364
GPT-5.4

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
28122Grok 4.20 Beta Non-reasoning1316±136303.8%1.1%151 tps0.6s2M$2.00$6.00
2821Claude Opus 4.6 (Thinking)1342±55.7K0.9%2.5%56 tps1.6s200K$5.00$25.00
2832GPT-5.41364±72.3K1.1%2.6%55 tps0.8s1M$2.50$15.00
View All (283 models)