Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1297
GPT-5 Chat
1066
Gemini 3 Pro
944
Gemini 2.5 Pro
900
Claude Sonnet 4
868
Grok 4
712
Gemini 2.5 Flash

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
122GPT-5 Chat1297±836001.6%1.3%95 tps0.9s400K$1.25$10.00
210Gemini 3 Pro1066±2197400.7%2.1%50 tps3.6s1M$2.00$12.00
344Gemini 2.5 Pro944±1016501.5%2.3%45 tps2.6s1M$1.25$10.00
486Claude Sonnet 4900±1467301.4%1.8%49 tps1.3s200K$3.00$15.00
568Grok 4868±1357701.9%3.9%29 tps11.1s256K$3.00$15.00
695Gemini 2.5 Flash712±1816400.8%1.3%2 tps3.7s1M$0.30$2.50