Last updated about 1 month ago

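| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 71 | GPT-5 Mini | 870 | ±15 | 940 | 4.1% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 82 | 95 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 859 | ±16 | 1.2K | 2.5% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40 |
| 83 | 143 | Gemini 2.0 Flash Lite | 856 | ±24 | 585 | 3.3% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 84 | 148 | OpenAI o4-mini-high | 854 | ±22 | 560 | 1.8% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 85 | 93 | DeepSeek V3 0324 Turbo | 843 | ±21 | 635 | 0.8% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 86 | 157 | GPT-5 Nano | 836 | ±34 | 685 | 4.2% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 87 | 129 | Command A | 836 | ±15 | 855 | 1.2% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 88 | 133 | GPT-4.1 nano | 824 | ±22 | 735 | 2.0% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 89 | 101 | gpt-oss-20b | 816 | ±20 | 555 | 1.8% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 90 | 113 | Gemini 2.5 Flash Lite Thinking | 804 | ±18 | 775 | 3.1% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13 |
| 91 | 213 | Claude Haiku 3.5 | 803 | ±24 | 545 | 6.0% | 0.8% | 40 tps | 2.8s | 200K | $0.80 | $4.00 |
| 92 | 302 | YouTube | 802 | ±22 | 485 | 4.0% | <0.1% | 34 tps | 2.7s | 32K | $0.99 | $0.99 |
| 93 | 139 | OpenAI o4-mini | 768 | ±32 | 545 | 0.9% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 94 | 157 | Qwen3 Next 80B A3B Thinking | 767 | ±23 | 810 | 2.4% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 95 | 160 | Llama 4 Scout | 722 | ±33 | 700 | 2.1% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 96 | 161 | Llama 4 Maverick | 719 | ±27 | 1K | 2.9% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 97 | 177 | OpenAI o3-mini | 676 | ±27 | 690 | 1.4% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 98 | 175 | OpenAI o3-mini-low | 659 | ±21 | 505 | 2.9% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |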
View All (98 models)