Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

988
Grok 4 Fast Reasoning
986
Gemini 2.5 Flash Lite Thinking
979
GPT-5 Mini
976
ERNIE 4.5 300B A47B
976
Gemini 2.5 Flash
974
MiniMax M2
968
Gemini 2.5 Flash Lite Preview 0925
964
Gemini 2.5 Flash Preview 0925
963
GPT-4.1 mini
958
Claude Sonnet 4
947
Llama 4 Scout
946
Kimi K2 0905 Turbo
944
GPT-4.1 nano
933
GLM 4.7
925
Kimi K2 0711

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
4148Grok 4 Fast Reasoning988±176350.8%2.1%102 tps3.1s2M$0.30$0.75
42113Gemini 2.5 Flash Lite Thinking986±195801.7%1.0%118 tps4.4s1M$0.03$0.13
4371GPT-5 Mini979±275350.9%2.6%66 tps14.2s400K$0.25$2.00
44119ERNIE 4.5 300B A47B976±151.2K2.0%4.7%23 tps2.3s123K$0.28$1.10
4595Gemini 2.5 Flash976±143.7K1.6%1.3%2 tps3.7s1M$0.30$2.50
4662MiniMax M2974±217601.3%2.2%39 tps2.3s205K$0.21$0.85
4771Gemini 2.5 Flash Lite Preview 0925968±186801.4%1.2%209 tps0.7s1M$0.25$0.35
4860Gemini 2.5 Flash Preview 0925964±247001.4%1.2%5 tps0.9s1M$0.13$0.97
49118GPT-4.1 mini963±131.9K1.3%1.1%67 tps0.9s1M$0.34$1.60
5086Claude Sonnet 4958±123.4K1.0%1.8%49 tps1.3s200K$3.00$15.00
51160Llama 4 Scout947±141.6K1.9%0.6%88 tps5.1s131K$0.18$0.46
52124Kimi K2 0905 Turbo946±197951.9%0.7%373 tps0.5s262K$1.70$6.50
53133GPT-4.1 nano944±131.5K2.0%0.6%175 tps0.5s1M$0.10$0.40
5468GLM 4.7933±208701.7%5.8%40 tps1.5s200K$0.77$1.73
55170Kimi K2 0711925±205103.8%1.6%29 tps1.3s131K$0.72$2.60
56139OpenAI o4-mini917±156502.3%1.4%97 tps7.0s128K$1.10$4.40
57113Mistral Medium908±218451.7%1.8%48 tps0.6s33K$1.48$4.55
5879Qwen3 Max Thinking Preview903±335350.9%3.1%40 tps2.1s256K$1.20$6.00
5971Seed 1.8 251228894±256601.5%3.7%41 tps2.1s256K$0.25$2.00
60143Gemini 2.0 Flash887±236802.2%<0.1%76 tps0.5s1M$0.14$0.56
6171Gemini 2.5 Flash Thinking886±226301.6%2.2%88 tps6.4s1M$0.30$2.50
62143Gemini 2.0 Flash Lite883±221.6K1.2%<0.1%42 tps0.5s1M$0.08$0.30
63148OpenAI o4-mini-high877±196852.1%1.9%117 tps15.9s200K$1.10$4.40
64186Gemma 3n E4B805±255451.8%2.0%30 tps0.5s8K$0.01$0.02
65177OpenAI o3-mini738±217002.1%0.8%143 tps3.3s200K$1.10$4.40
66175OpenAI o3-mini-low736±285552.6%0.7%139 tps1.5s200K$1.10$4.40
67186Grok 3 Mini Fast733±255403.6%1.6%44 tps0.5s131K$0.60$4.00
68186Grok 3 Mini725±296253.1%1.2%43 tps0.5s131K$0.30$0.50
View All (68 models)