Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1012
Gemini 2.5 Flash Lite
1007
MiniMax M2.1
996
Grok 3
992
Kimi K2 Fast
989
Grok 4
988
Grok 4 Fast Reasoning
986
Gemini 2.5 Flash Lite Thinking
979
GPT-5 Mini
976
ERNIE 4.5 300B A47B
976
Gemini 2.5 Flash
974
DeepSeek V3
974
MiniMax M2
968
Gemini 2.5 Flash Lite Preview 0925
968
QwQ 32B
964
Gemini 2.5 Flash Preview 0925

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
41101Gemini 2.5 Flash Lite1012±161.5K2.2%1.3%210 tps0.7s1M$0.10$0.40
4260MiniMax M2.11007±209551.0%2.1%66 tps2.6s205K$0.30$1.20
43106Grok 3996±121.5K1.6%1.5%53 tps0.6s1M$3.67$18.33
44113Kimi K2 Fast992±103.5K2.8%0.8%365 tps0.5s131K$1.00$3.00
4568Grok 4989±113.3K0.9%3.9%29 tps11.1s256K$3.00$15.00
4648Grok 4 Fast Reasoning988±176350.8%2.1%102 tps3.1s2M$0.30$0.75
47113Gemini 2.5 Flash Lite Thinking986±195801.7%1.0%118 tps4.4s1M$0.03$0.13
4871GPT-5 Mini979±275350.9%2.6%66 tps14.2s400K$0.25$2.00
49119ERNIE 4.5 300B A47B976±151.2K2.0%4.7%23 tps2.3s123K$0.28$1.10
5095Gemini 2.5 Flash976±143.7K1.6%1.3%2 tps3.7s1M$0.30$2.50
51126DeepSeek V3974±161.5K1.4%0.9%69 tps1.1s64K$0.59$1.49
5262MiniMax M2974±217601.3%2.2%39 tps2.3s205K$0.21$0.85
5371Gemini 2.5 Flash Lite Preview 0925968±186801.4%1.2%209 tps0.7s1M$0.25$0.35
54121QwQ 32B968±206501.5%5.4%41 tps2.1s16K$0.43$0.56
5560Gemini 2.5 Flash Preview 0925964±247001.4%1.2%5 tps0.9s1M$0.13$0.97
56118GPT-4.1 mini963±131.9K1.3%1.1%67 tps0.9s1M$0.34$1.60
5786Claude Sonnet 4958±123.4K1.0%1.8%49 tps1.3s200K$3.00$15.00
5844Kimi K2 Thinking Turbo954±285051.0%2.0%75 tps1.4s262K$1.15$8.00
59160Llama 4 Scout947±141.6K1.9%0.6%88 tps5.1s131K$0.18$0.46
60124Kimi K2 0905 Turbo946±197951.9%0.7%373 tps0.5s262K$1.70$6.50
61133GPT-4.1 nano944±131.5K2.0%0.6%175 tps0.5s1M$0.10$0.40
6268GLM 4.7933±208701.7%5.8%40 tps1.5s200K$0.77$1.73
63170Kimi K2 0711925±205103.8%1.6%29 tps1.3s131K$0.72$2.60
64129Command A924±172K2.0%2.2%42 tps0.8s256K$2.00$7.33
65139OpenAI o4-mini917±156502.3%1.4%97 tps7.0s128K$1.10$4.40
6665Mistral Large 3911±305353.6%2.1%51 tps1.0s256K$0.50$1.50
67113Mistral Medium908±218451.7%1.8%48 tps0.6s33K$1.48$4.55
6879Qwen3 Max Thinking Preview903±335350.9%3.1%40 tps2.1s256K$1.20$6.00
6971Seed 1.8 251228894±256601.5%3.7%41 tps2.1s256K$0.25$2.00
70161Llama 4 Maverick890±142.2K1.8%1.2%88 tps2.4s1M$0.23$0.83
71143Gemini 2.0 Flash887±236802.2%<0.1%76 tps0.5s1M$0.14$0.56
7271Gemini 2.5 Flash Thinking886±226301.6%2.2%88 tps6.4s1M$0.30$2.50
73143Gemini 2.0 Flash Lite883±221.6K1.2%<0.1%42 tps0.5s1M$0.08$0.30
74148OpenAI o4-mini-high877±196852.1%1.9%117 tps15.9s200K$1.10$4.40
75186Gemma 3n E4B805±255451.8%2.0%30 tps0.5s8K$0.01$0.02
76177OpenAI o3-mini738±217002.1%0.8%143 tps3.3s200K$1.10$4.40
77175OpenAI o3-mini-low736±285552.6%0.7%139 tps1.5s200K$1.10$4.40
78186Grok 3 Mini Fast733±255403.6%1.6%44 tps0.5s131K$0.60$4.00
79186Grok 3 Mini725±296253.1%1.2%43 tps0.5s131K$0.30$0.50
View All (79 models)