Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1139
GPT-5 Mini Minimal
1138
GPT-5
1137
Grok 4.1 Fast Non-Reasoning
1136
GLM 4.6
1134
Claude Haiku 4.5
1134
GPT-5.1 Instant
1130
Grok 4
1130
GPT-5 Mini
1130
DeepSeek V3.2
1125
Grok 4 Fast Reasoning
1124
GPT-4o
1116
Qwen Plus (Aug'24)
1115
Nova Experimental Chat 12-10
1114
Gemini 3.1 Flash Lite Preview
1112
GLM 4.7

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
4184GPT-5 Mini Minimal1139±82.8K9.7%1.2%63 tps1.4s400K$0.25$2.00
4252GPT-51138±414K7.9%3.1%78 tps23.1s400K$1.25$9.67
4326Grok 4.1 Fast Non-Reasoning1137±57.4K6.6%0.9%101 tps0.5s2M$0.20$0.50
4465GLM 4.61136±514.1K4.7%5.4%39 tps1.5s200K$0.42$1.66
4552Claude Haiku 4.51134±59.9K6.9%1.1%100 tps0.9s200K$1.00$5.00
4662GPT-5.1 Instant1134±55.5K5.7%1.3%50 tps1.9s400K$1.25$10.00
4768Grok 41130±223.2K6.4%3.9%29 tps11.1s256K$3.00$15.00
4871GPT-5 Mini1130±46.1K7.9%2.6%66 tps14.2s400K$0.25$2.00
4940DeepSeek V3.21130±54.4K5.1%1.4%83 tps5.1s131K$0.43$1.09
5048Grok 4 Fast Reasoning1125±511.8K5.5%2.1%102 tps3.1s2M$0.30$0.75
5181GPT-4o1124±56.5K6.1%1.0%49 tps2.4s128K$3.71$12.57
5268Qwen Plus (Aug'24)1116±58.9K9.4%1.4%53 tps1.3s30K$0.40$1.20
5329Nova Experimental Chat 12-101115±82.2K4.8%2.4%84 tps12.9s98K$0$0
5471Gemini 3.1 Flash Lite Preview1114±276302.3%1.0%8 tps1.2s1M$0.25$1.50
5568GLM 4.71112±88.8K4.7%5.8%40 tps1.5s200K$0.77$1.73
5660Gemini 2.5 Flash Preview 09251110±46.7K7.5%1.2%5 tps0.9s1M$0.13$0.97
5752Grok 4 Fast Non-Reasoning1110±57.1K8.3%1.5%93 tps0.6s2M$0.27$0.67
58106Claude Sonnet 3.5 v21109±72.9K8.2%<0.1%46 tps1.4s200K$3.00$15.00
5933Qwen3 30B A3B Instruct 25071108±58.5K9.7%1.2%55 tps1.3s131K$0.13$0.72
6056DeepSeek V3.1 Turbo1106±92.6K5.1%0.9%173 tps1.3s164K$2.00$3.75
6195Gemini 2.5 Flash Lite Thinking Preview 09251104±54.9K7.8%1.5%152 tps3.0s1M$0.10$0.40
62106Grok 31102±59.3K9.3%1.5%53 tps0.6s1M$3.67$18.33
6386Claude Sonnet 41102±418.3K7.0%1.8%49 tps1.3s200K$3.00$15.00
6444DeepSeek V3.1 Terminus Chat1100±45.1K9.6%3.4%27 tps1.5s131K$0.86$1.80
6595Gemini 2.5 Flash1098±521.4K5.2%1.3%2 tps3.7s1M$0.30$2.50
66179Switchpoint Router1097±111.1K9.5%1.7%71 tps4.9s131K$0.85$3.40
6771Seed 1.8 2512281096±64.1K3.4%3.7%41 tps2.1s256K$0.25$2.00
68106DeepSeek V3 03241090±59.7K8.2%5.8%12 tps2.7s164K$0.38$0.93
69113Mistral Medium1087±45.3K9.0%1.8%48 tps0.6s33K$1.48$4.55
7062MiniMax M21087±616.5K5.2%2.2%39 tps2.3s205K$0.21$0.85
7162Qwen3 Omni 30B A3B Instruct1085±145706.6%3.9%65 tps1.2s66K$0.35$0.97
7286DeepSeek V3.1 Chat1084±63.7K10.1%2.8%21 tps1.6s131K$0.38$1.00
7393Qwen Max1077±58.8K9.1%1.5%49 tps1.5s33K$1.60$6.40
7471Gemini 2.5 Flash Lite Preview 09251070±56.7K8.6%1.2%209 tps0.7s1M$0.25$0.35
7595Qwen3 32B1070±186207.5%3.9%30 tps3.1s41K$0.12$0.42
7695Kimi K2 Thinking1064±121.6K6.8%4.2%61 tps5.9s262K$0.24$1.03
77118GPT-4.1 mini1060±411.7K6.8%1.1%67 tps0.9s1M$0.34$1.60
78113Gemini 2.5 Flash Lite Thinking1059±56.6K9.5%1.0%118 tps4.4s1M$0.03$0.13
7993DeepSeek V3 0324 Turbo1055±59.3K10.3%6.3%12 tps2.4s164K$0.73$1.79
8071Qwen3.5 397B A17B1055±111.6K2.1%4.3%57 tps1.4s256K$0.52$3.00
View All (170 models)