Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1146
Qwen3 235B A22B Instruct 2507
1143
Grok 4.20 Multi Agent Beta
1141
Qwen3 Next 80B A3B Instruct
1139
GPT-5 Mini Minimal
1138
GPT-5
1137
Grok 4.1 Fast Non-Reasoning
1136
GLM 4.6
1134
Claude Haiku 4.5
1134
GPT-5.1 Instant
1130
Grok 4
1130
GPT-5 Mini
1130
DeepSeek V3.2
1128
MiniMax M2.5 Lightning
1125
Grok 4 Fast Reasoning
1124
Qwen3.5 122B A17B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
4140Qwen3 235B A22B Instruct 25071146±38.8K12.2%6.8%13 tps1.9s262K$0.13$0.52
4233Grok 4.20 Multi Agent Beta1143±167651.9%1.2%56 tps8.8s2M$2.00$6.00
4333Qwen3 Next 80B A3B Instruct1141±47.6K7.7%0.6%84 tps1.1s256K$0.20$1.42
4484GPT-5 Mini Minimal1139±82.8K9.7%1.2%63 tps1.4s400K$0.25$2.00
4552GPT-51138±414K7.9%3.1%78 tps23.1s400K$1.25$9.67
4626Grok 4.1 Fast Non-Reasoning1137±57.4K6.6%0.9%101 tps0.5s2M$0.20$0.50
4765GLM 4.61136±514.1K4.7%5.4%39 tps1.5s200K$0.42$1.66
4852Claude Haiku 4.51134±59.9K6.9%1.1%100 tps0.9s200K$1.00$5.00
4962GPT-5.1 Instant1134±55.5K5.7%1.3%50 tps1.9s400K$1.25$10.00
5068Grok 41130±223.2K6.4%3.9%29 tps11.1s256K$3.00$15.00
5171GPT-5 Mini1130±46.1K7.9%2.6%66 tps14.2s400K$0.25$2.00
5240DeepSeek V3.21130±54.4K5.1%1.4%83 tps5.1s131K$0.43$1.09
5379MiniMax M2.5 Lightning1128±149952.5%1.5%51 tps2.0s205K$0.60$2.40
5448Grok 4 Fast Reasoning1125±511.8K5.5%2.1%102 tps3.1s2M$0.30$0.75
5552Qwen3.5 122B A17B1124±171.1K3.2%1.5%82 tps1.4s256K$0.40$3.20
5637Kimi K2.5 Instant1124±131.4K2.4%2.9%32 tps3.0s262K$0.50$3.00
5781GPT-4o1124±56.5K6.1%1.0%49 tps2.4s128K$3.71$12.57
5856DeepSeek V3.2 Thinking1117±610K3.8%9.0%30 tps2.6s131K$0.28$0.42
5968Qwen Plus (Aug'24)1116±58.9K9.4%1.4%53 tps1.3s30K$0.40$1.20
6029Nova Experimental Chat 12-101115±82.2K4.8%2.4%84 tps12.9s98K$0$0
6171Gemini 3.1 Flash Lite Preview1114±276302.3%1.0%8 tps1.2s1M$0.25$1.50
6268GLM 4.71112±88.8K4.7%5.8%40 tps1.5s200K$0.77$1.73
6360Gemini 2.5 Flash Preview 09251110±46.7K7.5%1.2%5 tps0.9s1M$0.13$0.97
6452Grok 4 Fast Non-Reasoning1110±57.1K8.3%1.5%93 tps0.6s2M$0.27$0.67
65106Claude Sonnet 3.5 v21109±72.9K8.2%<0.1%46 tps1.4s200K$3.00$15.00
6665Mistral Large 31108±74K6.3%2.1%51 tps1.0s256K$0.50$1.50
6733Qwen3 30B A3B Instruct 25071108±58.5K9.7%1.2%55 tps1.3s131K$0.13$0.72
6856DeepSeek V3.1 Turbo1106±92.6K5.1%0.9%173 tps1.3s164K$2.00$3.75
6995Gemini 2.5 Flash Lite Thinking Preview 09251104±54.9K7.8%1.5%152 tps3.0s1M$0.10$0.40
70106Grok 31102±59.3K9.3%1.5%53 tps0.6s1M$3.67$18.33
7186Claude Sonnet 41102±418.3K7.0%1.8%49 tps1.3s200K$3.00$15.00
7244DeepSeek V3.1 Terminus Chat1100±45.1K9.6%3.4%27 tps1.5s131K$0.86$1.80
7395Gemini 2.5 Flash1098±521.4K5.2%1.3%2 tps3.7s1M$0.30$2.50
74179Switchpoint Router1097±111.1K9.5%1.7%71 tps4.9s131K$0.85$3.40
7571Seed 1.8 2512281096±64.1K3.4%3.7%41 tps2.1s256K$0.25$2.00
76106DeepSeek V3 03241090±59.7K8.2%5.8%12 tps2.7s164K$0.38$0.93
77113Mistral Medium1087±45.3K9.0%1.8%48 tps0.6s33K$1.48$4.55
7862MiniMax M21087±616.5K5.2%2.2%39 tps2.3s205K$0.21$0.85
7948gpt-oss-120b1086±415.1K7.5%0.7%213 tps0.5s131K$0.11$0.50
8062Qwen3 Omni 30B A3B Instruct1085±145706.6%3.9%65 tps1.2s66K$0.35$0.97
View All (237 models)