Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1144
DeepSeek V3.2
1141
MiniMax M2.5 FP8
1140
DeepSeek V3.1 Turbo
1139
Claude Sonnet 4.5
1138
Claude Sonnet 4 (Thinking)
1136
Gemini 2.5 Pro
1133
GPT-5.2 (Extra High)
1129
MiniMax M2.1
1128
Grok 4 Fast Reasoning
1128
Claude Haiku 4.5
1128
Grok 4.1 Fast Reasoning
1128
Grok 4 Fast Non-Reasoning
1125
MiniMax M2
1124
DeepSeek V3.1
1119
GPT-5

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
4140DeepSeek V3.21144±320.7K1.9%1.4%83 tps5.1s131K$0.43$1.09
4271MiniMax M2.5 FP81141±42.9K1.7%3.6%33 tps1.7s205K$0.45$1.75
4356DeepSeek V3.1 Turbo1140±214.5K2.3%0.9%173 tps1.3s164K$2.00$3.75
4437Claude Sonnet 4.51139±237.7K4.3%1.4%41 tps1.3s200K$1.80$9.00
4548Claude Sonnet 4 (Thinking)1138±230.7K2.6%1.5%52 tps1.5s200K$3.00$13.67
4644Gemini 2.5 Pro1136±168.8K3.9%2.3%45 tps2.6s1M$1.25$10.00
4742GPT-5.2 (Extra High) 1133±320.9K1.9%13.2%17 tps20.5s400K$1.75$14.00
4860MiniMax M2.11129±241.8K2.0%2.1%66 tps2.6s205K$0.30$1.20
4948Grok 4 Fast Reasoning1128±225.9K3.9%2.1%102 tps3.1s2M$0.30$0.75
5052Claude Haiku 4.51128±231.4K3.7%1.1%100 tps0.9s200K$1.00$5.00
5144Grok 4.1 Fast Reasoning1128±257K3.1%1.5%58 tps7.3s2M$0.20$0.50
5252Grok 4 Fast Non-Reasoning1128±321.3K4.7%1.5%93 tps0.6s2M$0.27$0.67
5362MiniMax M21125±233.6K3.5%2.2%39 tps2.3s205K$0.21$0.85
5471DeepSeek V3.11124±36.8K2.0%0.8%197 tps0.4s164K$0.55$1.60
5552GPT-51119±244.3K3.9%3.1%78 tps23.1s400K$1.25$9.67
5695Qwen3 32B1117±63.3K2.8%3.9%30 tps3.1s41K$0.12$0.42
5786DeepSeek V3.1 Nex N11112±62.1K1.7%3.4%24 tps7.2s131K$0.14$0.50
5865GLM 4.61108±325.8K4.3%5.4%39 tps1.5s200K$0.42$1.66
5984MiniMax M2.51105±82.1K1.6%1.4%70 tps1.9s205K$0.28$1.20
6071Seed 1.8 2512281104±319K1.5%3.7%41 tps2.1s256K$0.25$2.00
6186DeepSeek V3.1 Chat1102±313.4K4.1%2.8%21 tps1.6s131K$0.38$1.00
6260Gemini 2.5 Flash Preview 09251102±219.5K4.3%1.2%5 tps0.9s1M$0.13$0.97
6368GLM 4.71101±335.7K2.1%5.8%40 tps1.5s200K$0.77$1.73
6468Grok 41100±1120.3K2.1%3.9%29 tps11.1s256K$3.00$15.00
65101DeepSeek V3 (Turbo)1100±34.8K2.5%1.5%32 tps1.5s64K$0.40$1.30
6686Amazon Nova 2 Lite1099±312.6K3.1%1.0%137 tps0.6s300K$0.35$2.95
6768Qwen Plus (Aug'24)1098±260.9K2.4%1.4%53 tps1.3s30K$0.40$1.20
6862GPT-5.1 Instant1098±221.7K2.4%1.3%50 tps1.9s400K$1.25$10.00
6971Qwen3.5 397B A17B1092±57.2K1.8%4.3%57 tps1.4s256K$0.52$3.00
7079Qwen3 Max Thinking Preview1089±217.8K3.3%3.1%40 tps2.1s256K$1.20$6.00
7181GPT-4o1088±230.3K2.1%1.0%49 tps2.4s128K$3.71$12.57
72133Nemotron 3 Nano1087±51.9K2.5%1.3%216 tps0.8s256K$0.05$4.94
7356Gemini 3.1 Flash Lite Preview Thinking1084±63.6K3.1%1.7%75 tps4.7s1M$0.25$1.50
7471Gemini 2.5 Flash Lite Preview 09251083±220.9K4.8%1.2%209 tps0.7s1M$0.25$0.35
7586Seed 2.0 Lite (Medium)1082±62.1K1.9%6.6%33 tps1.6s256K$0.25$2.00
76111LongCat Flash Chat1082±46.5K3.2%0.8%85 tps0.9s131K$0.14$0.68
7771GPT-5 Mini1082±217.5K4.3%2.6%66 tps14.2s400K$0.25$2.00
7884GPT-5 Mini Minimal1081±36.8K6.5%1.2%63 tps1.4s400K$0.25$2.00
79153Ministral 14B 3.01079±53K3.4%2.0%119 tps0.5s128K$0.20$0.20
8093Qwen Max1078±165.6K2.1%1.5%49 tps1.5s33K$1.60$6.40
View All (208 models)