Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1100
DeepSeek V3 0324
1099
Qwen3 Coder 480B A35B Instruct
1098
Gemini 2.5 Flash
1098
Grok 3
1093
DeepSeek V3 0324 Turbo
1093
Qwen3 235B A22B
1091
Nova Experimental Chat 10-09
1090
Sherlock Dash Alpha
1090
OpenAI o3-pro
1089
DeepSeek V3.1
1089
DeepSeek V3.2 Exp Thinking
1088
Claude Sonnet 3.5
1087
Qwen Plus 0728 (Thinking)
1087
GPT-4.1 mini
1085
GPT-4.1 nano

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
12190DeepSeek V3 03241100±415.1K4.3%5.8%12 tps2.7s164K$0.38$0.93
12290Qwen3 Coder 480B A35B Instruct1099±83.1K4.5%3.3%61 tps2.0s262K$0.71$1.34
12398Gemini 2.5 Flash1098±435.9K3.2%1.3%2 tps3.7s1M$0.30$2.50
12498Grok 31098±419.1K5.5%1.5%53 tps0.6s1M$3.67$18.33
12598DeepSeek V3 0324 Turbo1093±515.5K5.7%6.3%12 tps2.4s164K$0.73$1.79
12698Qwen3 235B A22B1093±64.5K8.0%5.3%71 tps0.9s41K$0.23$0.63
127123Nova Experimental Chat 10-091091±73.2K10.7%<0.1%59 tps6.1s98K$0$0
128123Sherlock Dash Alpha1090±198356.7%<0.1%68 tps0.7s2M$0$0
12998OpenAI o3-pro1090±85.4K4.3%5.2%22 tps70.8s200K$20.00$80.00
13098DeepSeek V3.11089±122.3K4.7%0.8%197 tps0.4s164K$0.55$1.60
13198DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
132132Claude Sonnet 3.51088±102.9K4.9%1.0%40 tps2.7s200K$3.00$15.00
133132Qwen Plus 0728 (Thinking)1087±91.2K8.9%<0.1%56 tps1.1s1M$0.40$4.00
134105GPT-4.1 mini1087±519.7K4.2%1.1%67 tps0.9s1M$0.34$1.60
135105GPT-4.1 nano1085±517K5.0%0.6%175 tps0.5s1M$0.10$0.40
136105Qwen3 Omni 30B A3B Instruct1085±137754.3%3.9%65 tps1.2s66K$0.35$0.97
137105DeepSeek V3 (Turbo)1082±201.5K5.1%1.5%32 tps1.5s64K$0.40$1.30
138132Solar Pro 2 2507101081±510.6K6.9%<0.1%9 tpsN/A66K$0.50$0.50
139105Seed 1.8 2512281081±103.2K3.1%3.7%41 tps2.1s256K$0.25$2.00
140105Mistral Medium1080±49.6K5.6%1.8%48 tps0.6s33K$1.48$4.55
141105Qwen3 Max Thinking1080±181.5K2.0%13.5%32 tps2.3s256K$1.20$6.00
142112GLM 4.51075±56K7.0%3.7%46 tps1.4s131K$0.43$1.63
143112Kimi K2 09051074±78.7K4.3%4.0%30 tps1.4s262K$0.63$2.39
144112Kimi K2 Fast1073±535K6.4%0.8%365 tps0.5s131K$1.00$3.00
145112GPT-5 (Low)1070±146903.5%1.8%75 tps8.2s400K$1.25$10.00
146112Kimi K2 0905 Turbo1070±67.5K9.1%0.7%373 tps0.5s262K$1.70$6.50
147112gpt-oss-20b1066±67.7K7.1%0.5%216 tps0.5s131K$0.06$0.26
148144Qwen Turbo1064±510K6.0%<0.1%53 tps1.1s1M$0.05$0.20
149112Grok 4.20 Beta Non-reasoning1063±365004.8%1.1%151 tps0.6s2M$2.00$6.00
150119OpenAI o11062±69.9K3.3%4.2%92 tps5.5s200K$15.00$60.00
151119DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
152119OpenAI o1-pro1061±206807.5%5.2%33 tps72.8s200K$150.00$600.00
153119Gemini 2.5 Flash Lite Thinking1061±49.8K6.2%1.0%118 tps4.4s1M$0.03$0.13
154151GLM 4.5 FP81060±186108.3%<0.1%59 tps1.2s131K$0.41$1.65
155151Llama 3 8B Turbo1059±246001.6%<0.1%97 tps0.1s8K$0.12$0.13
156119Seed 2.0 Lite (Medium)1058±205253.7%6.6%33 tps1.6s256K$0.25$2.00
157119LongCat Flash Chat1058±122.7K5.9%0.8%85 tps0.9s131K$0.14$0.68
158151OpenAI Codex Mini1057±59.8K3.3%<0.1%46 tps2.1s200K$1.50$6.00
159119GPT-5.1 Codex Mini (Medium)1057±151.9K4.9%4.6%69 tps4.1s400K$0.25$2.00
160119GPT-5.1 Codex Mini (High)1054±152.2K3.9%5.9%70 tps4.6s400K$0.25$2.00
View All (404 models)