Models


Last updated about 1 month ago

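| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 121 | 112 | GLM 4.5 | 1038 | ±9 | 3.2K | 8.8% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 122 | 135 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1035 | ±10 | 4.6K | 7.3% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40 |
| 123 | 135 | Gemini 3.1 Flash Lite Preview | 1034 | ±30 | 775 | 4.3% | 1.0% | 8 tps | 1.2s | 1M | $0.25 | $1.50 |
| 124 | 144 | Gemini 2.0 Flash | 1034 | ±7 | 7.7K | 3.6% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 125 | 135 | DeepSeek V3 | 1032 | ±6 | 8.4K | 2.7% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 126 | 119 | LongCat Flash Chat | 1032 | ±12 | 2.2K | 5.9% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 127 | 135 | QwQ 32B | 1030 | ±9 | 6.4K | 7.9% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 128 | 128 | OpenAI o4-mini | 1027 | ±11 | 4.1K | 8.7% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 129 | 135 | DeepSeek V3.2 Speciale | 1026 | ±18 | 1.6K | 6.9% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 130 | 148 | Qwen3 235B A22B Thinking 2507 | 1024 | ±22 | 1.9K | 5.1% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 131 | 144 | Command A | 1021 | ±7 | 11.9K | 4.1% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 132 | 105 | DeepSeek V3 (Turbo) | 1020 | ±25 | 865 | 6.0% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 133 | 119 | OpenAI o1 | 1019 | ±13 | 4.3K | 4.2% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 134 | 159 | Grok Code Fast 1 | 1018 | ±10 | 2K | 5.6% | 5.9% | 294 tps | 0.5s | 256K | $0.20 | $1.50 |
| 135 | 112 | GPT-5 (Low) | 1018 | ±28 | 480 | 5.0% | 1.8% | 75 tps | 8.2s | 400K | $1.25 | $10.00 |
| 136 | 148 | Nemotron 3 Nano (Thinking) | 1018 | ±17 | 1.4K | 6.3% | 2.0% | 200 tps | 0.5s | 256K | $0 | $0 |
| 137 | 119 | DeepSeek V3.1 Terminus Thinking | 1016 | ±15 | 2.1K | 10.7% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 138 | 128 | Kimi K2 Thinking | 1013 | ±13 | 2.5K | 5.4% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 139 | 105 | Seed 1.8 251228 | 1010 | ±19 | 2.1K | 3.0% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 140 | 148 | DeepSeek-R1 Turbo | 1005 | ±15 | 1.6K | 6.3% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 141 | 159 | GPT-5 Nano | 1001 | ±12 | 3.6K | 8.3% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 142 | 167 | Llama 3.1 8B Turbo | 999 | ±16 | 1.6K | 1.5% | 2.1% | 650 tps | 0.5s | 128K | $0.13 | $0.14 |
| 143 | 128 | GLM 4.5 AirX | 999 | ±39 | 635 | 8.6% | 3.3% | 75 tps | 1.2s | 131K | $1.10 | $4.50 |
| 144 | 144 | OpenAI o3 | 991 | ±13 | 4.7K | 3.7% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 145 | 167 | Mistral Small 3.2 24B | 990 | ±11 | 4.4K | 4.7% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 146 | 167 | Pixtral Large | 988 | ±20 | 2.3K | 3.6% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 147 | 148 | Seed 1.6 250615 | 988 | ±22 | 1.2K | 6.0% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 148 | 148 | Qwen3 30B A3B | 987 | ±10 | 4.5K | 8.3% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 149 | 112 | gpt-oss-20b | 983 | ±10 | 4.1K | 10.1% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 150 | 148 | Qwen3 Coder Plus | 981 | ±33 | 500 | 3.8% | 5.1% | 56 tps | 2.3s | 128K | $1.80 | $9.80 |
| 151 | 90 | Step 3.5 Flash | 981 | ±44 | 510 | 3.8% | 2.2% | 109 tps | 0.6s | 256K | $0.05 | $0.15 |
| 152 | 167 | Devstral Medium | 978 | ±17 | 3.4K | 5.0% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 153 | 167 | Qwen 2.5 32B Instruct | 976 | ±11 | 2.9K | 6.2% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 154 | 167 | DeepSeek V3.1 Thinking | 974 | ±11 | 3.7K | 11.1% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 155 | 167 | Qwen3 VL 30B A3B Thinking | 973 | ±17 | 1.5K | 9.3% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 156 | 148 | OpenAI o3-mini-high | 972 | ±8 | 3.9K | 5.1% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 157 | 159 | GLM 4.6V | 972 | ±18 | 2.3K | 5.8% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 158 | 159 | Mistral Small 3.1 24B Instruct | 970 | ±15 | 2.8K | 4.2% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 |
| 159 | 167 | Qwen 2.5 72B | 969 | ±22 | 1.3K | 4.0% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 160 | 167 | Llama 4 Scout | 958 | ±7 | 9K | 4.2% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |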
View All (273 models)