Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1023
Qwen Plus (Aug'24)
1021
DeepSeek V3.2 Thinking
1020
Grok 4.1 Fast Reasoning
1019
MiniMax M2.1 Lightning
1017
GPT-5 Mini
1013
DeepSeek V3 0324
1010
Qwen3 235B A22B Thinking 2507
1009
DeepSeek-R1 Turbo
1009
Qwen Max
1001
DeepSeek-R1 0528
1000
DeepSeek V3.1 Terminus Thinking
991
GLM 4.6
983
Seed 1.8 251228
975
Kimi K2 Fast
974
Gemini 2.0 Flash

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8168Qwen Plus (Aug'24)1023±92.4K2.9%1.4%53 tps1.3s30K$0.40$1.20
8256DeepSeek V3.2 Thinking1021±131.9K1.8%9.0%30 tps2.6s131K$0.28$0.42
8344Grok 4.1 Fast Reasoning1020±73.7K3.0%1.5%58 tps7.3s2M$0.20$0.50
8456MiniMax M2.1 Lightning1019±248301.8%1.7%52 tps2.1s205K$0.30$2.40
8571GPT-5 Mini1017±103.1K5.2%2.6%66 tps14.2s400K$0.25$2.00
86106DeepSeek V3 03241013±112.1K3.1%5.8%12 tps2.7s164K$0.38$0.93
87124Qwen3 235B A22B Thinking 25071010±167453.2%2.5%53 tps1.6s131K$0.59$5.70
8895DeepSeek-R1 Turbo1009±206603.6%2.6%29 tps1.8s64K$2.85$4.75
8993Qwen Max1009±112.7K2.7%1.5%49 tps1.5s33K$1.60$6.40
90133DeepSeek-R1 05281001±151.1K4.1%1.3%93 tps0.5s64K$1.60$3.67
91106DeepSeek V3.1 Terminus Thinking1000±147452.6%5.9%27 tps1.8s131K$0.56$1.68
9265GLM 4.6991±159453.6%5.4%39 tps1.5s200K$0.42$1.66
9371Seed 1.8 251228983±103K2.6%3.7%41 tps2.1s256K$0.25$2.00
94113Kimi K2 Fast975±104.8K2.3%0.8%365 tps0.5s131K$1.00$3.00
95143Gemini 2.0 Flash974±191.9K4.7%<0.1%76 tps0.5s1M$0.14$0.56
96133GPT-4.1 nano974±112.3K3.4%0.6%175 tps0.5s1M$0.10$0.40
97148OpenAI o3970±101.2K3.1%0.9%85 tps6.8s128K$7.33$29.33
98129Command A965±83K2.9%2.2%42 tps0.8s256K$2.00$7.33
99111LongCat Flash Chat963±255604.3%0.8%85 tps0.9s131K$0.14$0.68
100153OpenAI o1960±112.3K2.4%4.2%92 tps5.5s200K$15.00$60.00
101126DeepSeek V3960±73.4K2.3%0.9%69 tps1.1s64K$0.59$1.49
102148OpenAI o4-mini-high958±112.2K3.1%1.9%117 tps15.9s200K$1.10$4.40
103139OpenAI o4-mini956±161.4K2.8%1.4%97 tps7.0s128K$1.10$4.40
104129DeepSeek V3.1 Thinking955±141.1K2.2%7.1%18 tps1.8s131K$0.23$0.75
10579Qwen3 Max Thinking Preview952±201.1K2.2%3.1%40 tps2.1s256K$1.20$6.00
10681OpenAI o3-pro951±191.6K3.4%5.2%22 tps70.8s200K$20.00$80.00
107101gpt-oss-20b950±181.4K4.7%0.5%216 tps0.5s131K$0.06$0.26
108101Qwen3.5 35B A3B949±275302.8%2.1%116 tps2.1s256K$0.63$1.13
10965Mistral Large 3947±201.3K4.4%2.1%51 tps1.0s256K$0.50$1.50
110143Seed 1.6 250615928±216355.2%3.1%46 tps2.2s256K$0.25$2.00
11195Kimi K2 Thinking926±177402.0%4.2%61 tps5.9s262K$0.24$1.03
112124Kimi K2 0905 Turbo925±131.5K4.7%0.7%373 tps0.5s262K$1.70$6.50
113133Kimi K2 0905922±218054.2%4.0%30 tps1.4s262K$0.63$2.39
114126Qwen3 VL 235B A22B Thinking922±187454.5%4.3%47 tps3.0s127K$0.47$3.31
115119ERNIE 4.5 300B A47B918±171.6K2.7%4.7%23 tps2.3s123K$0.28$1.10
116126Qwen3 30B A3B918±158653.4%5.1%163 tps1.0s41K$0.06$0.21
117143Gemini 2.0 Flash Lite917±112.5K6.7%<0.1%42 tps0.5s1M$0.08$0.30
118121QwQ 32B910±131.8K3.1%5.4%41 tps2.1s16K$0.43$0.56
11962MiniMax M2905±181.4K3.5%2.2%39 tps2.3s205K$0.21$0.85
120177OpenAI o3-mini901±122.5K3.1%0.8%143 tps3.3s200K$1.10$4.40
View All (154 models)