Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1051
Gemini 2.5 Flash Lite Thinking Preview 0925
1050
MiniMax M2.1 Lightning
1049
Qwen3 Next 80B A3B Thinking
1047
GLM 4.7
1047
GPT-5 Mini
1046
Kimi K2.5 Instant
1046
DeepSeek V3.2 Exp Thinking
1045
DeepSeek V3 0324
1044
Grok 3
1041
Gemini 2.5 Flash Lite Thinking
1038
DeepSeek-R1 0528
1037
OpenAI o3-pro
1031
DeepSeek R1T2 Chimera
1028
Claude Sonnet 4 (Thinking)
1027
Qwen3 VL 235B A22B Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8195Gemini 2.5 Flash Lite Thinking Preview 09251051±151.5K4.2%1.5%152 tps3.0s1M$0.10$0.40
8256MiniMax M2.1 Lightning1050±236151.6%1.7%52 tps2.1s205K$0.30$2.40
83157Qwen3 Next 80B A3B Thinking1049±92K2.6%0.6%175 tps1.3s256K$0.21$2.26
8468GLM 4.71047±152.4K1.2%5.8%40 tps1.5s200K$0.77$1.73
8571GPT-5 Mini1047±92.2K2.7%2.6%66 tps14.2s400K$0.25$2.00
8637Kimi K2.5 Instant1046±166202.4%2.9%32 tps3.0s262K$0.50$3.00
8795DeepSeek V3.2 Exp Thinking1046±187354.5%7.2%26 tps3.0s131K$0.28$0.42
88106DeepSeek V3 03241045±84.1K1.0%5.8%12 tps2.7s164K$0.38$0.93
89106Grok 31044±76K1.1%1.5%53 tps0.6s1M$3.67$18.33
90113Gemini 2.5 Flash Lite Thinking1041±92.2K1.8%1.0%118 tps4.4s1M$0.03$0.13
91133DeepSeek-R1 05281038±131.7K2.0%1.3%93 tps0.5s64K$1.60$3.67
9281OpenAI o3-pro1037±189502.6%5.2%22 tps70.8s200K$20.00$80.00
93165DeepSeek R1T2 Chimera1031±175753.4%3.0%28 tps1.8s164K$0.13$0.45
9448Claude Sonnet 4 (Thinking)1028±153.7K2.9%1.5%52 tps1.5s200K$3.00$13.67
95126Qwen3 VL 235B A22B Thinking1027±139354.1%4.3%47 tps3.0s127K$0.47$3.31
9662MiniMax M21027±92.5K5.2%2.2%39 tps2.3s205K$0.21$0.85
97129Qwen3 Max Thinking1022±121.3K1.1%13.5%32 tps2.3s256K$1.20$6.00
9871Qwen3.5 397B A17B1021±229101.1%4.3%57 tps1.4s256K$0.52$3.00
99124Kimi K2 0905 Turbo1017±122.1K2.3%0.7%373 tps0.5s262K$1.70$6.50
100121QwQ 32B1015±74.6K1.4%5.4%41 tps2.1s16K$0.43$0.56
101119ERNIE 4.5 300B A47B1014±114K1.1%4.7%23 tps2.3s123K$0.28$1.10
10286Claude Sonnet 41013±710.4K1.2%1.8%49 tps1.3s200K$3.00$15.00
103153Qwen 2.5 32B Instruct1004±141.2K1.7%2.5%48 tps1.0s131K$0.21$0.25
10465GLM 4.61001±151.3K4.4%5.4%39 tps1.5s200K$0.42$1.66
105118GPT-4.1 mini999±105.1K1.4%1.1%67 tps0.9s1M$0.34$1.60
10686Qwen3 235B A22B998±181.4K3.2%5.3%71 tps0.9s41K$0.23$0.63
10771Seed 1.8 251228997±132.4K1.3%3.7%41 tps2.1s256K$0.25$2.00
108161Qwen3 8B991±111.4K2.5%2.4%61 tps1.4s41K$0.02$0.07
109113GLM 4.5986±151.5K2.0%3.7%46 tps1.4s131K$0.43$1.63
110126Qwen3 30B A3B986±161.9K3.4%5.1%163 tps1.0s41K$0.06$0.21
111129DeepSeek V3.1 Thinking984±131.4K3.7%7.1%18 tps1.8s131K$0.23$0.75
112126DeepSeek V3975±105.5K0.5%0.9%69 tps1.1s64K$0.59$1.49
11365Mistral Large 3974±201.2K5.5%2.1%51 tps1.0s256K$0.50$1.50
114139OpenAI o4-mini971±82.2K2.6%1.4%97 tps7.0s128K$1.10$4.40
115148OpenAI o3968±121.6K1.6%0.9%85 tps6.8s128K$7.33$29.33
116143Gemini 2.0 Flash965±121.8K1.3%<0.1%76 tps0.5s1M$0.14$0.56
11771DeepSeek V3.1962±207503.2%0.8%197 tps0.4s164K$0.55$1.60
118129Command A959±86.2K1.2%2.2%42 tps0.8s256K$2.00$7.33
119133GPT-4.1 nano958±84K1.4%0.6%175 tps0.5s1M$0.10$0.40
120157GPT-5 Nano955±181.2K4.1%3.2%113 tps20.9s400K$0.05$0.40
View All (173 models)