Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1020
GPT-4.1 nano
1020
GLM 4.5 AirX
1027
Amazon Nova 2 Lite
1028
DeepSeek V3
1029
Command A
1029
DeepSeek V3.1
1032
DeepSeek-R1 Turbo
1032
ERNIE 4.5 300B A47B
1034
DeepSeek V3 (Turbo)
1035
GPT-5 Nano
1037
OpenAI o1
1038
Gemini 2.5 Flash Lite
1038
Gemini 2.0 Flash
1041
Kimi K2 0905 Turbo
1042
Pixtral Large

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121133GPT-4.1 nano1020±48.8K9.7%0.6%175 tps0.5s1M$0.10$0.40
122113GLM 4.5 AirX1020±108059.0%3.3%75 tps1.2s131K$1.10$4.50
12386Amazon Nova 2 Lite1027±102.6K7.9%1.0%137 tps0.6s300K$0.35$2.95
124126DeepSeek V31028±75.9K5.7%0.9%69 tps1.1s64K$0.59$1.49
125129Command A1029±511K8.4%2.2%42 tps0.8s256K$2.00$7.33
12671DeepSeek V3.11029±101.1K4.5%0.8%197 tps0.4s164K$0.55$1.60
12795DeepSeek-R1 Turbo1032±101.4K5.5%2.6%29 tps1.8s64K$2.85$4.75
128119ERNIE 4.5 300B A47B1032±66.1K8.7%4.7%23 tps2.3s123K$0.28$1.10
129101DeepSeek V3 (Turbo)1034±111K5.9%1.5%32 tps1.5s64K$0.40$1.30
130157GPT-5 Nano1035±63.8K10.6%3.2%113 tps20.9s400K$0.05$0.40
131153OpenAI o11037±151.2K4.8%4.2%92 tps5.5s200K$15.00$60.00
132101Gemini 2.5 Flash Lite1038±512.8K12.6%1.3%210 tps0.7s1M$0.10$0.40
133143Gemini 2.0 Flash1038±63.7K8.9%<0.1%76 tps0.5s1M$0.14$0.56
134124Kimi K2 0905 Turbo1041±46.8K12.4%0.7%373 tps0.5s262K$1.70$6.50
135165Pixtral Large1042±82.5K5.1%2.5%57 tps1.3s128K$1.50$4.50
136143Seed 1.6 2506151042±131.2K4.8%3.1%46 tps2.2s256K$0.25$2.00
13779Qwen3 Max Thinking Preview1044±55.1K7.7%3.1%40 tps2.1s256K$1.20$6.00
138119GLM 4.7 FP81046±194903.0%6.9%40 tps1.3s200K$0.30$1.20
139106DeepSeek V3.1 Terminus Thinking1047±72.5K11.6%5.9%27 tps1.8s131K$0.56$1.68
14037Qwen3 Omni 30B A3B Thinking1047±111.3K5.9%3.7%67 tps1.2s66K$0.97$1.79
14181OpenAI o3-pro1048±82.2K3.5%5.2%22 tps70.8s200K$20.00$80.00
142111Grok 3 Fast1051±171.1K2.6%1.7%52 tps2.4s131K$5.00$25.00
143139Seed 2.0 Mini (Medium)1053±216054.0%11.9%33 tps1.7s256K$0.15$0.60
14471Qwen3.5 397B A17B1055±111.6K2.1%4.3%57 tps1.4s256K$0.52$3.00
14593DeepSeek V3 0324 Turbo1055±59.3K10.3%6.3%12 tps2.4s164K$0.73$1.79
146113Gemini 2.5 Flash Lite Thinking1059±56.6K9.5%1.0%118 tps4.4s1M$0.03$0.13
147118GPT-4.1 mini1060±411.7K6.8%1.1%67 tps0.9s1M$0.34$1.60
14895DeepSeek V3.2 Exp Thinking1063±85K3.4%7.2%26 tps3.0s131K$0.28$0.42
14995Kimi K2 Thinking1064±121.6K6.8%4.2%61 tps5.9s262K$0.24$1.03
15095Qwen3 32B1070±186207.5%3.9%30 tps3.1s41K$0.12$0.42
15171Gemini 2.5 Flash Lite Preview 09251070±56.7K8.6%1.2%209 tps0.7s1M$0.25$0.35
15286Qwen3 235B A22B1074±102.8K14.4%5.3%71 tps0.9s41K$0.23$0.63
15393Qwen Max1077±58.8K9.1%1.5%49 tps1.5s33K$1.60$6.40
15465DeepSeek V3.2 Exp Chat1079±44.3K8.8%2.6%29 tps1.5s131K$0.27$0.39
15548Step 3.5 Flash1079±236452.3%2.2%109 tps0.6s256K$0.05$0.15
15681Qwen3.5 27B1082±265502.7%3.7%55 tps2.6s256K$0.30$2.40
15786DeepSeek V3.1 Chat1084±63.7K10.1%2.8%21 tps1.6s131K$0.38$1.00
15862Qwen3 Omni 30B A3B Instruct1085±145706.6%3.9%65 tps1.2s66K$0.35$0.97
15948gpt-oss-120b1086±415.1K7.5%0.7%213 tps0.5s131K$0.11$0.50
16062MiniMax M21087±616.5K5.2%2.2%39 tps2.3s205K$0.21$0.85
View All (237 models)