Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1098
GPT-5.1 Instant
1097
Qwen3.5 27B
1093
gpt-oss-20b
1092
Qwen3.5 397B A17B
1090
Qwen3 235B A22B
1089
Qwen3 Max Thinking Preview
1088
GPT-4o
1087
Nemotron 3 Nano
1084
Gemini 3.1 Flash Lite Preview Thinking
1083
Gemini 2.5 Flash Lite Preview 0925
1082
Seed 2.0 Lite (Medium)
1082
LongCat Flash Chat
1082
GPT-5 Mini
1081
GPT-5 Mini Minimal
1079
Ministral 14B 3.0

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8162GPT-5.1 Instant1098±221.7K2.4%1.3%50 tps1.9s400K$1.25$10.00
8281Qwen3.5 27B1097±62.3K2.4%3.7%55 tps2.6s256K$0.30$2.40
83101gpt-oss-20b1093±220.3K4.6%0.5%216 tps0.5s131K$0.06$0.26
8471Qwen3.5 397B A17B1092±57.2K1.8%4.3%57 tps1.4s256K$0.52$3.00
8586Qwen3 235B A22B1090±311.9K5.1%5.3%71 tps0.9s41K$0.23$0.63
8679Qwen3 Max Thinking Preview1089±217.8K3.3%3.1%40 tps2.1s256K$1.20$6.00
8781GPT-4o1088±230.3K2.1%1.0%49 tps2.4s128K$3.71$12.57
88133Nemotron 3 Nano1087±51.9K2.5%1.3%216 tps0.8s256K$0.05$4.94
8956Gemini 3.1 Flash Lite Preview Thinking1084±63.6K3.1%1.7%75 tps4.7s1M$0.25$1.50
9071Gemini 2.5 Flash Lite Preview 09251083±220.9K4.8%1.2%209 tps0.7s1M$0.25$0.35
9186Seed 2.0 Lite (Medium)1082±62.1K1.9%6.6%33 tps1.6s256K$0.25$2.00
92111LongCat Flash Chat1082±46.5K3.2%0.8%85 tps0.9s131K$0.14$0.68
9371GPT-5 Mini1082±217.5K4.3%2.6%66 tps14.2s400K$0.25$2.00
9484GPT-5 Mini Minimal1081±36.8K6.5%1.2%63 tps1.4s400K$0.25$2.00
95153Ministral 14B 3.01079±53K3.4%2.0%119 tps0.5s128K$0.20$0.20
9693Qwen Max1078±165.6K2.1%1.5%49 tps1.5s33K$1.60$6.40
9771Gemini 2.5 Flash Thinking1076±222.8K2.3%2.2%88 tps6.4s1M$0.30$2.50
9893DeepSeek V3 0324 Turbo1076±256.4K2.9%6.3%12 tps2.4s164K$0.73$1.79
99153Apriel 1.5 15B Thinker1076±52.1K1.6%2.4%146 tps0.4s131K$0$0
10095Kimi K2 Thinking1074±38.3K2.9%4.2%61 tps5.9s262K$0.24$1.03
101106DeepSeek V3.1 Terminus Thinking1071±38.5K4.9%5.9%27 tps1.8s131K$0.56$1.68
10295DeepSeek-R1 Turbo1069±37.3K2.9%2.6%29 tps1.8s64K$2.85$4.75
10395DeepSeek V3.2 Exp Thinking1068±39.9K2.7%7.2%26 tps3.0s131K$0.28$0.42
104121QwQ 32B1068±228.3K4.4%5.4%41 tps2.1s16K$0.43$0.56
105106DeepSeek V3 03241065±146.7K2.5%5.8%12 tps2.7s164K$0.38$0.93
10686Claude Sonnet 41065±2113.6K2.4%1.8%49 tps1.3s200K$3.00$15.00
107101Qwen3.5 35B A3B1064±82.1K2.3%2.1%116 tps2.1s256K$0.63$1.13
108121NVIDIA Llama 3.3 Nemotron Super 49B v1.51064±54.5K3.7%2.0%50 tps0.6s131K$0.09$0.33
109106Grok 31063±265.8K2.6%1.5%53 tps0.6s1M$3.67$18.33
110111Grok 3 Fast1060±312.4K1.1%1.7%52 tps2.4s131K$5.00$25.00
111101Gemini 2.5 Flash Lite1060±250.1K4.8%1.3%210 tps0.7s1M$0.10$0.40
112143Solar Pro 2 2512151059±99852.5%1.8%107 tps1.5s66K$0.15$0.60
113121Qwen3 32B Fast1059±225K3.8%11.6%30 tps3.1s41K$0.10$0.25
114119ERNIE 4.5 300B A47B1058±251.6K1.9%4.7%23 tps2.3s123K$0.28$1.10
11595Gemini 2.5 Flash1058±1118.2K1.8%1.3%2 tps3.7s1M$0.30$2.50
116113GLM 4.5 AirX1057±44.1K3.0%3.3%75 tps1.2s131K$1.10$4.50
117118GPT-4.1 mini1055±267.2K2.2%1.1%67 tps0.9s1M$0.34$1.60
118106Claude Sonnet 3.5 v21055±221.4K1.9%<0.1%46 tps1.4s200K$3.00$15.00
119119GLM 4.7 FP81053±62.7K1.1%6.9%40 tps1.3s200K$0.30$1.20
120101GPT-5 (Low)1051±52.1K1.4%1.8%75 tps8.2s400K$1.25$10.00
View All (288 models)