Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1015
QwQ 32B
1014
ERNIE 4.5 300B A47B
1014
Claude Sonnet 3.7 (Thinking)
1013
Claude Sonnet 4
1011
GPT-5 Mini Low
1004
Qwen 2.5 32B Instruct
1001
GLM 4.6
999
GPT-4.1 mini
998
Qwen3 235B A22B
997
Seed 1.8 251228
994
GLM Z1 32B
991
Qwen3 8B
986
GLM 4.5
986
Qwen3 30B A3B
984
DeepSeek V3.1 Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121121QwQ 32B1015±74.6K1.4%5.4%41 tps2.1s16K$0.43$0.56
122119ERNIE 4.5 300B A47B1014±114K1.1%4.7%23 tps2.3s123K$0.28$1.10
12384Claude Sonnet 3.7 (Thinking)1014±111.8K3.0%<0.1%41 tps2.6s200K$3.00$15.00
12486Claude Sonnet 41013±710.4K1.2%1.8%49 tps1.3s200K$3.00$15.00
125108GPT-5 Mini Low1011±155104.7%<0.1%69 tps3.2s400K$0.25$2.00
126153Qwen 2.5 32B Instruct1004±141.2K1.7%2.5%48 tps1.0s131K$0.21$0.25
12765GLM 4.61001±151.3K4.4%5.4%39 tps1.5s200K$0.42$1.66
128118GPT-4.1 mini999±105.1K1.4%1.1%67 tps0.9s1M$0.34$1.60
12986Qwen3 235B A22B998±181.4K3.2%5.3%71 tps0.9s41K$0.23$0.63
13071Seed 1.8 251228997±132.4K1.3%3.7%41 tps2.1s256K$0.25$2.00
131277GLM Z1 32B994±205801.7%<0.1%18 tps9.3s33K$0.09$0.11
132161Qwen3 8B991±111.4K2.5%2.4%61 tps1.4s41K$0.02$0.07
133113GLM 4.5986±151.5K2.0%3.7%46 tps1.4s131K$0.43$1.63
134126Qwen3 30B A3B986±161.9K3.4%5.1%163 tps1.0s41K$0.06$0.21
135129DeepSeek V3.1 Thinking984±131.4K3.7%7.1%18 tps1.8s131K$0.23$0.75
136126DeepSeek V3975±105.5K0.5%0.9%69 tps1.1s64K$0.59$1.49
13765Mistral Large 3974±201.2K5.5%2.1%51 tps1.0s256K$0.50$1.50
138139OpenAI o4-mini971±82.2K2.6%1.4%97 tps7.0s128K$1.10$4.40
139148OpenAI o3968±121.6K1.6%0.9%85 tps6.8s128K$7.33$29.33
140143Gemini 2.0 Flash965±121.8K1.3%<0.1%76 tps0.5s1M$0.14$0.56
14171DeepSeek V3.1962±207503.2%0.8%197 tps0.4s164K$0.55$1.60
142129Command A959±86.2K1.2%2.2%42 tps0.8s256K$2.00$7.33
143133GPT-4.1 nano958±84K1.4%0.6%175 tps0.5s1M$0.10$0.40
144157GPT-5 Nano955±181.2K4.1%3.2%113 tps20.9s400K$0.05$0.40
145211Gemini 1.5 Pro952±455903.3%<0.1%15 tps0.0s2M$0.78$3.13
14686Amazon Nova 2 Lite948±219707.6%1.0%137 tps0.6s300K$0.35$2.95
147143Gemini 2.0 Flash Lite948±73.5K1.7%<0.1%42 tps0.5s1M$0.08$0.30
148153OpenAI o1946±113.6K1.0%4.2%92 tps5.5s200K$15.00$60.00
149165Pixtral Large941±189403.6%2.5%57 tps1.3s128K$1.50$4.50
150165Qwen3 4B940±141.7K5.2%1.9%94 tps1.5s128K$0.01$0.01
151179GLM 4.7 Flash932±266902.1%5.8%61 tps2.8s128K$0.07$0.39
152170Kimi K2 0711932±122.4K1.7%1.6%29 tps1.3s131K$0.72$2.60
153170Mistral Small 3.2 24B929±149852.5%2.8%141 tps0.7s33K$0.02$0.08
154106DeepSeek V3.1 Terminus Thinking927±178404.5%5.9%27 tps1.8s131K$0.56$1.68
155161Mistral Small 3.1924±366152.4%7.4%13 tps2.6s32K$0.17$0.28
156111Grok 3 Fast922±125300.9%1.7%52 tps2.4s131K$5.00$25.00
157133DeepSeek V3.2 Speciale922±257806.6%6.0%43 tps1.4s131K$0.84$1.52
158148OpenAI o4-mini-high915±104.7K1.5%1.9%117 tps15.9s200K$1.10$4.40
159148DeepSeek-R1904±161.7K2.8%0.8%133 tps0.6s64K$0.91$3.07
160292AFM 4.5B903±221.1K2.6%<0.1%81 tps0.3s66K$0.05$0.20
View All (223 models)