Last updated about 1 month ago

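| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 121 | 119 | GLM 4.7 FP8 | 1039 | ±9 | 515 | 1.0% | 6.9% | 40 tps | 1.3s | 200K | $0.30 | $1.20 |
| 122 | 106 | DeepSeek V3.1 Terminus Thinking | 1035 | ±11 | 1.4K | 2.8% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 123 | 113 | Mistral Medium | 1035 | ±5 | 3.6K | 1.8% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 124 | 147 | GLM 4.5 Air | 1035 | ±7 | 3.2K | 2.3% | <0.1% | 22 tps | 1.4s | 131K | $0.10 | $0.38 |
| 125 | 65 | GLM 4.6 | 1030 | ±8 | 2.6K | 2.8% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 126 | 86 | Qwen3 235B A22B | 1030 | ±9 | 3.1K | 1.6% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 127 | 95 | DeepSeek V3.2 Exp Thinking | 1029 | ±11 | 1.4K | 0.7% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 128 | 68 | GLM 4.7 | 1026 | ±6 | 4.5K | 0.8% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 129 | 71 | GPT-5 Mini | 1025 | ±6 | 3.2K | 2.0% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 130 | 314 | Weather | 1025 | ±19 | 580 | 4.1% | <0.1% | 36 tps | 1.1s | 32K | $0 | $0 |
| 131 | 126 | Qwen3 VL 235B A22B Thinking | 1024 | ±11 | 1.6K | 4.2% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 132 | 159 | Gemini 2.5 Pro Preview 0325 | 1023 | ±18 | 735 | 2.6% | <0.1% | 3 tps | 16.6s | 1M | $1.25 | $10.00 |
| 133 | 143 | Gemini 2.0 Flash | 1022 | ±7 | 2.5K | 2.5% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 134 | 153 | Qwen 2.5 32B Instruct | 1019 | ±8 | 1.4K | 1.8% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 135 | 113 | GLM 4.5 | 1019 | ±6 | 2.5K | 1.6% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 136 | 71 | Seed 1.8 251228 | 1018 | ±6 | 4.4K | 1.0% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 137 | 139 | GLM 4.6V | 1018 | ±12 | 1.6K | 1.2% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 138 | 148 | Qwen3 30B A3B Thinking 2507 | 1017 | ±9 | 2.2K | 1.8% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 139 | 56 | Claude Opus 4.1 (Thinking) | 1017 | ±8 | 1.5K | 1.3% | <0.1% | 20 tps | 3.9s | 200K | $15.00 | $75.00 |
| 140 | 133 | Kimi K2 0905 | 1013 | ±11 | 2.1K | 3.7% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 141 | 126 | DeepSeek V3 | 1013 | ±6 | 8.8K | 1.3% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 142 | 101 | DeepSeek V3 (Turbo) | 1013 | ±12 | 705 | 1.4% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 143 | 129 | Qwen3 Max Thinking | 1012 | ±6 | 2.1K | 0.2% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 144 | 129 | Command A | 1005 | ±5 | 8.6K | 1.7% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 145 | 143 | Seed 1.6 250615 | 1005 | ±20 | 880 | 2.2% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 146 | 213 | Claude Haiku 3.5 | 1005 | ±12 | 1.5K | 3.0% | 0.8% | 40 tps | 2.8s | 200K | $0.80 | $4.00 |
| 147 | 133 | DeepSeek V3.2 Speciale | 1003 | ±10 | 1.3K | 2.2% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 148 | 113 | Kimi K2 Fast | 1003 | ±4 | 10K | 1.8% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 149 | 113 | Gemini 2.5 Flash Lite Thinking | 1003 | ±8 | 3.7K | 2.4% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13 |
| 150 | 133 | Qwen3 14B | 1002 | ±6 | 3.6K | 1.6% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 151 | 148 | DeepSeek-R1 | 1001 | ±6 | 5K | 1.7% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 152 | 157 | Qwen3 Next 80B A3B Thinking | 1000 | ±7 | 3.2K | 3.0% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 153 | 133 | DeepSeek-R1 0528 | 998 | ±4 | 4.9K | 1.5% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 154 | 292 | GPT-5 Nano Minimal | 992 | ±16 | 515 | 4.6% | <0.1% | 88 tps | 0.8s | 400K | $0.05 | $0.40 |
| 155 | 161 | Qwen3 8B | 992 | ±8 | 3.1K | 1.6% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 156 | 71 | MiniMax M2.5 FP8 | 988 | ±19 | 525 | 1.9% | 3.6% | 33 tps | 1.7s | 205K | $0.45 | $1.75 |
| 157 | 143 | Gemini 2.0 Flash Lite | 988 | ±6 | 4.1K | 2.6% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 158 | 84 | Claude Sonnet 3.7 (Thinking) | 983 | ±6 | 4.8K | 2.1% | <0.1% | 41 tps | 2.6s | 200K | $3.00 | $15.00 |
| 159 | 200 | K2 Think | 982 | ±11 | 895 | 0.6% | <0.1% | 418 tps | 2.8s | N/A | $0 | $0 |
| 160 | 153 | OpenAI o1 | 981 | ±5 | 9.1K | 1.7% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |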
View All (260 models)