Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1039
Command A
1040
GPT-4.1 nano
1044
Qwen3 235B A22B Thinking 2507
1044
GLM 4.5
1048
Mistral Medium
1049
OpenAI o3-pro
1051
Gemini 2.5 Flash Lite Thinking Preview 0925
1051
Qwen3 30B A3B
1051
GPT-5 (Low)
1053
GLM 4.7 FP8
1055
Claude Sonnet 3.5 v2
1055
GPT-4.1 mini
1057
GLM 4.5 AirX
1058
Gemini 2.5 Flash
1058
ERNIE 4.5 300B A47B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
161129Command A1039±282.1K2.2%2.2%42 tps0.8s256K$2.00$7.33
162133GPT-4.1 nano1040±161.4K2.5%0.6%175 tps0.5s1M$0.10$0.40
163124Qwen3 235B A22B Thinking 25071044±36.5K2.4%2.5%53 tps1.6s131K$0.59$5.70
164113GLM 4.51044±215.4K5.0%3.7%46 tps1.4s131K$0.43$1.63
165113Mistral Medium1048±237.8K2.5%1.8%48 tps0.6s33K$1.48$4.55
16681OpenAI o3-pro1049±46.7K3.4%5.2%22 tps70.8s200K$20.00$80.00
16795Gemini 2.5 Flash Lite Thinking Preview 09251051±414.2K4.3%1.5%152 tps3.0s1M$0.10$0.40
168126Qwen3 30B A3B1051±215.1K4.4%5.1%163 tps1.0s41K$0.06$0.21
169101GPT-5 (Low)1051±52.1K1.4%1.8%75 tps8.2s400K$1.25$10.00
170119GLM 4.7 FP81053±62.7K1.1%6.9%40 tps1.3s200K$0.30$1.20
171106Claude Sonnet 3.5 v21055±221.4K1.9%<0.1%46 tps1.4s200K$3.00$15.00
172118GPT-4.1 mini1055±267.2K2.2%1.1%67 tps0.9s1M$0.34$1.60
173113GLM 4.5 AirX1057±44.1K3.0%3.3%75 tps1.2s131K$1.10$4.50
17495Gemini 2.5 Flash1058±1118.2K1.8%1.3%2 tps3.7s1M$0.30$2.50
175119ERNIE 4.5 300B A47B1058±251.6K1.9%4.7%23 tps2.3s123K$0.28$1.10
176121Qwen3 32B Fast1059±225K3.8%11.6%30 tps3.1s41K$0.10$0.25
177143Solar Pro 2 2512151059±99852.5%1.8%107 tps1.5s66K$0.15$0.60
178101Gemini 2.5 Flash Lite1060±250.1K4.8%1.3%210 tps0.7s1M$0.10$0.40
179111Grok 3 Fast1060±312.4K1.1%1.7%52 tps2.4s131K$5.00$25.00
180106Grok 31063±265.8K2.6%1.5%53 tps0.6s1M$3.67$18.33
181121NVIDIA Llama 3.3 Nemotron Super 49B v1.51064±54.5K3.7%2.0%50 tps0.6s131K$0.09$0.33
182101Qwen3.5 35B A3B1064±82.1K2.3%2.1%116 tps2.1s256K$0.63$1.13
18386Claude Sonnet 41065±2113.6K2.4%1.8%49 tps1.3s200K$3.00$15.00
184106DeepSeek V3 03241065±146.7K2.5%5.8%12 tps2.7s164K$0.38$0.93
185121QwQ 32B1068±228.3K4.4%5.4%41 tps2.1s16K$0.43$0.56
18695DeepSeek V3.2 Exp Thinking1068±39.9K2.7%7.2%26 tps3.0s131K$0.28$0.42
18795DeepSeek-R1 Turbo1069±37.3K2.9%2.6%29 tps1.8s64K$2.85$4.75
188106DeepSeek V3.1 Terminus Thinking1071±38.5K4.9%5.9%27 tps1.8s131K$0.56$1.68
18995Kimi K2 Thinking1074±38.3K2.9%4.2%61 tps5.9s262K$0.24$1.03
190153Apriel 1.5 15B Thinker1076±52.1K1.6%2.4%146 tps0.4s131K$0$0
19193DeepSeek V3 0324 Turbo1076±256.4K2.9%6.3%12 tps2.4s164K$0.73$1.79
19271Gemini 2.5 Flash Thinking1076±222.8K2.3%2.2%88 tps6.4s1M$0.30$2.50
19393Qwen Max1078±165.6K2.1%1.5%49 tps1.5s33K$1.60$6.40
194153Ministral 14B 3.01079±53K3.4%2.0%119 tps0.5s128K$0.20$0.20
19584GPT-5 Mini Minimal1081±36.8K6.5%1.2%63 tps1.4s400K$0.25$2.00
19671GPT-5 Mini1082±217.5K4.3%2.6%66 tps14.2s400K$0.25$2.00
197111LongCat Flash Chat1082±46.5K3.2%0.8%85 tps0.9s131K$0.14$0.68
19886Seed 2.0 Lite (Medium)1082±62.1K1.9%6.6%33 tps1.6s256K$0.25$2.00
19971Gemini 2.5 Flash Lite Preview 09251083±220.9K4.8%1.2%209 tps0.7s1M$0.25$0.35
20056Gemini 3.1 Flash Lite Preview Thinking1084±63.6K3.1%1.7%75 tps4.7s1M$0.25$1.50
View All (288 models)