Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1071
LongCat Flash Chat
1067
DeepSeek V3 0324
1067
GLM 4.5 AirX
1066
Gemini 2.5 Flash Lite
1063
DeepSeek V3.1 Thinking
1061
Qwen3 235B A22B Thinking 2507
1060
Gemini 2.5 Flash
1059
Apriel 1.5 15B Thinker
1057
GLM 4.7 FP8
1054
Gemini 2.5 Flash Thinking
1053
Solar Pro 2 251215
1052
Gemini 2.5 Flash Lite Thinking Preview 0925
1050
GLM 4.5
1049
Grok 3
1049
ERNIE 4.5 300B A47B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81111LongCat Flash Chat1071±43.8K1.9%0.8%85 tps0.9s131K$0.14$0.68
82106DeepSeek V3 03241067±237.1K1.0%5.8%12 tps2.7s164K$0.38$0.93
83113GLM 4.5 AirX1067±53.5K1.8%3.3%75 tps1.2s131K$1.10$4.50
84101Gemini 2.5 Flash Lite1066±236K1.7%1.3%210 tps0.7s1M$0.10$0.40
85129DeepSeek V3.1 Thinking1063±49.3K2.2%7.1%18 tps1.8s131K$0.23$0.75
86124Qwen3 235B A22B Thinking 25071061±43.6K1.2%2.5%53 tps1.6s131K$0.59$5.70
8795Gemini 2.5 Flash1060±297.6K1.0%1.3%2 tps3.7s1M$0.30$2.50
88153Apriel 1.5 15B Thinker1059±81.5K2.5%2.4%146 tps0.4s131K$0$0
89119GLM 4.7 FP81057±62.2K1.6%6.9%40 tps1.3s200K$0.30$1.20
9071Gemini 2.5 Flash Thinking1054±47.9K1.4%2.2%88 tps6.4s1M$0.30$2.50
91143Solar Pro 2 2512151053±97551.3%1.8%107 tps1.5s66K$0.15$0.60
9295Gemini 2.5 Flash Lite Thinking Preview 09251052±29.4K2.4%1.5%152 tps3.0s1M$0.10$0.40
93113GLM 4.51050±212.3K1.6%3.7%46 tps1.4s131K$0.43$1.63
94106Grok 31049±256K1.4%1.5%53 tps0.6s1M$3.67$18.33
95119ERNIE 4.5 300B A47B1049±244.6K1.0%4.7%23 tps2.3s123K$0.28$1.10
96113Gemini 2.5 Flash Lite Thinking1048±312.2K2.2%1.0%118 tps4.4s1M$0.03$0.13
97113Mistral Medium1044±233.2K1.3%1.8%48 tps0.6s33K$1.48$4.55
9886Seed 2.0 Lite (Medium)1043±91.2K2.0%6.6%33 tps1.6s256K$0.25$2.00
9984MiniMax M2.51038±111.5K2.0%1.4%70 tps1.9s205K$0.28$1.20
100124Kimi K2 0905 Turbo1037±318.8K2.4%0.7%373 tps0.5s262K$1.70$6.50
101106Claude Sonnet 3.5 v21035±416.6K1.0%<0.1%46 tps1.4s200K$3.00$15.00
102148Qwen3 30B A3B Thinking 25071035±44K1.4%0.5%124 tps1.2s131K$0.16$1.70
103118GPT-4.1 mini1031±257.7K1.3%1.1%67 tps0.9s1M$0.34$1.60
104153Ministral 14B 3.01031±62.3K3.1%2.0%119 tps0.5s128K$0.20$0.20
105111Grok 3 Fast1030±312K1.1%1.7%52 tps2.4s131K$5.00$25.00
106126Qwen3 VL 235B A22B Thinking1027±47.3K2.9%4.3%47 tps3.0s127K$0.47$3.31
107133DeepSeek V3.2 Speciale1027±55.9K2.2%6.0%43 tps1.4s131K$0.84$1.52
108165Qwen3 4B1027±49.4K3.3%1.9%94 tps1.5s128K$0.01$0.01
10986Claude Sonnet 41026±288.9K1.5%1.8%49 tps1.3s200K$3.00$15.00
110143Mistral Medium 31023±91.2K1.7%2.4%47 tps0.8s33K$0.40$2.00
111161Qwen3 8B1020±56.1K2.6%2.4%61 tps1.4s41K$0.02$0.07
112143Seed 1.6 2506151018±43.6K1.6%3.1%46 tps2.2s256K$0.25$2.00
113148OpenAI o31018±64.2K1.8%0.9%85 tps6.8s128K$7.33$29.33
114133Kimi K2 09051016±49.2K2.1%4.0%30 tps1.4s262K$0.63$2.39
115157Qwen3 Next 80B A3B Thinking1015±312.3K2.3%0.6%175 tps1.3s256K$0.21$2.26
116133GPT-4.1 nano1014±252.1K1.3%0.6%175 tps0.5s1M$0.10$0.40
117101Qwen3.5 35B A3B1011±151.1K1.4%2.1%116 tps2.1s256K$0.63$1.13
118179Baichuan-M2-32B1011±81.4K2.7%<0.1%32 tps3.3s131K$0.07$0.07
119139Seed 2.0 Mini (Medium)1010±101.3K2.6%11.9%33 tps1.7s256K$0.15$0.60
120148OpenAI o4-mini-high1009±219.5K2.1%1.9%117 tps15.9s200K$1.10$4.40
View All (203 models)