Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1048
Gemini 2.5 Flash Lite Thinking
1049
ERNIE 4.5 300B A47B
1049
Grok 3
1050
GLM 4.5
1052
Gemini 2.5 Flash Lite Thinking Preview 0925
1053
Solar Pro 2 251215
1054
Gemini 2.5 Flash Thinking
1057
GLM 4.7 FP8
1058
QwQ 32B
1059
Apriel 1.5 15B Thinker
1060
Qwen3 32B Fast
1060
Gemini 2.5 Flash
1060
MiniMax M2.5 Lightning
1061
Qwen3 235B A22B Thinking 2507
1063
DeepSeek V3.1 Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
161113Gemini 2.5 Flash Lite Thinking1048±312.2K2.2%1.0%118 tps4.4s1M$0.03$0.13
162119ERNIE 4.5 300B A47B1049±244.6K1.0%4.7%23 tps2.3s123K$0.28$1.10
163106Grok 31049±256K1.4%1.5%53 tps0.6s1M$3.67$18.33
164113GLM 4.51050±212.3K1.6%3.7%46 tps1.4s131K$0.43$1.63
16595Gemini 2.5 Flash Lite Thinking Preview 09251052±29.4K2.4%1.5%152 tps3.0s1M$0.10$0.40
166143Solar Pro 2 2512151053±97551.3%1.8%107 tps1.5s66K$0.15$0.60
16771Gemini 2.5 Flash Thinking1054±47.9K1.4%2.2%88 tps6.4s1M$0.30$2.50
168119GLM 4.7 FP81057±62.2K1.6%6.9%40 tps1.3s200K$0.30$1.20
169121QwQ 32B1058±315.3K2.1%5.4%41 tps2.1s16K$0.43$0.56
170153Apriel 1.5 15B Thinker1059±81.5K2.5%2.4%146 tps0.4s131K$0$0
171121Qwen3 32B Fast1060±411.4K2.3%11.6%30 tps3.1s41K$0.10$0.25
17295Gemini 2.5 Flash1060±297.6K1.0%1.3%2 tps3.7s1M$0.30$2.50
17379MiniMax M2.5 Lightning1060±54.1K1.5%1.5%51 tps2.0s205K$0.60$2.40
174124Qwen3 235B A22B Thinking 25071061±43.6K1.2%2.5%53 tps1.6s131K$0.59$5.70
175129DeepSeek V3.1 Thinking1063±49.3K2.2%7.1%18 tps1.8s131K$0.23$0.75
17681Qwen3.5 27B1065±131.2K2.4%3.7%55 tps2.6s256K$0.30$2.40
177101Gemini 2.5 Flash Lite1066±236K1.7%1.3%210 tps0.7s1M$0.10$0.40
178113GLM 4.5 AirX1067±53.5K1.8%3.3%75 tps1.2s131K$1.10$4.50
179148DeepSeek-R11067±54.9K1.4%0.8%133 tps0.6s64K$0.91$3.07
180113Kimi K2 Fast1067±294.2K1.2%0.8%365 tps0.5s131K$1.00$3.00
181106DeepSeek V3 03241067±237.1K1.0%5.8%12 tps2.7s164K$0.38$0.93
182133DeepSeek-R1 05281070±64.4K1.6%1.3%93 tps0.5s64K$1.60$3.67
183111LongCat Flash Chat1071±43.8K1.9%0.8%85 tps0.9s131K$0.14$0.68
18422MiniMax M2.7-highspeed1071±116752.2%2.3%50 tps2.1s205K$0.60$2.40
185126Qwen3 30B A3B1073±49.6K2.1%5.1%163 tps1.0s41K$0.06$0.21
18671Gemini 3.1 Flash Lite Preview1073±111.3K2.2%1.0%8 tps1.2s1M$0.25$1.50
187121NVIDIA Llama 3.3 Nemotron Super 49B v1.51074±63.5K2.2%2.0%50 tps0.6s131K$0.09$0.33
18865GLM 4.61075±311.7K2.8%5.4%39 tps1.5s200K$0.42$1.66
189161DeepSeek Prover v21075±101.4K1.4%5.2%14 tps1.3s164K$0.40$1.56
190106DeepSeek V3.1 Terminus Thinking1075±46.7K1.9%5.9%27 tps1.8s131K$0.56$1.68
191133Nemotron 3 Nano1076±81.6K1.9%1.3%216 tps0.8s256K$0.05$4.94
192101gpt-oss-20b1080±214.2K1.7%0.5%216 tps0.5s131K$0.06$0.26
19393DeepSeek V3 0324 Turbo1081±350.9K1.4%6.3%12 tps2.4s164K$0.73$1.79
19495DeepSeek V3.2 Exp Thinking1084±54.8K1.9%7.2%26 tps3.0s131K$0.28$0.42
19593Qwen Max1084±254.8K0.9%1.5%49 tps1.5s33K$1.60$6.40
19695Qwen3 32B1085±52.6K1.5%3.9%30 tps3.1s41K$0.12$0.42
19771GPT-5 Mini1087±311.3K2.1%2.6%66 tps14.2s400K$0.25$2.00
19886DeepSeek V3.1 Chat1087±310.7K1.8%2.8%21 tps1.6s131K$0.38$1.00
19971Gemini 2.5 Flash Lite Preview 09251087±215.1K2.5%1.2%209 tps0.7s1M$0.25$0.35
200133Qwen3 14B1088±48.2K2.3%1.7%109 tps0.8s41K$0.04$0.15
View All (283 models)