Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1079
Ministral 14B 3.0
1078
Qwen Max
1077
Solar Pro 2 250710
1076
Gemini 2.5 Flash Thinking
1076
DeepSeek V3 0324 Turbo
1076
Apriel 1.5 15B Thinker
1075
Arcee AI Maestro Reasoning
1074
Kimi K2 Thinking
1072
Llama 3.1 405B Instruct
1071
DeepSeek V3.1 Terminus Thinking
1070
Claude Sonnet 3.7
1069
DeepSeek-R1 Turbo
1068
DeepSeek V3.2 Exp Thinking
1068
QwQ 32B
1066
Llama 3 8B Turbo

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121153Ministral 14B 3.01079±53K3.4%2.0%119 tps0.5s128K$0.20$0.20
12293Qwen Max1078±165.6K2.1%1.5%49 tps1.5s33K$1.60$6.40
123133Solar Pro 2 2507101077±228.1K4.6%<0.1%9 tpsN/A66K$0.50$0.50
12471Gemini 2.5 Flash Thinking1076±222.8K2.3%2.2%88 tps6.4s1M$0.30$2.50
12593DeepSeek V3 0324 Turbo1076±256.4K2.9%6.3%12 tps2.4s164K$0.73$1.79
126153Apriel 1.5 15B Thinker1076±52.1K1.6%2.4%146 tps0.4s131K$0$0
127147Arcee AI Maestro Reasoning1075±313.7K2.8%<0.1%85 tps0.3s131K$0.90$3.30
12895Kimi K2 Thinking1074±38.3K2.9%4.2%61 tps5.9s262K$0.24$1.03
129159Llama 3.1 405B Instruct1072±81.7K2.0%<0.1%52 tps0.5s128K$2.60$4.27
130106DeepSeek V3.1 Terminus Thinking1071±38.5K4.9%5.9%27 tps1.8s131K$0.56$1.68
131111Claude Sonnet 3.71070±237.9K1.9%<0.1%39 tps1.6s200K$3.00$15.00
13295DeepSeek-R1 Turbo1069±37.3K2.9%2.6%29 tps1.8s64K$2.85$4.75
13395DeepSeek V3.2 Exp Thinking1068±39.9K2.7%7.2%26 tps3.0s131K$0.28$0.42
134121QwQ 32B1068±228.3K4.4%5.4%41 tps2.1s16K$0.43$0.56
135200Llama 3 8B Turbo1066±62.7K1.3%<0.1%97 tps0.1s8K$0.12$0.13
136106DeepSeek V3 03241065±146.7K2.5%5.8%12 tps2.7s164K$0.38$0.93
13786Claude Sonnet 41065±2113.6K2.4%1.8%49 tps1.3s200K$3.00$15.00
138101Qwen3.5 35B A3B1064±82.1K2.3%2.1%116 tps2.1s256K$0.63$1.13
139121NVIDIA Llama 3.3 Nemotron Super 49B v1.51064±54.5K3.7%2.0%50 tps0.6s131K$0.09$0.33
140106Grok 31063±265.8K2.6%1.5%53 tps0.6s1M$3.67$18.33
141111Grok 3 Fast1060±312.4K1.1%1.7%52 tps2.4s131K$5.00$25.00
142101Gemini 2.5 Flash Lite1060±250.1K4.8%1.3%210 tps0.7s1M$0.10$0.40
143143Solar Pro 2 2512151059±99852.5%1.8%107 tps1.5s66K$0.15$0.60
144121Qwen3 32B Fast1059±225K3.8%11.6%30 tps3.1s41K$0.10$0.25
145177Llama 3 70B Turbo1058±218.1K0.8%<0.1%31 tps0.0s8K$0.73$0.83
146119ERNIE 4.5 300B A47B1058±251.6K1.9%4.7%23 tps2.3s123K$0.28$1.10
14795Gemini 2.5 Flash1058±1118.2K1.8%1.3%2 tps3.7s1M$0.30$2.50
148113GLM 4.5 AirX1057±44.1K3.0%3.3%75 tps1.2s131K$1.10$4.50
149118GPT-4.1 mini1055±267.2K2.2%1.1%67 tps0.9s1M$0.34$1.60
150106Claude Sonnet 3.5 v21055±221.4K1.9%<0.1%46 tps1.4s200K$3.00$15.00
151119GLM 4.7 FP81053±62.7K1.1%6.9%40 tps1.3s200K$0.30$1.20
152147GLM 4.5 Air1052±218.5K5.3%<0.1%22 tps1.4s131K$0.10$0.38
153101GPT-5 (Low)1051±52.1K1.4%1.8%75 tps8.2s400K$1.25$10.00
154126Qwen3 30B A3B1051±215.1K4.4%5.1%163 tps1.0s41K$0.06$0.21
15595Gemini 2.5 Flash Lite Thinking Preview 09251051±414.2K4.3%1.5%152 tps3.0s1M$0.10$0.40
15681OpenAI o3-pro1049±46.7K3.4%5.2%22 tps70.8s200K$20.00$80.00
157113Mistral Medium1048±237.8K2.5%1.8%48 tps0.6s33K$1.48$4.55
158200K2 Think1047±44.7K2.1%<0.1%418 tps2.8sN/A$0$0
159182Fauna Fox1045±312.4K3.3%<0.1%194 tps0.3s128K$0.04$0.15
160113GLM 4.51044±215.4K5.0%3.7%46 tps1.4s131K$0.43$1.63
View All (432 models)