Last updated about 1 month ago

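| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 161 | 113 | Mistral Medium | 1087 | ±4 | 5.3K | 9.0% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 162 | 106 | DeepSeek V3 0324 | 1090 | ±5 | 9.7K | 8.2% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93 |
| 163 | 71 | Seed 1.8 251228 | 1096 | ±6 | 4.1K | 3.4% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 164 | 179 | Switchpoint Router | 1097 | ±11 | 1.1K | 9.5% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 165 | 95 | Gemini 2.5 Flash | 1098 | ±5 | 21.4K | 5.2% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 166 | 44 | DeepSeek V3.1 Terminus Chat | 1100 | ±4 | 5.1K | 9.6% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 167 | 86 | Claude Sonnet 4 | 1102 | ±4 | 18.3K | 7.0% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 168 | 106 | Grok 3 | 1102 | ±5 | 9.3K | 9.3% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 169 | 95 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1104 | ±5 | 4.9K | 7.8% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40 |
| 170 | 56 | DeepSeek V3.1 Turbo | 1106 | ±9 | 2.6K | 5.1% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 171 | 33 | Qwen3 30B A3B Instruct 2507 | 1108 | ±5 | 8.5K | 9.7% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 172 | 65 | Mistral Large 3 | 1108 | ±7 | 4K | 6.3% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 173 | 106 | Claude Sonnet 3.5 v2 | 1109 | ±7 | 2.9K | 8.2% | <0.1% | 46 tps | 1.4s | 200K | $3.00 | $15.00 |
| 174 | 52 | Grok 4 Fast Non-Reasoning | 1110 | ±5 | 7.1K | 8.3% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 |
| 175 | 60 | Gemini 2.5 Flash Preview 0925 | 1110 | ±4 | 6.7K | 7.5% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 176 | 68 | GLM 4.7 | 1112 | ±8 | 8.8K | 4.7% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 177 | 71 | Gemini 3.1 Flash Lite Preview | 1114 | ±27 | 630 | 2.3% | 1.0% | 8 tps | 1.2s | 1M | $0.25 | $1.50 |
| 178 | 29 | Nova Experimental Chat 12-10 | 1115 | ±8 | 2.2K | 4.8% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0 |
| 179 | 68 | Qwen Plus (Aug'24) | 1116 | ±5 | 8.9K | 9.4% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 180 | 56 | DeepSeek V3.2 Thinking | 1117 | ±6 | 10K | 3.8% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 181 | 81 | GPT-4o | 1124 | ±5 | 6.5K | 6.1% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 182 | 37 | Kimi K2.5 Instant | 1124 | ±13 | 1.4K | 2.4% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 183 | 52 | Qwen3.5 122B A17B | 1124 | ±17 | 1.1K | 3.2% | 1.5% | 82 tps | 1.4s | 256K | $0.40 | $3.20 |
| 184 | 48 | Grok 4 Fast Reasoning | 1125 | ±5 | 11.8K | 5.5% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 185 | 79 | MiniMax M2.5 Lightning | 1128 | ±14 | 995 | 2.5% | 1.5% | 51 tps | 2.0s | 205K | $0.60 | $2.40 |
| 186 | 40 | DeepSeek V3.2 | 1130 | ±5 | 4.4K | 5.1% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09 |
| 187 | 71 | GPT-5 Mini | 1130 | ±4 | 6.1K | 7.9% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 188 | 68 | Grok 4 | 1130 | ±2 | 23.2K | 6.4% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 189 | 62 | GPT-5.1 Instant | 1134 | ±5 | 5.5K | 5.7% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 190 | 52 | Claude Haiku 4.5 | 1134 | ±5 | 9.9K | 6.9% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 191 | 65 | GLM 4.6 | 1136 | ±5 | 14.1K | 4.7% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 192 | 26 | Grok 4.1 Fast Non-Reasoning | 1137 | ±5 | 7.4K | 6.6% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 193 | 52 | GPT-5 | 1138 | ±4 | 14K | 7.9% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 194 | 84 | GPT-5 Mini Minimal | 1139 | ±8 | 2.8K | 9.7% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 195 | 33 | Qwen3 Next 80B A3B Instruct | 1141 | ±4 | 7.6K | 7.7% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 196 | 33 | Grok 4.20 Multi Agent Beta | 1143 | ±16 | 765 | 1.9% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 |
| 197 | 40 | Qwen3 235B A22B Instruct 2507 | 1146 | ±3 | 8.8K | 12.2% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 198 | 44 | Grok 4.1 Fast Reasoning | 1149 | ±6 | 21.2K | 4.2% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 |
| 199 | 60 | MiniMax M2.1 | 1149 | ±6 | 10.4K | 4.3% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 200 | 42 | Qwen3 Max Instruct Preview | 1150 | ±4 | 13.5K | 5.8% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |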
View All (237 models)