Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1110
Qwen Plus 0728
1110
Gemini 2.5 Flash Preview 0925
1110
Grok 4 Fast Non-Reasoning
1109
Claude Sonnet 3.5 v2
1108
Mistral Large 3
1108
GPT-5 Mini Low
1108
Qwen3 30B A3B Instruct 2507
1108
GPT-4.5 Preview
1106
DeepSeek V3.1 Turbo
1104
Gemini 2.5 Flash Lite Thinking Preview 0925
1102
Grok 3
1102
Claude Sonnet 4
1100
DeepSeek V3.1 Terminus Chat
1100
Gemini 2.5 Pro Preview 0325
1098
Gemini 2.5 Flash

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8133Qwen Plus 07281110±101.7K9.8%<0.1%55 tps0.9s1M$0.40$1.20
8260Gemini 2.5 Flash Preview 09251110±46.7K7.5%1.2%5 tps0.9s1M$0.13$0.97
8352Grok 4 Fast Non-Reasoning1110±57.1K8.3%1.5%93 tps0.6s2M$0.27$0.67
84106Claude Sonnet 3.5 v21109±72.9K8.2%<0.1%46 tps1.4s200K$3.00$15.00
8565Mistral Large 31108±74K6.3%2.1%51 tps1.0s256K$0.50$1.50
86108GPT-5 Mini Low1108±42.5K9.0%<0.1%69 tps3.2s400K$0.25$2.00
8733Qwen3 30B A3B Instruct 25071108±58.5K9.7%1.2%55 tps1.3s131K$0.13$0.72
8877GPT-4.5 Preview1108±101.7K2.1%<0.1%36 tps3.0s200K$75.00$150.00
8956DeepSeek V3.1 Turbo1106±92.6K5.1%0.9%173 tps1.3s164K$2.00$3.75
9095Gemini 2.5 Flash Lite Thinking Preview 09251104±54.9K7.8%1.5%152 tps3.0s1M$0.10$0.40
91106Grok 31102±59.3K9.3%1.5%53 tps0.6s1M$3.67$18.33
9286Claude Sonnet 41102±418.3K7.0%1.8%49 tps1.3s200K$3.00$15.00
9344DeepSeek V3.1 Terminus Chat1100±45.1K9.6%3.4%27 tps1.5s131K$0.86$1.80
94159Gemini 2.5 Pro Preview 03251100±184755.0%<0.1%3 tps16.6s1M$1.25$10.00
9595Gemini 2.5 Flash1098±521.4K5.2%1.3%2 tps3.7s1M$0.30$2.50
96179Switchpoint Router1097±111.1K9.5%1.7%71 tps4.9s131K$0.85$3.40
9771Seed 1.8 2512281096±64.1K3.4%3.7%41 tps2.1s256K$0.25$2.00
98106DeepSeek V3 03241090±59.7K8.2%5.8%12 tps2.7s164K$0.38$0.93
99113Mistral Medium1087±45.3K9.0%1.8%48 tps0.6s33K$1.48$4.55
10062MiniMax M21087±616.5K5.2%2.2%39 tps2.3s205K$0.21$0.85
10148gpt-oss-120b1086±415.1K7.5%0.7%213 tps0.5s131K$0.11$0.50
10262Qwen3 Omni 30B A3B Instruct1085±145706.6%3.9%65 tps1.2s66K$0.35$0.97
10386DeepSeek V3.1 Chat1084±63.7K10.1%2.8%21 tps1.6s131K$0.38$1.00
10481Qwen3.5 27B1082±265502.7%3.7%55 tps2.6s256K$0.30$2.40
10548Step 3.5 Flash1079±236452.3%2.2%109 tps0.6s256K$0.05$0.15
10665DeepSeek V3.2 Exp Chat1079±44.3K8.8%2.6%29 tps1.5s131K$0.27$0.39
10793Qwen Max1077±58.8K9.1%1.5%49 tps1.5s33K$1.60$6.40
108182GLM 4.6 FP81076±1085016.7%<0.1%56 tps1.8s200K$0.40$1.75
10937Nova Experimental Chat 10-201076±43.6K11.6%<0.1%30 tps0.5s98K$0$0
11086Qwen3 235B A22B1074±102.8K14.4%5.3%71 tps0.9s41K$0.23$0.63
11171Gemini 2.5 Flash Lite Preview 09251070±56.7K8.6%1.2%209 tps0.7s1M$0.25$0.35
11295Qwen3 32B1070±186207.5%3.9%30 tps3.1s41K$0.12$0.42
11395Kimi K2 Thinking1064±121.6K6.8%4.2%61 tps5.9s262K$0.24$1.03
11495DeepSeek V3.2 Exp Thinking1063±85K3.4%7.2%26 tps3.0s131K$0.28$0.42
11548OpenAI o1-mini1062±46.2K12.1%<0.1%118 tpsN/A128K$1.13$4.51
116118GPT-4.1 mini1060±411.7K6.8%1.1%67 tps0.9s1M$0.34$1.60
117113Gemini 2.5 Flash Lite Thinking1059±56.6K9.5%1.0%118 tps4.4s1M$0.03$0.13
11893DeepSeek V3 0324 Turbo1055±59.3K10.3%6.3%12 tps2.4s164K$0.73$1.79
11971Qwen3.5 397B A17B1055±111.6K2.1%4.3%57 tps1.4s256K$0.52$3.00
120139Seed 2.0 Mini (Medium)1053±216054.0%11.9%33 tps1.7s256K$0.15$0.60
View All (312 models)