Models

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

Last updated about 1 month ago

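| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 86 | Gemini 2.5 Flash Preview | 1161 | ±8 | 3K | 1.1% | <0.1% | 138 tps | 6.9s | 1M | $0.15 | $0.60 |
| 82 | 86 | GPT-5 (Minimal) | 1158 | ±5 | 8.3K | 7.4% | <0.1% | 67 tps | 1.4s | 400K | $1.25 | $10.00 |
| 83 | 69 | DeepSeek V3.1 Terminus Chat | 1158 | ±5 | 6.5K | 6.9% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 84 | 74 | Qwen Plus (Aug'24) | 1146 | ±5 | 17.2K | 4.7% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 85 | 74 | Qwen3.5 397B A17B | 1142 | ±14 | 2.5K | 2.9% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 86 | 74 | Gemini 2.5 Flash Preview 0925 | 1140 | ±6 | 7.6K | 6.0% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 87 | 93 | Gemini 2.5 Flash Preview Thinking | 1136 | ±10 | 1.4K | 1.8% | <0.1% | 26 tps | 1.8s | 1M | $0.15 | $1.76 |
| 88 | 77 | GPT-5 Mini | 1131 | ±5 | 8.6K | 5.4% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 89 | 77 | DeepSeek V3.1 Turbo | 1130 | ±7 | 4.8K | 5.3% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 90 | 77 | Grok 4.20 Multi Agent Beta | 1129 | ±19 | 945 | 3.6% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 |
| 91 | 77 | Qwen3 Max Thinking Preview | 1127 | ±10 | 6.3K | 5.7% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 92 | 77 | Grok 4 | 1125 | ±3 | 39.6K | 4.4% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 93 | 97 | Ministral 8B 2512 | 1125 | ±15 | 510 | 7.3% | <0.1% | 174 tps | 0.5s | 128K | $0.15 | $0.15 |
| 94 | 77 | GPT-4.1 | 1123 | ±5 | 32.8K | 5.2% | 3.7% | 112 tps | 1.3s | 1M | $2.00 | $8.00 |
| 95 | 77 | Gemini 2.5 Flash Lite Preview 0925 | 1122 | ±7 | 8.5K | 6.6% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 96 | 97 | Gemini 2.5 Pro Preview 0605 | 1121 | ±10 | 1.7K | 2.3% | <0.1% | 0 tps | 3.7s | 1M | $1.25 | $10.00 |
| 97 | 85 | Gemini 2.5 Flash Thinking | 1118 | ±4 | 13.7K | 3.6% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 98 | 85 | GPT-5 Mini Minimal | 1114 | ±12 | 3.2K | 8.5% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 99 | 85 | GPT-5.2 Codex (Low) | 1113 | ±19 | 1.2K | 3.2% | 4.5% | 41 tps | 5.0s | 400K | $1.75 | $14.00 |
| 100 | 108 | Gemini 2.5 Pro Preview 0325 | 1111 | ±11 | 1.5K | 3.2% | <0.1% | 3 tps | 16.6s | 1M | $1.25 | $10.00 |
| 101 | 85 | DeepSeek V3.1 Chat | 1110 | ±7 | 4.9K | 6.6% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 102 | 85 | Qwen3 Omni 30B A3B Thinking | 1110 | ±10 | 2.3K | 6.0% | 3.7% | 67 tps | 1.2s | 66K | $0.97 | $1.79 |
| 103 | 90 | Qwen Max | 1107 | ±4 | 18.3K | 4.2% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 104 | 114 | GPT-5 Mini Low | 1104 | ±8 | 2.8K | 7.2% | <0.1% | 69 tps | 3.2s | 400K | $0.25 | $2.00 |
| 105 | 90 | Gemini 2.5 Flash Lite | 1103 | ±5 | 21.3K | 6.2% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 106 | 90 | Grok 3 Fast | 1102 | ±14 | 2.5K | 4.7% | 1.7% | 52 tps | 2.4s | 131K | $5.00 | $25.00 |
| 107 | 90 | GPT-4o | 1102 | ±5 | 8.5K | 3.7% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 108 | 90 | DeepSeek V3 0324 | 1100 | ±4 | 15.1K | 4.3% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93 |
| 109 | 90 | Qwen3 Coder 480B A35B Instruct | 1099 | ±8 | 3.1K | 4.5% | 3.3% | 61 tps | 2.0s | 262K | $0.71 | $1.34 |
| 110 | 98 | Gemini 2.5 Flash | 1098 | ±4 | 35.9K | 3.2% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 111 | 98 | Grok 3 | 1098 | ±4 | 19.1K | 5.5% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 112 | 98 | DeepSeek V3 0324 Turbo | 1093 | ±5 | 15.5K | 5.7% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 113 | 123 | Nova Experimental Chat 10-09 | 1091 | ±7 | 3.2K | 10.7% | <0.1% | 59 tps | 6.1s | 98K | $0 | $0 |
| 114 | 123 | Sherlock Dash Alpha | 1090 | ±19 | 835 | 6.7% | <0.1% | 68 tps | 0.7s | 2M | $0 | $0 |
| 115 | 98 | OpenAI o3-pro | 1090 | ±8 | 5.4K | 4.3% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 116 | 98 | DeepSeek V3.1 | 1089 | ±12 | 2.3K | 4.7% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 117 | 132 | Claude Sonnet 3.5 | 1088 | ±10 | 2.9K | 4.9% | 1.0% | 40 tps | 2.7s | 200K | $3.00 | $15.00 |
| 118 | 132 | Qwen Plus 0728 (Thinking) | 1087 | ±9 | 1.2K | 8.9% | <0.1% | 56 tps | 1.1s | 1M | $0.40 | $4.00 |
| 119 | 105 | GPT-4.1 mini | 1087 | ±5 | 19.7K | 4.2% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 120 | 105 | GPT-4.1 nano | 1085 | ±5 | 17K | 5.0% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |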
Showing ranks 81 to 120 of 305 models.