Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1089
DeepSeek V3.1
1090
OpenAI o3-pro
1093
DeepSeek V3 0324 Turbo
1098
Grok 3
1098
Gemini 2.5 Flash
1099
Qwen3 Coder 480B A35B Instruct
1100
DeepSeek V3 0324
1102
GPT-4o
1102
Grok 3 Fast
1103
Gemini 2.5 Flash Lite
1107
Qwen Max
1110
Qwen3 Omni 30B A3B Thinking
1110
DeepSeek V3.1 Chat
1113
GPT-5.2 Codex (Low)
1114
GPT-5 Mini Minimal

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
12198DeepSeek V3.11089±122.3K4.7%0.8%197 tps0.4s164K$0.55$1.60
12298OpenAI o3-pro1090±85.4K4.3%5.2%22 tps70.8s200K$20.00$80.00
12398DeepSeek V3 0324 Turbo1093±515.5K5.7%6.3%12 tps2.4s164K$0.73$1.79
12498Grok 31098±419.1K5.5%1.5%53 tps0.6s1M$3.67$18.33
12598Gemini 2.5 Flash1098±435.9K3.2%1.3%2 tps3.7s1M$0.30$2.50
12690Qwen3 Coder 480B A35B Instruct1099±83.1K4.5%3.3%61 tps2.0s262K$0.71$1.34
12790DeepSeek V3 03241100±415.1K4.3%5.8%12 tps2.7s164K$0.38$0.93
12890GPT-4o1102±58.5K3.7%1.0%49 tps2.4s128K$3.71$12.57
12990Grok 3 Fast1102±142.5K4.7%1.7%52 tps2.4s131K$5.00$25.00
13090Gemini 2.5 Flash Lite1103±521.3K6.2%1.3%210 tps0.7s1M$0.10$0.40
13190Qwen Max1107±418.3K4.2%1.5%49 tps1.5s33K$1.60$6.40
13285Qwen3 Omni 30B A3B Thinking1110±102.3K6.0%3.7%67 tps1.2s66K$0.97$1.79
13385DeepSeek V3.1 Chat1110±74.9K6.6%2.8%21 tps1.6s131K$0.38$1.00
13485GPT-5.2 Codex (Low)1113±191.2K3.2%4.5%41 tps5.0s400K$1.75$14.00
13585GPT-5 Mini Minimal1114±123.2K8.5%1.2%63 tps1.4s400K$0.25$2.00
13685Gemini 2.5 Flash Thinking1118±413.7K3.6%2.2%88 tps6.4s1M$0.30$2.50
13777Gemini 2.5 Flash Lite Preview 09251122±78.5K6.6%1.2%209 tps0.7s1M$0.25$0.35
13877GPT-4.11123±532.8K5.2%3.7%112 tps1.3s1M$2.00$8.00
13977Grok 41125±339.6K4.4%3.9%29 tps11.1s256K$3.00$15.00
14077Qwen3 Max Thinking Preview1127±106.3K5.7%3.1%40 tps2.1s256K$1.20$6.00
14177Grok 4.20 Multi Agent Beta1129±199453.6%1.2%56 tps8.8s2M$2.00$6.00
14277DeepSeek V3.1 Turbo1130±74.8K5.3%0.9%173 tps1.3s164K$2.00$3.75
14377GPT-5 Mini1131±58.6K5.4%2.6%66 tps14.2s400K$0.25$2.00
14474Gemini 2.5 Flash Preview 09251140±67.6K6.0%1.2%5 tps0.9s1M$0.13$0.97
14574Qwen3.5 397B A17B1142±142.5K2.9%4.3%57 tps1.4s256K$0.52$3.00
14674Qwen Plus (Aug'24)1146±517.2K4.7%1.4%53 tps1.3s30K$0.40$1.20
14769DeepSeek V3.1 Terminus Chat1158±56.5K6.9%3.4%27 tps1.5s131K$0.86$1.80
14869GLM 4.71161±716.8K3.7%5.8%40 tps1.5s200K$0.77$1.73
14969GPT-5 Codex (Low)1163±105K4.1%2.7%112 tps3.5s400K$1.25$10.00
15069Qwen3.5 35B A3B1164±258653.9%2.1%116 tps2.1s256K$0.63$1.13
15160Grok 4.20 Beta Reasoning1167±221.2K4.1%1.1%77 tps4.5s2M$2.00$5.50
15260GPT-5.1 Instant1171±88.3K4.1%1.3%50 tps1.9s400K$1.25$10.00
15360GPT-5.1 Codex (Medium)1171±143K3.2%4.6%71 tps3.7s400K$1.25$10.00
15460Claude Sonnet 3.5 v21171±65.5K3.4%<0.1%46 tps1.4s200K$3.00$15.00
15560Qwen3 235B A22B Instruct 25071172±412.6K6.4%6.8%13 tps1.9s262K$0.13$0.52
15660Gemini 2.5 Pro1176±337.9K4.8%2.3%45 tps2.6s1M$1.25$10.00
15760Grok 4 Fast Reasoning1177±314.5K5.0%2.1%102 tps3.1s2M$0.30$0.75
15860Grok 4.1 Fast Reasoning1178±739.5K4.4%1.5%58 tps7.3s2M$0.20$0.50
15949GPT-5.3 Codex (Low)1178±285101.0%1.8%61 tps4.3s400K$1.75$14.00
16049GLM 4.61182±717.2K4.4%5.4%39 tps1.5s200K$0.42$1.66
View All (210 models)