Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1103
Gemini 2.5 Flash Lite
1102
Grok 3 Fast
1102
GPT-4o
1100
DeepSeek V3 0324
1099
Qwen3 Coder 480B A35B Instruct
1098
Gemini 2.5 Flash
1098
Grok 3
1093
DeepSeek V3 0324 Turbo
1090
OpenAI o3-pro
1089
DeepSeek V3.1
1087
GPT-4.1 mini
1085
GPT-4.1 nano
1085
Qwen3 Omni 30B A3B Instruct
1082
DeepSeek V3 (Turbo)
1081
Seed 1.8 251228

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8190Gemini 2.5 Flash Lite1103±521.3K6.2%1.3%210 tps0.7s1M$0.10$0.40
8290Grok 3 Fast1102±142.5K4.7%1.7%52 tps2.4s131K$5.00$25.00
8390GPT-4o1102±58.5K3.7%1.0%49 tps2.4s128K$3.71$12.57
8490DeepSeek V3 03241100±415.1K4.3%5.8%12 tps2.7s164K$0.38$0.93
8590Qwen3 Coder 480B A35B Instruct1099±83.1K4.5%3.3%61 tps2.0s262K$0.71$1.34
8698Gemini 2.5 Flash1098±435.9K3.2%1.3%2 tps3.7s1M$0.30$2.50
8798Grok 31098±419.1K5.5%1.5%53 tps0.6s1M$3.67$18.33
8898DeepSeek V3 0324 Turbo1093±515.5K5.7%6.3%12 tps2.4s164K$0.73$1.79
8998OpenAI o3-pro1090±85.4K4.3%5.2%22 tps70.8s200K$20.00$80.00
9098DeepSeek V3.11089±122.3K4.7%0.8%197 tps0.4s164K$0.55$1.60
91105GPT-4.1 mini1087±519.7K4.2%1.1%67 tps0.9s1M$0.34$1.60
92105GPT-4.1 nano1085±517K5.0%0.6%175 tps0.5s1M$0.10$0.40
93105Qwen3 Omni 30B A3B Instruct1085±137754.3%3.9%65 tps1.2s66K$0.35$0.97
94105DeepSeek V3 (Turbo)1082±201.5K5.1%1.5%32 tps1.5s64K$0.40$1.30
95105Seed 1.8 2512281081±103.2K3.1%3.7%41 tps2.1s256K$0.25$2.00
96105Mistral Medium1080±49.6K5.6%1.8%48 tps0.6s33K$1.48$4.55
97105Qwen3 Max Thinking1080±181.5K2.0%13.5%32 tps2.3s256K$1.20$6.00
98112GLM 4.51075±56K7.0%3.7%46 tps1.4s131K$0.43$1.63
99112Kimi K2 09051074±78.7K4.3%4.0%30 tps1.4s262K$0.63$2.39
100112GPT-5 (Low)1070±146903.5%1.8%75 tps8.2s400K$1.25$10.00
101112Kimi K2 0905 Turbo1070±67.5K9.1%0.7%373 tps0.5s262K$1.70$6.50
102112Grok 4.20 Beta Non-reasoning1063±365004.8%1.1%151 tps0.6s2M$2.00$6.00
103119OpenAI o11062±69.9K3.3%4.2%92 tps5.5s200K$15.00$60.00
104119OpenAI o1-pro1061±206807.5%5.2%33 tps72.8s200K$150.00$600.00
105119Gemini 2.5 Flash Lite Thinking1061±49.8K6.2%1.0%118 tps4.4s1M$0.03$0.13
106119Seed 2.0 Lite (Medium)1058±205253.7%6.6%33 tps1.6s256K$0.25$2.00
107119LongCat Flash Chat1058±122.7K5.9%0.8%85 tps0.9s131K$0.14$0.68
108119GPT-5.1 Codex Mini (Medium)1057±151.9K4.9%4.6%69 tps4.1s400K$0.25$2.00
109119GPT-5.1 Codex Mini (High)1054±152.2K3.9%5.9%70 tps4.6s400K$0.25$2.00
110128ERNIE 4.5 300B A47B1049±413.5K3.9%4.7%23 tps2.3s123K$0.28$1.10
111128Cogito v2.1 671B1044±191.2K4.6%0.8%85 tps0.5s128K$1.25$1.25
112128Qwen3 32B1044±198506.6%3.9%30 tps3.1s41K$0.12$0.42
113128GLM 4.5 AirX1042±151.1K6.9%3.3%75 tps1.2s131K$1.10$4.50
114128Kimi K2 Thinking1042±103.3K5.1%4.2%61 tps5.9s262K$0.24$1.03
115128OpenAI o4-mini1042±58.5K6.4%1.4%97 tps7.0s128K$1.10$4.40
116128Gemini 3.1 Flash Lite Preview Thinking1039±161.4K4.2%1.7%75 tps4.7s1M$0.25$1.50
117135Qwen3 Next 80B A3B Thinking1035±56.2K7.4%0.6%175 tps1.3s256K$0.21$2.26
118135Gemini 2.5 Flash Lite Thinking Preview 09251035±75.8K6.8%1.5%152 tps3.0s1M$0.10$0.40
119135Gemini 3.1 Flash Lite Preview1034±219804.4%1.0%8 tps1.2s1M$0.25$1.50
120135DeepSeek V3.2 Speciale1030±102.3K6.1%6.0%43 tps1.4s131K$0.84$1.52
View All (210 models)