Models

Last updated about 1 month ago

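Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output)
121 | 105 | Qwen3 Omni 30B A3B Instruct | 1085 | ±13 | 775 | 4.3% | 3.9% | 65 tps | 1.2s | 66K | $0.35 | $0.97
122 | 105 | DeepSeek V3 (Turbo) | 1082 | ±20 | 1.5K | 5.1% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30
123 | 132 | Solar Pro 2 250710 | 1081 | ±5 | 10.6K | 6.9% | <0.1% | 9 tps | N/A | 66K | $0.50 | $0.50
124 | 105 | Seed 1.8 251228 | 1081 | ±10 | 3.2K | 3.1% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00
125 | 105 | Mistral Medium | 1080 | ±4 | 9.6K | 5.6% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55
126 | 105 | Qwen3 Max Thinking | 1080 | ±18 | 1.5K | 2.0% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00
127 | 112 | GLM 4.5 | 1075 | ±5 | 6K | 7.0% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63
128 | 112 | Kimi K2 0905 | 1074 | ±7 | 8.7K | 4.3% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39
129 | 112 | GPT-5 (Low) | 1070 | ±14 | 690 | 3.5% | 1.8% | 75 tps | 8.2s | 400K | $1.25 | $10.00
130 | 112 | Kimi K2 0905 Turbo | 1070 | ±6 | 7.5K | 9.1% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50
131 | 144 | Qwen Turbo | 1064 | ±5 | 10K | 6.0% | <0.1% | 53 tps | 1.1s | 1M | $0.05 | $0.20
132 | 112 | Grok 4.20 Beta Non-reasoning | 1063 | ±36 | 500 | 4.8% | 1.1% | 151 tps | 0.6s | 2M | $2.00 | $6.00
133 | 119 | OpenAI o1 | 1062 | ±6 | 9.9K | 3.3% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00
134 | 119 | OpenAI o1-pro | 1061 | ±20 | 680 | 7.5% | 5.2% | 33 tps | 72.8s | 200K | $150.00 | $600.00
135 | 119 | Gemini 2.5 Flash Lite Thinking | 1061 | ±4 | 9.8K | 6.2% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13
136 | 151 | GLM 4.5 FP8 | 1060 | ±18 | 610 | 8.3% | <0.1% | 59 tps | 1.2s | 131K | $0.41 | $1.65
137 | 119 | Seed 2.0 Lite (Medium) | 1058 | ±20 | 525 | 3.7% | 6.6% | 33 tps | 1.6s | 256K | $0.25 | $2.00
138 | 119 | LongCat Flash Chat | 1058 | ±12 | 2.7K | 5.9% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68
139 | 151 | OpenAI Codex Mini | 1057 | ±5 | 9.8K | 3.3% | <0.1% | 46 tps | 2.1s | 200K | $1.50 | $6.00
140 | 119 | GPT-5.1 Codex Mini (Medium) | 1057 | ±15 | 1.9K | 4.9% | 4.6% | 69 tps | 4.1s | 400K | $0.25 | $2.00
141 | 119 | GPT-5.1 Codex Mini (High) | 1054 | ±15 | 2.2K | 3.9% | 5.9% | 70 tps | 4.6s | 400K | $0.25 | $2.00
142 | 151 | GLM 4.5 X | 1051 | ±16 | 645 | 5.8% | <0.1% | 48 tps | 2.8s | 131K | $2.20 | $8.90
143 | 128 | ERNIE 4.5 300B A47B | 1049 | ±4 | 13.5K | 3.9% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10
144 | 164 | Arcee AI Maestro Reasoning | 1046 | ±7 | 3.8K | 4.6% | <0.1% | 85 tps | 0.3s | 131K | $0.90 | $3.30
145 | 128 | Cogito v2.1 671B | 1044 | ±19 | 1.2K | 4.6% | 0.8% | 85 tps | 0.5s | 128K | $1.25 | $1.25
146 | 128 | Qwen3 32B | 1044 | ±19 | 850 | 6.6% | 3.9% | 30 tps | 3.1s | 41K | $0.12 | $0.42
147 | 164 | Grok 4 0709 EU | 1043 | ±11 | 1.3K | 5.7% | <0.1% | 33 tps | 8.2s | 128K | $3.00 | $15.00
148 | 128 | GLM 4.5 AirX | 1042 | ±15 | 1.1K | 6.9% | 3.3% | 75 tps | 1.2s | 131K | $1.10 | $4.50
149 | 128 | Kimi K2 Thinking | 1042 | ±10 | 3.3K | 5.1% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03
150 | 128 | OpenAI o4-mini | 1042 | ±5 | 8.5K | 6.4% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40
151 | 164 | EXAONE Deep 32B | 1040 | ±14 | 880 | 1.7% | <0.1% | 24 tps | N/A | 33K | $0 | $0
152 | 128 | Gemini 3.1 Flash Lite Preview Thinking | 1039 | ±16 | 1.4K | 4.2% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50
153 | 135 | Qwen3 Next 80B A3B Thinking | 1035 | ±5 | 6.2K | 7.4% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26
154 | 135 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1035 | ±7 | 5.8K | 6.8% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40
155 | 135 | Gemini 3.1 Flash Lite Preview | 1034 | ±21 | 980 | 4.4% | 1.0% | 8 tps | 1.2s | 1M | $0.25 | $1.50
156 | 135 | DeepSeek V3.2 Speciale | 1030 | ±10 | 2.3K | 6.1% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52
157 | 135 | Gemini 2.0 Flash Lite | 1029 | ±5 | 14.7K | 9.5% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30
158 | 174 | Claude Haiku 3.5 | 1028 | ±6 | 6.4K | 4.9% | 0.8% | 40 tps | 2.8s | 200K | $0.80 | $4.00
159 | 135 | Amazon Nova 2 Lite | 1026 | ±10 | 3.6K | 6.0% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95
160 | 144 | DeepSeek V3.1 Nex N1 | 1021 | ±19 | 565 | 5.0% | 3.4% | 24 tps | 7.2s | 131K | $0.14 | $0.50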
Showing ranks 121 to 160 of 305 models.