Models

Last updated about 1 month ago

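| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | 164 | Arcee AI Maestro Reasoning | 1046 | ±7 | 3.8K | 4.6% | <0.1% | 85 tps | 0.3s | 131K | $0.90 | $3.30 |
| 242 | 128 | ERNIE 4.5 300B A47B | 1049 | ±4 | 13.5K | 3.9% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 243 | 151 | GLM 4.5 X | 1051 | ±16 | 645 | 5.8% | <0.1% | 48 tps | 2.8s | 131K | $2.20 | $8.90 |
| 244 | 119 | Qwen3 32B Fast | 1052 | ±6 | 11.4K | 5.2% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 245 | 119 | GPT-5.1 Codex Mini (High) | 1054 | ±15 | 2.2K | 3.9% | 5.9% | 70 tps | 4.6s | 400K | $0.25 | $2.00 |
| 246 | 119 | GPT-5.1 Codex Mini (Medium) | 1057 | ±15 | 1.9K | 4.9% | 4.6% | 69 tps | 4.1s | 400K | $0.25 | $2.00 |
| 247 | 151 | OpenAI Codex Mini | 1057 | ±5 | 9.8K | 3.3% | <0.1% | 46 tps | 2.1s | 200K | $1.50 | $6.00 |
| 248 | 119 | LongCat Flash Chat | 1058 | ±12 | 2.7K | 5.9% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 249 | 119 | Seed 2.0 Lite (Medium) | 1058 | ±20 | 525 | 3.7% | 6.6% | 33 tps | 1.6s | 256K | $0.25 | $2.00 |
| 250 | 151 | Llama 3 8B Turbo | 1059 | ±24 | 600 | 1.6% | <0.1% | 97 tps | 0.1s | 8K | $0.12 | $0.13 |
| 251 | 151 | GLM 4.5 FP8 | 1060 | ±18 | 610 | 8.3% | <0.1% | 59 tps | 1.2s | 131K | $0.41 | $1.65 |
| 252 | 119 | Gemini 2.5 Flash Lite Thinking | 1061 | ±4 | 9.8K | 6.2% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13 |
| 253 | 119 | OpenAI o1-pro | 1061 | ±20 | 680 | 7.5% | 5.2% | 33 tps | 72.8s | 200K | $150.00 | $600.00 |
| 254 | 119 | DeepSeek V3.1 Terminus Thinking | 1061 | ±9 | 2.9K | 9.4% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 255 | 119 | OpenAI o1 | 1062 | ±6 | 9.9K | 3.3% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 256 | 112 | Grok 4.20 Beta Non-reasoning | 1063 | ±36 | 500 | 4.8% | 1.1% | 151 tps | 0.6s | 2M | $2.00 | $6.00 |
| 257 | 144 | Qwen Turbo | 1064 | ±5 | 10K | 6.0% | <0.1% | 53 tps | 1.1s | 1M | $0.05 | $0.20 |
| 258 | 112 | gpt-oss-20b | 1066 | ±6 | 7.7K | 7.1% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 259 | 112 | Kimi K2 0905 Turbo | 1070 | ±6 | 7.5K | 9.1% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50 |
| 260 | 112 | GPT-5 (Low) | 1070 | ±14 | 690 | 3.5% | 1.8% | 75 tps | 8.2s | 400K | $1.25 | $10.00 |
| 261 | 112 | Kimi K2 Fast | 1073 | ±5 | 35K | 6.4% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 262 | 112 | Kimi K2 0905 | 1074 | ±7 | 8.7K | 4.3% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 263 | 112 | GLM 4.5 | 1075 | ±5 | 6K | 7.0% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 264 | 105 | Qwen3 Max Thinking | 1080 | ±18 | 1.5K | 2.0% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 265 | 105 | Mistral Medium | 1080 | ±4 | 9.6K | 5.6% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 266 | 105 | Seed 1.8 251228 | 1081 | ±10 | 3.2K | 3.1% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 267 | 132 | Solar Pro 2 250710 | 1081 | ±5 | 10.6K | 6.9% | <0.1% | 9 tps | N/A | 66K | $0.50 | $0.50 |
| 268 | 105 | DeepSeek V3 (Turbo) | 1082 | ±20 | 1.5K | 5.1% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 269 | 105 | Qwen3 Omni 30B A3B Instruct | 1085 | ±13 | 775 | 4.3% | 3.9% | 65 tps | 1.2s | 66K | $0.35 | $0.97 |
| 270 | 105 | GPT-4.1 nano | 1085 | ±5 | 17K | 5.0% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 271 | 105 | GPT-4.1 mini | 1087 | ±5 | 19.7K | 4.2% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 272 | 132 | Qwen Plus 0728 (Thinking) | 1087 | ±9 | 1.2K | 8.9% | <0.1% | 56 tps | 1.1s | 1M | $0.40 | $4.00 |
| 273 | 132 | Claude Sonnet 3.5 | 1088 | ±10 | 2.9K | 4.9% | 1.0% | 40 tps | 2.7s | 200K | $3.00 | $15.00 |
| 274 | 98 | DeepSeek V3.2 Exp Thinking | 1089 | ±7 | 5.9K | 3.5% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 275 | 98 | DeepSeek V3.1 | 1089 | ±12 | 2.3K | 4.7% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 276 | 98 | OpenAI o3-pro | 1090 | ±8 | 5.4K | 4.3% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 277 | 123 | Sherlock Dash Alpha | 1090 | ±19 | 835 | 6.7% | <0.1% | 68 tps | 0.7s | 2M | $0 | $0 |
| 278 | 123 | Nova Experimental Chat 10-09 | 1091 | ±7 | 3.2K | 10.7% | <0.1% | 59 tps | 6.1s | 98K | $0 | $0 |
| 279 | 98 | Qwen3 235B A22B | 1093 | ±6 | 4.5K | 8.0% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 280 | 98 | DeepSeek V3 0324 Turbo | 1093 | ±5 | 15.5K | 5.7% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |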
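
Many adjacent rows have VIBE Scores that differ by less than their confidence intervals, so small rank differences may not be meaningful. The sketch below is one rough way to check this. It assumes the ± value is a symmetric half-width around the score, which the page does not state, and it treats interval overlap as an informal heuristic rather than a significance test. The `Entry` class and `intervals_overlap` function are hypothetical names used only for illustration; the numbers are read from rows 241, 242, and 280.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    name: str
    score: int       # VIBE Score column
    half_width: int  # the ± value, ASSUMED to be a symmetric half-width

    @property
    def low(self) -> int:
        return self.score - self.half_width

    @property
    def high(self) -> int:
        return self.score + self.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score intervals share at least one point."""
    return a.low <= b.high and b.low <= a.high


arcee = Entry("Arcee AI Maestro Reasoning", 1046, 7)   # interval 1039..1053
ernie = Entry("ERNIE 4.5 300B A47B", 1049, 4)          # interval 1045..1053
deepseek = Entry("DeepSeek V3 0324 Turbo", 1093, 5)    # interval 1088..1098

print(intervals_overlap(arcee, ernie))     # True: ranks 241 and 242 overlap
print(intervals_overlap(arcee, deepseek))  # False: ranks 241 and 280 do not
```

Under these assumptions, the gap between ranks 241 and 242 falls within the stated uncertainty, while the gap between ranks 241 and 280 does not.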
View All (404 models)