Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

999
OpenAI o3-mini-high
1000
Qwen3 235B A22B Thinking 2507
1001
Qwen 2.5 VL 32B Instruct
1007
Qwen3 Coder Plus
1009
Qwen3 VL 235B A22B Thinking
1018
Gemini 2.0 Flash
1020
OpenAI o3
1021
DeepSeek V3.1 Nex N1
1026
Amazon Nova 2 Lite
1029
Gemini 2.0 Flash Lite
1030
DeepSeek V3.2 Speciale
1034
Gemini 3.1 Flash Lite Preview
1035
Gemini 2.5 Flash Lite Thinking Preview 0925
1035
Qwen3 Next 80B A3B Thinking
1039
Gemini 3.1 Flash Lite Preview Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81148OpenAI o3-mini-high999±58.3K4.1%2.4%231 tps10.5s200K$1.10$4.40
82148Qwen3 235B A22B Thinking 25071000±102.8K4.4%2.5%53 tps1.6s131K$0.59$5.70
83148Qwen 2.5 VL 32B Instruct1001±218654.9%6.3%43 tps3.2s128K$0.35$0.62
84148Qwen3 Coder Plus1007±226104.7%5.1%56 tps2.3s128K$1.80$9.80
85148Qwen3 VL 235B A22B Thinking1009±64.6K8.3%4.3%47 tps3.0s127K$0.47$3.31
86144Gemini 2.0 Flash1018±78.2K3.8%<0.1%76 tps0.5s1M$0.14$0.56
87144OpenAI o31020±75.9K4.0%0.9%85 tps6.8s128K$7.33$29.33
88144DeepSeek V3.1 Nex N11021±195655.0%3.4%24 tps7.2s131K$0.14$0.50
89135Amazon Nova 2 Lite1026±103.6K6.0%1.0%137 tps0.6s300K$0.35$2.95
90135Gemini 2.0 Flash Lite1029±514.7K9.5%<0.1%42 tps0.5s1M$0.08$0.30
91135DeepSeek V3.2 Speciale1030±102.3K6.1%6.0%43 tps1.4s131K$0.84$1.52
92135Gemini 3.1 Flash Lite Preview1034±219804.4%1.0%8 tps1.2s1M$0.25$1.50
93135Gemini 2.5 Flash Lite Thinking Preview 09251035±75.8K6.8%1.5%152 tps3.0s1M$0.10$0.40
94135Qwen3 Next 80B A3B Thinking1035±56.2K7.4%0.6%175 tps1.3s256K$0.21$2.26
95128Gemini 3.1 Flash Lite Preview Thinking1039±161.4K4.2%1.7%75 tps4.7s1M$0.25$1.50
96128OpenAI o4-mini1042±58.5K6.4%1.4%97 tps7.0s128K$1.10$4.40
97128Kimi K2 Thinking1042±103.3K5.1%4.2%61 tps5.9s262K$0.24$1.03
98128GLM 4.5 AirX1042±151.1K6.9%3.3%75 tps1.2s131K$1.10$4.50
99128Qwen3 32B1044±198506.6%3.9%30 tps3.1s41K$0.12$0.42
100128Cogito v2.1 671B1044±191.2K4.6%0.8%85 tps0.5s128K$1.25$1.25
101128ERNIE 4.5 300B A47B1049±413.5K3.9%4.7%23 tps2.3s123K$0.28$1.10
102119GPT-5.1 Codex Mini (High)1054±152.2K3.9%5.9%70 tps4.6s400K$0.25$2.00
103119GPT-5.1 Codex Mini (Medium)1057±151.9K4.9%4.6%69 tps4.1s400K$0.25$2.00
104119LongCat Flash Chat1058±122.7K5.9%0.8%85 tps0.9s131K$0.14$0.68
105119Seed 2.0 Lite (Medium)1058±205253.7%6.6%33 tps1.6s256K$0.25$2.00
106119Gemini 2.5 Flash Lite Thinking1061±49.8K6.2%1.0%118 tps4.4s1M$0.03$0.13
107119OpenAI o1-pro1061±206807.5%5.2%33 tps72.8s200K$150.00$600.00
108119OpenAI o11062±69.9K3.3%4.2%92 tps5.5s200K$15.00$60.00
109112Grok 4.20 Beta Non-reasoning1063±365004.8%1.1%151 tps0.6s2M$2.00$6.00
110112Kimi K2 0905 Turbo1070±67.5K9.1%0.7%373 tps0.5s262K$1.70$6.50
111112GPT-5 (Low)1070±146903.5%1.8%75 tps8.2s400K$1.25$10.00
112112Kimi K2 09051074±78.7K4.3%4.0%30 tps1.4s262K$0.63$2.39
113112GLM 4.51075±56K7.0%3.7%46 tps1.4s131K$0.43$1.63
114105Qwen3 Max Thinking1080±181.5K2.0%13.5%32 tps2.3s256K$1.20$6.00
115105Mistral Medium1080±49.6K5.6%1.8%48 tps0.6s33K$1.48$4.55
116105Seed 1.8 2512281081±103.2K3.1%3.7%41 tps2.1s256K$0.25$2.00
117105DeepSeek V3 (Turbo)1082±201.5K5.1%1.5%32 tps1.5s64K$0.40$1.30
118105Qwen3 Omni 30B A3B Instruct1085±137754.3%3.9%65 tps1.2s66K$0.35$0.97
119105GPT-4.1 nano1085±517K5.0%0.6%175 tps0.5s1M$0.10$0.40
120105GPT-4.1 mini1087±519.7K4.2%1.1%67 tps0.9s1M$0.34$1.60
View All (210 models)