Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1117
Gemini 2.5 Flash Lite Preview 0925
1115
Qwen3 Max Thinking
1115
Gemini 2.5 Flash Preview 0925
1115
Gemini 2.5 Flash
1113
DeepSeek V3 0324
1111
DeepSeek V3 0324 Turbo
1108
DeepSeek V3.1 Chat
1104
GPT-5.1 Codex Mini (Medium)
1101
Nova Experimental Chat 12-10
1095
Qwen3.5 35B A3B
1091
Qwen3 Omni 30B A3B Thinking
1090
GPT-4o
1086
GPT-4.1 nano
1085
Qwen3 Coder 480B A35B Instruct
1085
Gemini 3.1 Flash Lite Preview Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8177Gemini 2.5 Flash Lite Preview 09251117±84.3K7.4%1.2%209 tps0.7s1M$0.25$0.35
82105Qwen3 Max Thinking1115±239452.1%13.5%32 tps2.3s256K$1.20$6.00
8374Gemini 2.5 Flash Preview 09251115±114.1K6.8%1.2%5 tps0.9s1M$0.13$0.97
8498Gemini 2.5 Flash1115±517.3K3.5%1.3%2 tps3.7s1M$0.30$2.50
8590DeepSeek V3 03241113±77.7K5.1%5.8%12 tps2.7s164K$0.38$0.93
8698DeepSeek V3 0324 Turbo1111±56.5K6.3%6.3%12 tps2.4s164K$0.73$1.79
8785DeepSeek V3.1 Chat1108±122.4K9.3%2.8%21 tps1.6s131K$0.38$1.00
88119GPT-5.1 Codex Mini (Medium)1104±151.6K4.6%4.6%69 tps4.1s400K$0.25$2.00
8949Nova Experimental Chat 12-101101±151.4K5.3%2.4%84 tps12.9s98K$0$0
9069Qwen3.5 35B A3B1095±306753.6%2.1%116 tps2.1s256K$0.63$1.13
9185Qwen3 Omni 30B A3B Thinking1091±121.7K6.5%3.7%67 tps1.2s66K$0.97$1.79
9290GPT-4o1090±86.5K3.8%1.0%49 tps2.4s128K$3.71$12.57
93105GPT-4.1 nano1086±79.8K4.1%0.6%175 tps0.5s1M$0.10$0.40
9490Qwen3 Coder 480B A35B Instruct1085±141.8K4.0%3.3%61 tps2.0s262K$0.71$1.34
95128Gemini 3.1 Flash Lite Preview Thinking1085±311.1K4.0%1.7%75 tps4.7s1M$0.25$1.50
9698OpenAI o3-pro1085±152.8K3.8%5.2%22 tps70.8s200K$20.00$80.00
9790Gemini 2.5 Flash Lite1083±711.8K6.9%1.3%210 tps0.7s1M$0.10$0.40
98105Qwen3 Omni 30B A3B Instruct1083±216004.0%3.9%65 tps1.2s66K$0.35$0.97
9985GPT-5 Mini Minimal1082±122.3K9.3%1.2%63 tps1.4s400K$0.25$2.00
10098DeepSeek V3.11078±151.8K4.8%0.8%197 tps0.4s164K$0.55$1.60
10177DeepSeek V3.1 Turbo1068±162.8K6.1%0.9%173 tps1.3s164K$2.00$3.75
102119GPT-5.1 Codex Mini (High)1065±191.9K3.6%5.9%70 tps4.6s400K$0.25$2.00
10377Mistral Large 31065±103.2K5.6%2.1%51 tps1.0s256K$0.50$1.50
10490DeepSeek V3.2 Exp Chat1065±142.2K9.0%2.6%29 tps1.5s131K$0.27$0.39
105119Gemini 2.5 Flash Lite Thinking1064±87.4K6.8%1.0%118 tps4.4s1M$0.03$0.13
106112Kimi K2 Fast1061±516.4K9.2%0.8%365 tps0.5s131K$1.00$3.00
107148Qwen3 VL 235B A22B Thinking1055±122.5K9.4%4.3%47 tps3.0s127K$0.47$3.31
108105Mistral Medium1053±75.5K4.6%1.8%48 tps0.6s33K$1.48$4.55
109128Cogito v2.1 671B1053±219205.2%0.8%85 tps0.5s128K$1.25$1.25
11098Qwen3 235B A22B1050±93.5K8.8%5.3%71 tps0.9s41K$0.23$0.63
111135Gemini 2.0 Flash Lite1050±89.1K3.4%<0.1%42 tps0.5s1M$0.08$0.30
112112Kimi K2 0905 Turbo1050±133.2K13.0%0.7%373 tps0.5s262K$1.70$6.50
113112Kimi K2 09051047±131.8K8.5%4.0%30 tps1.4s262K$0.63$2.39
114105GPT-4.1 mini1046±88.7K4.0%1.1%67 tps0.9s1M$0.34$1.60
11598DeepSeek V3.2 Exp Thinking1046±171.7K6.2%7.2%26 tps3.0s131K$0.28$0.42
116128Qwen3 32B1046±345357.8%3.9%30 tps3.1s41K$0.12$0.42
117128ERNIE 4.5 300B A47B1044±78.7K3.6%4.7%23 tps2.3s123K$0.28$1.10
118135Qwen3 Next 80B A3B Thinking1043±152.7K11.3%0.6%175 tps1.3s256K$0.21$2.26
119119Qwen3 32B Fast1043±106.6K6.4%11.6%30 tps3.1s41K$0.10$0.25
120135Qwen3 VL 30B A3B Instruct1042±218607.0%1.8%80 tps2.6s129K$0.18$0.67
View All (273 models)