
Last updated about 1 month ago

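| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 56 | Gemini 2.5 Pro Low | 1123 | ±2 | 21K | 6.5% | <0.1% | 89 tps | 2.4s | 1M | $1.25 | $10.00 |
| 82 | 84 | Nova Experimental Chat 10-09 | 1122 | ±3 | 9.3K | 7.2% | <0.1% | 59 tps | 6.1s | 98K | $0 | $0 |
| 83 | 65 | Mistral Large 3 | 1122 | ±3 | 14.3K | 3.3% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 84 | 79 | MiniMax M2.5 Lightning | 1121 | ±4 | 5.6K | 1.3% | 1.5% | 51 tps | 2.0s | 205K | $0.60 | $2.40 |
| 85 | 52 | GPT-5 | 1119 | ±2 | 44.3K | 3.9% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 86 | 95 | Qwen3 32B | 1117 | ±6 | 3.3K | 2.8% | 3.9% | 30 tps | 3.1s | 41K | $0.12 | $0.42 |
| 87 | 100 | Qwen Plus 0728 (Thinking) | 1112 | ±3 | 3.7K | 4.4% | <0.1% | 56 tps | 1.1s | 1M | $0.40 | $4.00 |
| 88 | 86 | DeepSeek V3.1 Nex N1 | 1112 | ±6 | 2.1K | 1.7% | 3.4% | 24 tps | 7.2s | 131K | $0.14 | $0.50 |
| 89 | 65 | GLM 4.6 | 1108 | ±3 | 25.8K | 4.3% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 90 | 77 | GPT-4.5 Preview | 1106 | ±4 | 7K | 1.2% | <0.1% | 36 tps | 3.0s | 200K | $75.00 | $150.00 |
| 91 | 84 | MiniMax M2.5 | 1105 | ±8 | 2.1K | 1.6% | 1.4% | 70 tps | 1.9s | 205K | $0.28 | $1.20 |
| 92 | 77 | Claude Opus 4.1 | 1104 | ±2 | 11.6K | 3.4% | 3.0% | 17 tps | 3.7s | 200K | $15.00 | $75.00 |
| 93 | 71 | Seed 1.8 251228 | 1104 | ±3 | 19K | 1.5% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 94 | 86 | DeepSeek V3.1 Chat | 1102 | ±3 | 13.4K | 4.1% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 95 | 60 | Gemini 2.5 Flash Preview 0925 | 1102 | ±2 | 19.5K | 4.3% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 96 | 84 | Claude Sonnet 3.7 (Thinking) | 1101 | ±3 | 17.2K | 2.7% | <0.1% | 41 tps | 2.6s | 200K | $3.00 | $15.00 |
| 97 | 68 | GLM 4.7 | 1101 | ±3 | 35.7K | 2.1% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 98 | 68 | Grok 4 | 1100 | ±1 | 120.3K | 2.1% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 99 | 101 | DeepSeek V3 (Turbo) | 1100 | ±3 | 4.8K | 2.5% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 100 | 86 | Amazon Nova 2 Lite | 1099 | ±3 | 12.6K | 3.1% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 101 | 68 | Qwen Plus (Aug'24) | 1098 | ±2 | 60.9K | 2.4% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 102 | 62 | GPT-5.1 Instant | 1098 | ±2 | 21.7K | 2.4% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 103 | 81 | Qwen3.5 27B | 1097 | ±6 | 2.3K | 2.4% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 104 | 80 | GPT-5 (Minimal) | 1096 | ±2 | 18.1K | 6.0% | <0.1% | 67 tps | 1.4s | 400K | $1.25 | $10.00 |
| 105 | 100 | Gemini 2.5 Flash Preview | 1095 | ±3 | 12.4K | 0.7% | <0.1% | 138 tps | 6.9s | 1M | $0.15 | $0.60 |
| 106 | 111 | Solar Pro 3 (Reasoning) | 1095 | ±6 | 3.5K | 1.4% | 3.2% | 118 tps | 1.2s | 131K | $0.15 | $0.60 |
| 107 | 101 | gpt-oss-20b | 1093 | ±2 | 20.3K | 4.6% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 108 | 71 | Qwen3.5 397B A17B | 1092 | ±5 | 7.2K | 1.8% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 109 | 104 | Grok 3 Beta | 1092 | ±3 | 8.1K | 0.6% | <0.1% | 58 tps | 0.8s | 131K | $3.00 | $15.00 |
| 110 | 86 | Qwen3 235B A22B | 1090 | ±3 | 11.9K | 5.1% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 111 | 79 | Qwen3 Max Thinking Preview | 1089 | ±2 | 17.8K | 3.3% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 112 | 81 | GPT-4o | 1088 | ±2 | 30.3K | 2.1% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 113 | 133 | Nemotron 3 Nano | 1087 | ±5 | 1.9K | 2.5% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 114 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1084 | ±6 | 3.6K | 3.1% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 115 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1083 | ±2 | 20.9K | 4.8% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 116 | 86 | Seed 2.0 Lite (Medium) | 1082 | ±6 | 2.1K | 1.9% | 6.6% | 33 tps | 1.6s | 256K | $0.25 | $2.00 |
| 117 | 111 | LongCat Flash Chat | 1082 | ±4 | 6.5K | 3.2% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 118 | 71 | GPT-5 Mini | 1082 | ±2 | 17.5K | 4.3% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 119 | 108 | GPT-5 Mini Low | 1081 | ±3 | 5.9K | 5.9% | <0.1% | 69 tps | 3.2s | 400K | $0.25 | $2.00 |
| 120 | 84 | GPT-5 Mini Minimal | 1081 | ±3 | 6.8K | 6.5% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |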
Showing ranks 81 to 120 of 432 models.