Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1155
Claude Opus 4.5 (Thinking)
1151
Grok 4.1 Fast Non-Reasoning
1150
Qwen Plus (Aug'24)
1150
gpt-oss-20b
1149
Arcee AI Maestro Reasoning
1147
Qwen3.5 122B A17B
1144
Claude Opus 4.5
1141
Qwen Plus 0728 (Thinking)
1140
Kimi K2.5 Instant
1140
Step 3.5 Flash
1139
Qwen3 Omni 30B A3B Thinking
1138
Nova Experimental Chat 12-10
1136
Solar Pro 3 (Reasoning)
1136
Claude Sonnet 4.5 (Thinking)
1131
GPT-5.2 (Extra High)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
417Claude Opus 4.5 (Thinking)1155±55.3K1.6%1.8%49 tps1.4s200K$5.00$25.00
4226Grok 4.1 Fast Non-Reasoning1151±63.2K1.8%0.9%101 tps0.5s2M$0.20$0.50
4368Qwen Plus (Aug'24)1150±57.5K1.4%1.4%53 tps1.3s30K$0.40$1.20
44101gpt-oss-20b1150±74K1.7%0.5%216 tps0.5s131K$0.06$0.26
45147Arcee AI Maestro Reasoning1149±72K1.4%<0.1%85 tps0.3s131K$0.90$3.30
4652Qwen3.5 122B A17B1147±137651.9%1.5%82 tps1.4s256K$0.40$3.20
4717Claude Opus 4.51144±82.4K2.1%1.5%45 tps1.5s200K$5.00$25.00
48100Qwen Plus 0728 (Thinking)1141±145002.0%<0.1%56 tps1.1s1M$0.40$4.00
4937Kimi K2.5 Instant1140±101.1K1.4%2.9%32 tps3.0s262K$0.50$3.00
5048Step 3.5 Flash1140±159650.5%2.2%109 tps0.6s256K$0.05$0.15
5137Qwen3 Omni 30B A3B Thinking1139±71.6K1.2%3.7%67 tps1.2s66K$0.97$1.79
5229Nova Experimental Chat 12-101138±91.9K0.5%2.4%84 tps12.9s98K$0$0
53111Solar Pro 3 (Reasoning)1136±138301.2%3.2%118 tps1.2s131K$0.15$0.60
5410Claude Sonnet 4.5 (Thinking)1136±56.8K2.7%1.9%44 tps1.1s200K$3.00$15.00
5542GPT-5.2 (Extra High) 1131±53.7K0.9%13.2%17 tps20.5s400K$1.75$14.00
5626Claude Haiku 4.5 (Extended Thinking)1129±53.6K1.6%1.4%115 tps0.7s200K$1.00$5.00
57213DeepSeek R1T Chimera1128±81.9K2.5%<0.1%46 tps1.1s164K$0.09$0.36
5842Qwen3 Max Instruct Preview1126±44.3K2.8%1.1%31 tps1.7s256K$1.43$6.61
5944Gemini 2.5 Pro1126±416.2K1.5%2.3%45 tps2.6s1M$1.25$10.00
6077Claude Opus 4.11122±91.3K2.3%3.0%17 tps3.7s200K$15.00$75.00
6144Grok 4.1 Fast Reasoning1119±65.4K1.5%1.5%58 tps7.3s2M$0.20$0.50
6237Claude Sonnet 4.51116±65K3.1%1.4%41 tps1.3s200K$1.80$9.00
6356DeepSeek V3.1 Turbo1114±64K2.1%0.9%173 tps1.3s164K$2.00$3.75
6440DeepSeek V3.21113±53.6K0.8%1.4%83 tps5.1s131K$0.43$1.09
65101Gemini 2.5 Flash Lite1112±67.6K1.7%1.3%210 tps0.7s1M$0.10$0.40
6693Qwen Max1111±67.6K1.4%1.5%49 tps1.5s33K$1.60$6.40
6795Qwen3 32B1111±175151.9%3.9%30 tps3.1s41K$0.12$0.42
6822GLM 51110±71.8K0.8%3.4%36 tps2.7s200K$0.72$2.55
6984GPT-5 Mini Minimal1107±109703.5%1.2%63 tps1.4s400K$0.25$2.00
7093DeepSeek V3 0324 Turbo1103±54.4K1.9%6.3%12 tps2.4s164K$0.73$1.79
71121Qwen3 32B Fast1098±89K1.0%11.6%30 tps3.1s41K$0.10$0.25
7286DeepSeek V3.1 Chat1097±71.9K2.3%2.8%21 tps1.6s131K$0.38$1.00
7356DeepSeek V3.2 Thinking1096±63.8K0.9%9.0%30 tps2.6s131K$0.28$0.42
74111LongCat Flash Chat1095±71.7K2.8%0.8%85 tps0.9s131K$0.14$0.68
7584Nova Experimental Chat 10-091093±101.3K7.4%<0.1%59 tps6.1s98K$0$0
76121QwQ 32B1091±59.9K0.9%5.4%41 tps2.1s16K$0.43$0.56
7733Kimi K2.51090±64.5K0.7%6.5%33 tps1.7s262K$0.34$2.57
7886Nemotron 3 Nano (Thinking)1089±91.5K0.7%2.0%200 tps0.5s256K$0$0
7962GPT-5.1 Instant1085±63.7K1.1%1.3%50 tps1.9s400K$1.25$10.00
80106DeepSeek V3 03241084±45.7K1.4%5.8%12 tps2.7s164K$0.38$0.93
View All (260 models)