Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1113
Gemini 2.5 Flash Lite
1116
Gemini 2.5 Flash Lite Preview 0925
1119
MiniMax M2.5 Lightning
1121
LongCat Flash Chat
1125
DeepSeek V3.2
1131
GPT-5 (High)
1145
Kimi K2 0905
1147
Claude Sonnet 3.5 v2
1148
Claude Sonnet 4.5
1150
Qwen3 Omni 30B A3B Thinking
1151
Gemini 2.5 Flash Preview 0925
1152
Qwen Plus (Aug'24)
1154
Grok 4 Fast Reasoning
1154
Claude Haiku 4.5 (Extended Thinking)
1158
DeepSeek V3.2 Thinking

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121101Gemini 2.5 Flash Lite1113±64.4K1.8%1.3%210 tps0.7s1M$0.10$0.40
12271Gemini 2.5 Flash Lite Preview 09251116±102K2.4%1.2%209 tps0.7s1M$0.25$0.35
12379MiniMax M2.5 Lightning1119±166500.8%1.5%51 tps2.0s205K$0.60$2.40
124111LongCat Flash Chat1121±177254.6%0.8%85 tps0.9s131K$0.14$0.68
12540DeepSeek V3.21125±102.1K1.6%1.4%83 tps5.1s131K$0.43$1.09
12626GPT-5 (High)1131±92.7K3.2%4.5%81 tps35.9s400K$1.25$10.00
127133Kimi K2 09051145±101.5K2.0%4.0%30 tps1.4s262K$0.63$2.39
128106Claude Sonnet 3.5 v21147±141K2.0%<0.1%46 tps1.4s200K$3.00$15.00
12937Claude Sonnet 4.51148±93.3K2.7%1.4%41 tps1.3s200K$1.80$9.00
13037Qwen3 Omni 30B A3B Thinking1150±168453.4%3.7%67 tps1.2s66K$0.97$1.79
13160Gemini 2.5 Flash Preview 09251151±91.8K3.3%1.2%5 tps0.9s1M$0.13$0.97
13268Qwen Plus (Aug'24)1152±94.8K1.1%1.4%53 tps1.3s30K$0.40$1.20
13348Grok 4 Fast Reasoning1154±102.2K3.0%2.1%102 tps3.1s2M$0.30$0.75
13426Claude Haiku 4.5 (Extended Thinking)1154±82.4K2.8%1.4%115 tps0.7s200K$1.00$5.00
13556DeepSeek V3.2 Thinking1158±192.4K2.3%9.0%30 tps2.6s131K$0.28$0.42
13644Grok 4.1 Fast Reasoning1159±114.3K2.9%1.5%58 tps7.3s2M$0.20$0.50
13744Gemini 2.5 Pro1161±612.1K1.4%2.3%45 tps2.6s1M$1.25$10.00
13856Gemini 3.1 Flash Lite Preview Thinking1162±197302.0%1.7%75 tps4.7s1M$0.25$1.50
13948Step 3.5 Flash1163±236400.8%2.2%109 tps0.6s256K$0.05$0.15
14044Kimi K2 Thinking Turbo1171±111.8K4.2%2.0%75 tps1.4s262K$1.15$8.00
14142GPT-5.2 (Extra High) 1172±133K1.6%13.2%17 tps20.5s400K$1.75$14.00
14233Kimi K2.51175±123.1K1.6%6.5%33 tps1.7s262K$0.34$2.57
14317Claude Opus 4.51177±151.9K4.7%1.5%45 tps1.5s200K$5.00$25.00
14429Nova Experimental Chat 12-101192±111.4K0.7%2.4%84 tps12.9s98K$0$0
14542Qwen3 Max Instruct Preview1192±92.8K3.0%1.1%31 tps1.7s256K$1.43$6.61
14613GPT-5.3 Instant1225±132.3K1.3%0.9%63 tps0.8s400K$1.75$14.00
14781GPT-4o1228±112.9K1.7%1.0%49 tps2.4s128K$3.71$12.57
14862GPT-5.1 Instant1233±122.6K2.7%1.3%50 tps1.9s400K$1.25$10.00
14932Gemini 2.5 Pro High1234±64.6K2.4%1.5%48 tps2.3s1M$1.25$10.00
15014Gemini 3 Flash Preview Thinking1248±103.7K1.3%1.6%3 tps6.2s1M$0.50$3.00
15126Grok 4.1 Fast Non-Reasoning1260±162.5K3.7%0.9%101 tps0.5s2M$0.20$0.50
15210Claude Sonnet 4.5 (Thinking)1261±75.5K1.9%1.9%44 tps1.1s200K$3.00$15.00
15340Qwen3 235B A22B Instruct 25071261±83.1K1.4%6.8%13 tps1.9s262K$0.13$0.52
15448gpt-oss-120b1269±74.6K1.4%0.7%213 tps0.5s131K$0.11$0.50
15522GPT-5 Chat1269±57.9K1.6%1.3%95 tps0.9s400K$1.25$10.00
15633Qwen3 Next 80B A3B Instruct1270±102.3K2.8%0.6%84 tps1.1s256K$0.20$1.42
15729Qwen3 VL 235B A22B Instruct1273±141.1K2.3%3.1%75 tps1.9s129K$0.37$1.81
15817GPT-5.2 (High)1275±126.5K1.4%6.7%18 tps16.3s400K$1.75$14.00
1595Claude Sonnet 4.6 (Thinking)1280±141.4K2.2%4.7%57 tps1.1s200K$3.00$15.00
16017Gemini 3 Flash Preview1281±112K1.2%1.3%138 tps1.4s1M$0.50$3.00
View All (173 models)