Last updated about 1 month ago

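| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41 | 48 | Grok 4 Fast Reasoning | 1142 | ±3 | 14.5K | 2.0% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 42 | 17 | GPT-5.4 mini | 1141 | ±14 | 545 | 1.8% | 0.8% | 148 tps | 0.5s | 400K | $0.75 | $4.50 |
| 43 | 56 | DeepSeek V3.1 Turbo | 1134 | ±3 | 9.5K | 1.2% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 44 | 56 | MiniMax M2.1 Lightning | 1129 | ±5 | 3.6K | 1.4% | 1.7% | 52 tps | 2.1s | 205K | $0.30 | $2.40 |
| 45 | 71 | DeepSeek V3.1 | 1125 | ±4 | 4.4K | 1.1% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 46 | 60 | MiniMax M2.1 | 1124 | ±3 | 24.4K | 1.0% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 47 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1121 | ±3 | 14.1K | 1.8% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 48 | 60 | Gemini 2.5 Flash Preview 0925 | 1118 | ±3 | 14.4K | 2.2% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 49 | 52 | GPT-5 | 1117 | ±2 | 31.1K | 1.7% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 50 | 81 | OpenAI o3-pro | 1116 | ±5 | 3.2K | 2.8% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 51 | 68 | Grok 4 | 1110 | ±1 | 98.8K | 0.9% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 52 | 17 | Claude Opus 4.5 | 1110 | ±4 | 12.9K | 2.2% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 53 | 62 | MiniMax M2 | 1110 | ±3 | 17.2K | 2.5% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 54 | 71 | Qwen3.5 397B A17B | 1107 | ±6 | 5.1K | 1.6% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 55 | 86 | DeepSeek V3.1 Nex N1 | 1107 | ±8 | 1.5K | 1.3% | 3.4% | 24 tps | 7.2s | 131K | $0.14 | $0.50 |
| 56 | 79 | Qwen3 Max Thinking Preview | 1106 | ±4 | 13.3K | 2.0% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 57 | 101 | DeepSeek V3 (Turbo) | 1105 | ±5 | 3.7K | 1.5% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 58 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1105 | ±8 | 2K | 1.7% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 59 | 68 | GLM 4.7 | 1105 | ±3 | 21K | 1.0% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 60 | 86 | Amazon Nova 2 Lite | 1099 | ±4 | 10.5K | 2.7% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 61 | 68 | Qwen Plus (Aug'24) | 1098 | ±2 | 50.5K | 1.1% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 62 | 101 | GPT-5 (Low) | 1097 | ±7 | 1.5K | 1.0% | 1.8% | 75 tps | 8.2s | 400K | $1.25 | $10.00 |
| 63 | 62 | GPT-5.1 Instant | 1096 | ±3 | 14.9K | 1.5% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 64 | 84 | GPT-5 Mini Minimal | 1094 | ±3 | 4.9K | 3.0% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 65 | 95 | Kimi K2 Thinking | 1092 | ±4 | 5.4K | 2.0% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 66 | 37 | Claude Sonnet 4.5 | 1092 | ±2 | 25.2K | 2.2% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 67 | 71 | MiniMax M2.5 FP8 | 1092 | ±10 | 2.1K | 1.6% | 3.6% | 33 tps | 1.7s | 205K | $0.45 | $1.75 |
| 68 | 81 | GPT-4o | 1091 | ±2 | 23.5K | 0.7% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 69 | 71 | Seed 1.8 251228 | 1090 | ±3 | 14.9K | 1.5% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 70 | 52 | Claude Haiku 4.5 | 1089 | ±3 | 20.4K | 2.1% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 71 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1087 | ±2 | 15.1K | 2.5% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 72 | 86 | DeepSeek V3.1 Chat | 1087 | ±3 | 10.7K | 1.8% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 73 | 71 | GPT-5 Mini | 1087 | ±3 | 11.3K | 2.1% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 74 | 95 | Qwen3 32B | 1085 | ±5 | 2.6K | 1.5% | 3.9% | 30 tps | 3.1s | 41K | $0.12 | $0.42 |
| 75 | 93 | Qwen Max | 1084 | ±2 | 54.8K | 0.9% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 76 | 93 | DeepSeek V3 0324 Turbo | 1081 | ±3 | 50.9K | 1.4% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 77 | 133 | Nemotron 3 Nano | 1076 | ±8 | 1.6K | 1.9% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 78 | 65 | GLM 4.6 | 1075 | ±3 | 11.7K | 2.8% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 79 | 121 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 1074 | ±6 | 3.5K | 2.2% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 80 | 71 | Gemini 3.1 Flash Lite Preview | 1073 | ±11 | 1.3K | 2.2% | 1.0% | 8 tps | 1.2s | 1M | $0.25 | $1.50 |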
Showing ranks 41 to 80 of 203 models.