Last updated about 1 month ago

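| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 201 | 52 | Claude Haiku 4.5 | 1089 | ±3 | 20.4K | 2.1% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 202 | 71 | Seed 1.8 251228 | 1090 | ±3 | 14.9K | 1.5% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 203 | 81 | GPT-4o | 1091 | ±2 | 23.5K | 0.7% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 204 | 71 | MiniMax M2.5 FP8 | 1092 | ±10 | 2.1K | 1.6% | 3.6% | 33 tps | 1.7s | 205K | $0.45 | $1.75 |
| 205 | 37 | Claude Sonnet 4.5 | 1092 | ±2 | 25.2K | 2.2% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 206 | 95 | Kimi K2 Thinking | 1092 | ±4 | 5.4K | 2.0% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 207 | 84 | GPT-5 Mini Minimal | 1094 | ±3 | 4.9K | 3.0% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 208 | 62 | GPT-5.1 Instant | 1096 | ±3 | 14.9K | 1.5% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 209 | 101 | GPT-5 (Low) | 1097 | ±7 | 1.5K | 1.0% | 1.8% | 75 tps | 8.2s | 400K | $1.25 | $10.00 |
| 210 | 68 | Qwen Plus (Aug'24) | 1098 | ±2 | 50.5K | 1.1% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 211 | 86 | Amazon Nova 2 Lite | 1099 | ±4 | 10.5K | 2.7% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 212 | 95 | DeepSeek-R1 Turbo | 1104 | ±5 | 4.8K | 1.5% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 213 | 68 | GLM 4.7 | 1105 | ±3 | 21K | 1.0% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 214 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1105 | ±8 | 2K | 1.7% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 215 | 101 | DeepSeek V3 (Turbo) | 1105 | ±5 | 3.7K | 1.5% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 216 | 79 | Qwen3 Max Thinking Preview | 1106 | ±4 | 13.3K | 2.0% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 217 | 86 | DeepSeek V3.1 Nex N1 | 1107 | ±8 | 1.5K | 1.3% | 3.4% | 24 tps | 7.2s | 131K | $0.14 | $0.50 |
| 218 | 71 | Qwen3.5 397B A17B | 1107 | ±6 | 5.1K | 1.6% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 219 | 62 | MiniMax M2 | 1110 | ±3 | 17.2K | 2.5% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 220 | 17 | Claude Opus 4.5 | 1110 | ±4 | 12.9K | 2.2% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 221 | 68 | Grok 4 | 1110 | ±1 | 98.8K | 0.9% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 222 | 81 | OpenAI o3-pro | 1116 | ±5 | 3.2K | 2.8% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 223 | 52 | GPT-5 | 1117 | ±2 | 31.1K | 1.7% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 224 | 60 | Gemini 2.5 Flash Preview 0925 | 1118 | ±3 | 14.4K | 2.2% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 225 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1121 | ±3 | 14.1K | 1.8% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 226 | 52 | Qwen3.5 122B A17B | 1123 | ±5 | 2.6K | 1.3% | 1.5% | 82 tps | 1.4s | 256K | $0.40 | $3.20 |
| 227 | 86 | Nemotron 3 Nano (Thinking) | 1123 | ±3 | 5.9K | 1.5% | 2.0% | 200 tps | 0.5s | 256K | $0 | $0 |
| 228 | 60 | MiniMax M2.1 | 1124 | ±3 | 24.4K | 1.0% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 229 | 65 | DeepSeek V3.2 Exp Chat | 1125 | ±3 | 11.5K | 1.9% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 230 | 71 | DeepSeek V3.1 | 1125 | ±4 | 4.4K | 1.1% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 231 | 86 | Qwen3 235B A22B | 1129 | ±3 | 7.8K | 2.1% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 232 | 56 | MiniMax M2.1 Lightning | 1129 | ±5 | 3.6K | 1.4% | 1.7% | 52 tps | 2.1s | 205K | $0.30 | $2.40 |
| 233 | 65 | Mistral Large 3 | 1133 | ±4 | 10.8K | 2.6% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 234 | 56 | DeepSeek V3.1 Turbo | 1134 | ±3 | 9.5K | 1.2% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 235 | 17 | GPT-5.4 mini | 1141 | ±14 | 545 | 1.8% | 0.8% | 148 tps | 0.5s | 400K | $0.75 | $4.50 |
| 236 | 48 | Grok 4 Fast Reasoning | 1142 | ±3 | 14.5K | 2.0% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 237 | 29 | MiniMax M2.7 | 1142 | ±13 | 700 | 1.4% | 3.0% | 34 tps | 2.5s | 205K | $0.30 | $1.20 |
| 238 | 56 | DeepSeek V3.2 Thinking | 1144 | ±4 | 16.9K | 1.3% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 239 | 44 | Kimi K2 Thinking Turbo | 1145 | ±2 | 10.9K | 1.6% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 240 | 42 | GPT-5.2 (Extra High) | 1147 | ±2 | 15.6K | 1.4% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |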
Showing ranks 201 to 240 of 283 models.