Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1031
DeepSeek R1T2 Chimera
1037
OpenAI o3-pro
1038
DeepSeek-R1 0528
1041
Gemini 2.5 Flash Lite Thinking
1044
Grok 3
1045
DeepSeek V3 0324
1046
DeepSeek V3.2 Exp Thinking
1046
Kimi K2.5 Instant
1047
GPT-5 Mini
1047
GLM 4.7
1049
Qwen3 Next 80B A3B Thinking
1050
MiniMax M2.1 Lightning
1051
Gemini 2.5 Flash Lite Thinking Preview 0925
1053
Qwen3 14B
1053
Qwen3.5 122B A17B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81165DeepSeek R1T2 Chimera1031±175753.4%3.0%28 tps1.8s164K$0.13$0.45
8281OpenAI o3-pro1037±189502.6%5.2%22 tps70.8s200K$20.00$80.00
83133DeepSeek-R1 05281038±131.7K2.0%1.3%93 tps0.5s64K$1.60$3.67
84113Gemini 2.5 Flash Lite Thinking1041±92.2K1.8%1.0%118 tps4.4s1M$0.03$0.13
85106Grok 31044±76K1.1%1.5%53 tps0.6s1M$3.67$18.33
86106DeepSeek V3 03241045±84.1K1.0%5.8%12 tps2.7s164K$0.38$0.93
8795DeepSeek V3.2 Exp Thinking1046±187354.5%7.2%26 tps3.0s131K$0.28$0.42
8837Kimi K2.5 Instant1046±166202.4%2.9%32 tps3.0s262K$0.50$3.00
8971GPT-5 Mini1047±92.2K2.7%2.6%66 tps14.2s400K$0.25$2.00
9068GLM 4.71047±152.4K1.2%5.8%40 tps1.5s200K$0.77$1.73
91157Qwen3 Next 80B A3B Thinking1049±92K2.6%0.6%175 tps1.3s256K$0.21$2.26
9256MiniMax M2.1 Lightning1050±236151.6%1.7%52 tps2.1s205K$0.30$2.40
9395Gemini 2.5 Flash Lite Thinking Preview 09251051±151.5K4.2%1.5%152 tps3.0s1M$0.10$0.40
94133Qwen3 14B1053±131.7K2.9%1.7%109 tps0.8s41K$0.04$0.15
9552Qwen3.5 122B A17B1053±255803.3%1.5%82 tps1.4s256K$0.40$3.20
9665DeepSeek V3.2 Exp Chat1054±141.2K3.7%2.6%29 tps1.5s131K$0.27$0.39
9771Gemini 2.5 Flash Thinking1055±112.8K2.3%2.2%88 tps6.4s1M$0.30$2.50
9884GPT-5 Mini Minimal1057±117455.7%1.2%63 tps1.4s400K$0.25$2.00
9986Nemotron 3 Nano (Thinking)1059±188251.8%2.0%200 tps0.5s256K$0$0
10044DeepSeek V3.1 Terminus Chat1060±131.4K3.3%3.4%27 tps1.5s131K$0.86$1.80
101121Qwen3 32B Fast1061±93K2.4%11.6%30 tps3.1s41K$0.10$0.25
10295Gemini 2.5 Flash1061±710K1.0%1.3%2 tps3.7s1M$0.30$2.50
10368Grok 41062±610.5K1.3%3.9%29 tps11.1s256K$3.00$15.00
10493DeepSeek V3 0324 Turbo1068±74.2K0.8%6.3%12 tps2.4s164K$0.73$1.79
10586DeepSeek V3.1 Chat1069±111.3K3.0%2.8%21 tps1.6s131K$0.38$1.00
106113Kimi K2 Fast1069±78.5K1.0%0.8%365 tps0.5s131K$1.00$3.00
10760MiniMax M2.11071±112.6K1.1%2.1%66 tps2.6s205K$0.30$1.20
10895Kimi K2 Thinking1072±308808.8%4.2%61 tps5.9s262K$0.24$1.03
10995DeepSeek-R1 Turbo1073±167803.7%2.6%29 tps1.8s64K$2.85$4.75
11052Claude Haiku 4.51078±112.6K3.7%1.1%100 tps0.9s200K$1.00$5.00
111113Mistral Medium1079±92.7K1.8%1.8%48 tps0.6s33K$1.48$4.55
11222GLM 51080±151.4K1.4%3.4%36 tps2.7s200K$0.72$2.55
11393Qwen Max1080±75.4K1.1%1.5%49 tps1.5s33K$1.60$6.40
114148Qwen3 30B A3B Thinking 25071081±198902.7%0.5%124 tps1.2s131K$0.16$1.70
11556DeepSeek V3.1 Turbo1082±161.8K2.2%0.9%173 tps1.3s164K$2.00$3.75
11652GPT-51084±75K2.1%3.1%78 tps23.1s400K$1.25$9.67
117101gpt-oss-20b1085±122.1K2.6%0.5%216 tps0.5s131K$0.06$0.26
118124Qwen3 235B A22B Thinking 25071091±207052.1%2.5%53 tps1.6s131K$0.59$5.70
11979Qwen3 Max Thinking Preview1105±141.9K4.0%3.1%40 tps2.1s256K$1.20$6.00
12052Grok 4 Fast Non-Reasoning1108±121.8K3.0%1.5%93 tps0.6s2M$0.27$0.67
View All (173 models)