Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1145
Kimi K2 0905
1131
GPT-5 (High)
1125
DeepSeek V3.2
1121
LongCat Flash Chat
1116
Gemini 2.5 Flash Lite Preview 0925
1113
Gemini 2.5 Flash Lite
1108
Grok 4 Fast Non-Reasoning
1105
Qwen3 Max Thinking Preview
1091
Qwen3 235B A22B Thinking 2507
1084
GPT-5
1082
DeepSeek V3.1 Turbo
1081
Qwen3 30B A3B Thinking 2507
1080
Qwen Max
1080
GLM 5
1079
Mistral Medium

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
41133Kimi K2 09051145±101.5K2.0%4.0%30 tps1.4s262K$0.63$2.39
4226GPT-5 (High)1131±92.7K3.2%4.5%81 tps35.9s400K$1.25$10.00
4340DeepSeek V3.21125±102.1K1.6%1.4%83 tps5.1s131K$0.43$1.09
44111LongCat Flash Chat1121±177254.6%0.8%85 tps0.9s131K$0.14$0.68
4571Gemini 2.5 Flash Lite Preview 09251116±102K2.4%1.2%209 tps0.7s1M$0.25$0.35
46101Gemini 2.5 Flash Lite1113±64.4K1.8%1.3%210 tps0.7s1M$0.10$0.40
4752Grok 4 Fast Non-Reasoning1108±121.8K3.0%1.5%93 tps0.6s2M$0.27$0.67
4879Qwen3 Max Thinking Preview1105±141.9K4.0%3.1%40 tps2.1s256K$1.20$6.00
49124Qwen3 235B A22B Thinking 25071091±207052.1%2.5%53 tps1.6s131K$0.59$5.70
5052GPT-51084±75K2.1%3.1%78 tps23.1s400K$1.25$9.67
5156DeepSeek V3.1 Turbo1082±161.8K2.2%0.9%173 tps1.3s164K$2.00$3.75
52148Qwen3 30B A3B Thinking 25071081±198902.7%0.5%124 tps1.2s131K$0.16$1.70
5393Qwen Max1080±75.4K1.1%1.5%49 tps1.5s33K$1.60$6.40
5422GLM 51080±151.4K1.4%3.4%36 tps2.7s200K$0.72$2.55
55113Mistral Medium1079±92.7K1.8%1.8%48 tps0.6s33K$1.48$4.55
5652Claude Haiku 4.51078±112.6K3.7%1.1%100 tps0.9s200K$1.00$5.00
5795Kimi K2 Thinking1072±308808.8%4.2%61 tps5.9s262K$0.24$1.03
5860MiniMax M2.11071±112.6K1.1%2.1%66 tps2.6s205K$0.30$1.20
5986DeepSeek V3.1 Chat1069±111.3K3.0%2.8%21 tps1.6s131K$0.38$1.00
6093DeepSeek V3 0324 Turbo1068±74.2K0.8%6.3%12 tps2.4s164K$0.73$1.79
6168Grok 41062±610.5K1.3%3.9%29 tps11.1s256K$3.00$15.00
6295Gemini 2.5 Flash1061±710K1.0%1.3%2 tps3.7s1M$0.30$2.50
6344DeepSeek V3.1 Terminus Chat1060±131.4K3.3%3.4%27 tps1.5s131K$0.86$1.80
6484GPT-5 Mini Minimal1057±117455.7%1.2%63 tps1.4s400K$0.25$2.00
6571Gemini 2.5 Flash Thinking1055±112.8K2.3%2.2%88 tps6.4s1M$0.30$2.50
6695Gemini 2.5 Flash Lite Thinking Preview 09251051±151.5K4.2%1.5%152 tps3.0s1M$0.10$0.40
6756MiniMax M2.1 Lightning1050±236151.6%1.7%52 tps2.1s205K$0.30$2.40
68157Qwen3 Next 80B A3B Thinking1049±92K2.6%0.6%175 tps1.3s256K$0.21$2.26
6968GLM 4.71047±152.4K1.2%5.8%40 tps1.5s200K$0.77$1.73
7071GPT-5 Mini1047±92.2K2.7%2.6%66 tps14.2s400K$0.25$2.00
71106DeepSeek V3 03241045±84.1K1.0%5.8%12 tps2.7s164K$0.38$0.93
72106Grok 31044±76K1.1%1.5%53 tps0.6s1M$3.67$18.33
73113Gemini 2.5 Flash Lite Thinking1041±92.2K1.8%1.0%118 tps4.4s1M$0.03$0.13
7481OpenAI o3-pro1037±189502.6%5.2%22 tps70.8s200K$20.00$80.00
75165DeepSeek R1T2 Chimera1031±175753.4%3.0%28 tps1.8s164K$0.13$0.45
7648Claude Sonnet 4 (Thinking)1028±153.7K2.9%1.5%52 tps1.5s200K$3.00$13.67
77126Qwen3 VL 235B A22B Thinking1027±139354.1%4.3%47 tps3.0s127K$0.47$3.31
7862MiniMax M21027±92.5K5.2%2.2%39 tps2.3s205K$0.21$0.85
79129Qwen3 Max Thinking1022±121.3K1.1%13.5%32 tps2.3s256K$1.20$6.00
8071Qwen3.5 397B A17B1021±229101.1%4.3%57 tps1.4s256K$0.52$3.00
View All (131 models)