Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1021
DeepSeek-R1 Turbo
1021
Qwen Max
1020
Qwen3 Max Thinking Preview
1018
DeepSeek V3.1 Chat
1014
Kimi K2 0905
1014
Gemini 2.5 Flash Lite
1013
Amazon Nova 2 Lite
1005
GPT-5 Mini Low
1003
Grok 3
1002
Kimi K2 0711
999
DeepSeek V3.2 Exp Thinking
996
Gemini 2.5 Flash Lite Thinking
991
Kimi K2 0905 Turbo
990
DeepSeek V3 0324
989
Kimi K2 Fast

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8195DeepSeek-R1 Turbo1021±134853.0%2.6%29 tps1.8s64K$2.85$4.75
8293Qwen Max1021±141.8K2.7%1.5%49 tps1.5s33K$1.60$6.40
8379Qwen3 Max Thinking Preview1020±101.2K2.4%3.1%40 tps2.1s256K$1.20$6.00
8486DeepSeek V3.1 Chat1018±121.1K3.1%2.8%21 tps1.6s131K$0.38$1.00
85133Kimi K2 09051014±138102.4%4.0%30 tps1.4s262K$0.63$2.39
86101Gemini 2.5 Flash Lite1014±95.3K3.9%1.3%210 tps0.7s1M$0.10$0.40
8786Amazon Nova 2 Lite1013±188154.7%1.0%137 tps0.6s300K$0.35$2.95
88108GPT-5 Mini Low1005±167353.9%<0.1%69 tps3.2s400K$0.25$2.00
89106Grok 31003±92K2.6%1.5%53 tps0.6s1M$3.67$18.33
90170Kimi K2 07111002±157203.4%1.6%29 tps1.3s131K$0.72$2.60
9195DeepSeek V3.2 Exp Thinking999±227751.9%7.2%26 tps3.0s131K$0.28$0.42
92113Gemini 2.5 Flash Lite Thinking996±112.5K3.7%1.0%118 tps4.4s1M$0.03$0.13
93124Kimi K2 0905 Turbo991±121.6K1.8%0.7%373 tps0.5s262K$1.70$6.50
94106DeepSeek V3 0324990±112.1K3.0%5.8%12 tps2.7s164K$0.38$0.93
95113Kimi K2 Fast989±107.4K2.2%0.8%365 tps0.5s131K$1.00$3.00
9686Qwen3 235B A22B989±197403.9%5.3%71 tps0.9s41K$0.23$0.63
97113Mistral Medium989±141.1K2.7%1.8%48 tps0.6s33K$1.48$4.55
9844Kimi K2 Thinking Turbo986±141.1K3.2%2.0%75 tps1.4s262K$1.15$8.00
9995Kimi K2 Thinking985±216203.1%4.2%61 tps5.9s262K$0.24$1.03
100106DeepSeek V3.1 Terminus Thinking979±136603.6%5.9%27 tps1.8s131K$0.56$1.68
101118GPT-4.1 mini976±132.7K1.8%1.1%67 tps0.9s1M$0.34$1.60
102113GLM 4.5969±191.3K3.5%3.7%46 tps1.4s131K$0.43$1.63
10384GPT-5 Mini Minimal968±177953.6%1.2%63 tps1.4s400K$0.25$2.00
104148OpenAI o3960±166002.4%0.9%85 tps6.8s128K$7.33$29.33
105129DeepSeek V3.1 Thinking958±141K2.4%7.1%18 tps1.8s131K$0.23$0.75
10656DeepSeek V3.1 Turbo957±148204.1%0.9%173 tps1.3s164K$2.00$3.75
107126DeepSeek V3956±121.7K2.5%0.9%69 tps1.1s64K$0.59$1.49
108148OpenAI o4-mini-high950±121.5K3.8%1.9%117 tps15.9s200K$1.10$4.40
10968GLM 4.7949±131.6K1.8%5.8%40 tps1.5s200K$0.77$1.73
11071Seed 1.8 251228949±181.2K2.7%3.7%41 tps2.1s256K$0.25$2.00
111129Command A948±121.9K3.1%2.2%42 tps0.8s256K$2.00$7.33
112139OpenAI o4-mini947±111.2K2.5%1.4%97 tps7.0s128K$1.10$4.40
113133Qwen3 14B943±168254.1%1.7%109 tps0.8s41K$0.04$0.15
114126Qwen3 VL 235B A22B Thinking939±159653.5%4.3%47 tps3.0s127K$0.47$3.31
115133Solar Pro 2 250710938±101.7K3.9%<0.1%9 tpsN/A66K$0.50$0.50
116148DeepSeek-R1936±217054.1%0.8%133 tps0.6s64K$0.91$3.07
117147GLM 4.5 Air932±161.6K3.5%<0.1%22 tps1.4s131K$0.10$0.38
118153OpenAI o1926±159152.1%4.2%92 tps5.5s200K$15.00$60.00
119101gpt-oss-20b912±121.5K4.1%0.5%216 tps0.5s131K$0.06$0.26
12065DeepSeek V3.2 Exp Chat909±111.3K2.6%2.6%29 tps1.5s131K$0.27$0.39
View All (159 models)