Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

970
OpenAI o3
963
LongCat Flash Chat
960
OpenAI o1
958
OpenAI o4-mini-high
956
OpenAI o4-mini
955
DeepSeek V3.1 Thinking
952
Qwen3 Max Thinking Preview
951
OpenAI o3-pro
949
Qwen3.5 35B A3B
928
Seed 1.6 250615
926
Kimi K2 Thinking
925
Kimi K2 0905 Turbo
922
Kimi K2 0905
922
Qwen3 VL 235B A22B Thinking
918
ERNIE 4.5 300B A47B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81148OpenAI o3970±101.2K3.1%0.9%85 tps6.8s128K$7.33$29.33
82111LongCat Flash Chat963±255604.3%0.8%85 tps0.9s131K$0.14$0.68
83153OpenAI o1960±112.3K2.4%4.2%92 tps5.5s200K$15.00$60.00
84148OpenAI o4-mini-high958±112.2K3.1%1.9%117 tps15.9s200K$1.10$4.40
85139OpenAI o4-mini956±161.4K2.8%1.4%97 tps7.0s128K$1.10$4.40
86129DeepSeek V3.1 Thinking955±141.1K2.2%7.1%18 tps1.8s131K$0.23$0.75
8779Qwen3 Max Thinking Preview952±201.1K2.2%3.1%40 tps2.1s256K$1.20$6.00
8881OpenAI o3-pro951±191.6K3.4%5.2%22 tps70.8s200K$20.00$80.00
89101Qwen3.5 35B A3B949±275302.8%2.1%116 tps2.1s256K$0.63$1.13
90143Seed 1.6 250615928±216355.2%3.1%46 tps2.2s256K$0.25$2.00
9195Kimi K2 Thinking926±177402.0%4.2%61 tps5.9s262K$0.24$1.03
92124Kimi K2 0905 Turbo925±131.5K4.7%0.7%373 tps0.5s262K$1.70$6.50
93133Kimi K2 0905922±218054.2%4.0%30 tps1.4s262K$0.63$2.39
94126Qwen3 VL 235B A22B Thinking922±187454.5%4.3%47 tps3.0s127K$0.47$3.31
95119ERNIE 4.5 300B A47B918±171.6K2.7%4.7%23 tps2.3s123K$0.28$1.10
96143Gemini 2.0 Flash Lite917±112.5K6.7%<0.1%42 tps0.5s1M$0.08$0.30
9762MiniMax M2905±181.4K3.5%2.2%39 tps2.3s205K$0.21$0.85
98177OpenAI o3-mini901±122.5K3.1%0.8%143 tps3.3s200K$1.10$4.40
99139Seed 2.0 Mini (Medium)900±305153.7%11.9%33 tps1.7s256K$0.15$0.60
100165Qwen3 4B878±237353.9%1.9%94 tps1.5s128K$0.01$0.01
101160Llama 4 Scout875±152.3K2.9%0.6%88 tps5.1s131K$0.18$0.46
102179GLM 4.7 Flash874±248552.8%5.8%61 tps2.8s128K$0.07$0.39
103214OpenAI o3-mini-high868±131.4K3.8%2.4%231 tps10.5s200K$1.10$4.40
104129Qwen3 Max Thinking866±141.5K1.7%13.5%32 tps2.3s256K$1.20$6.00
105139GLM 4.6V865±248902.7%6.4%21 tps1.8s128K$0.38$0.90
106170Kimi K2 0711858±248904.3%1.6%29 tps1.3s131K$0.72$2.60
107157Qwen3 Next 80B A3B Thinking846±151.3K3.9%0.6%175 tps1.3s256K$0.21$2.26
108157GPT-5 Nano843±142K6.0%3.2%113 tps20.9s400K$0.05$0.40
109175OpenAI o3-mini-low838±211.7K2.6%0.7%139 tps1.5s200K$1.10$4.40
11084GPT-5 Mini Minimal835±131.1K6.6%1.2%63 tps1.4s400K$0.25$2.00
111186Grok 3 Mini Fast832±231K3.3%1.6%44 tps0.5s131K$0.60$4.00
112133DeepSeek V3.2 Speciale830±285403.6%6.0%43 tps1.4s131K$0.84$1.52
113161Qwen3 8B827±366004.0%2.4%61 tps1.4s41K$0.02$0.07
114201GPT-4o mini826±186456.5%2.1%71 tps1.7s128K$0.15$0.60
115148Qwen3 30B A3B Thinking 2507818±187953.0%0.5%124 tps1.2s131K$0.16$1.70
116265Qwen 2.5 VL 72B Instruct804±297156.5%5.3%25 tps3.7s128K$1.01$2.79
117229Magistral Medium 2509797±175705.0%4.0%58 tps0.9s131K$2.00$5.00
118265Magistral Small 2509790±295306.2%2.7%116 tps0.6s131K$0.50$1.50
119186Gemma 3n E4B781±275354.5%2.0%30 tps0.5s8K$0.01$0.02
120194Llama 3.3 70B745±305254.5%0.3%500 tps0.5s8K$0.48$0.66
View All (121 models)