Last updated about 1 month ago

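| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 113 | Mistral Medium | 989 | ±14 | 1.1K | 2.7% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 82 | 44 | Kimi K2 Thinking Turbo | 986 | ±14 | 1.1K | 3.2% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 83 | 95 | Kimi K2 Thinking | 985 | ±21 | 620 | 3.1% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 84 | 106 | DeepSeek V3.1 Terminus Thinking | 979 | ±13 | 660 | 3.6% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 85 | 118 | GPT-4.1 mini | 976 | ±13 | 2.7K | 1.8% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 86 | 113 | GLM 4.5 | 969 | ±19 | 1.3K | 3.5% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 87 | 84 | GPT-5 Mini Minimal | 968 | ±17 | 795 | 3.6% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 88 | 148 | OpenAI o3 | 960 | ±16 | 600 | 2.4% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 89 | 129 | DeepSeek V3.1 Thinking | 958 | ±14 | 1K | 2.4% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 90 | 56 | DeepSeek V3.1 Turbo | 957 | ±14 | 820 | 4.1% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 91 | 126 | DeepSeek V3 | 956 | ±12 | 1.7K | 2.5% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 92 | 148 | OpenAI o4-mini-high | 950 | ±12 | 1.5K | 3.8% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 93 | 68 | GLM 4.7 | 949 | ±13 | 1.6K | 1.8% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 94 | 71 | Seed 1.8 251228 | 949 | ±18 | 1.2K | 2.7% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 95 | 129 | Command A | 948 | ±12 | 1.9K | 3.1% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 96 | 139 | OpenAI o4-mini | 947 | ±11 | 1.2K | 2.5% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 97 | 133 | Qwen3 14B | 943 | ±16 | 825 | 4.1% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 98 | 126 | Qwen3 VL 235B A22B Thinking | 939 | ±15 | 965 | 3.5% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 99 | 148 | DeepSeek-R1 | 936 | ±21 | 705 | 4.1% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 100 | 153 | OpenAI o1 | 926 | ±15 | 915 | 2.1% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 101 | 101 | gpt-oss-20b | 912 | ±12 | 1.5K | 4.1% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 102 | 65 | DeepSeek V3.2 Exp Chat | 909 | ±11 | 1.3K | 2.6% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 103 | 124 | Qwen3 235B A22B Thinking 2507 | 905 | ±16 | 695 | 2.1% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 104 | 133 | DeepSeek-R1 0528 | 904 | ±19 | 640 | 4.5% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 105 | 143 | Gemini 2.0 Flash Lite | 902 | ±14 | 990 | 7.9% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 106 | 157 | GPT-5 Nano | 901 | ±9 | 1.2K | 5.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 107 | 133 | GPT-4.1 nano | 896 | ±11 | 1.8K | 3.2% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 108 | 157 | Qwen3 Next 80B A3B Thinking | 890 | ±16 | 1.1K | 3.4% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 109 | 143 | Gemini 2.0 Flash | 885 | ±16 | 870 | 5.9% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 110 | 119 | ERNIE 4.5 300B A47B | 873 | ±12 | 1.1K | 3.4% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 111 | 121 | Qwen3 32B Fast | 863 | ±13 | 2K | 4.5% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 112 | 133 | DeepSeek V3.2 Speciale | 842 | ±38 | 485 | 4.9% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 113 | 161 | Llama 4 Maverick | 838 | ±10 | 2.4K | 4.3% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 114 | 139 | GLM 4.6V | 837 | ±23 | 640 | 5.2% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 115 | 126 | Qwen3 30B A3B | 832 | ±20 | 950 | 4.0% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 116 | 121 | QwQ 32B | 825 | ±16 | 1.3K | 5.5% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 117 | 165 | Qwen3 4B | 818 | ±20 | 870 | 4.9% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 |
| 118 | 161 | Qwen3 8B | 813 | ±23 | 610 | 5.4% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 119 | 186 | Grok 3 Mini | 799 | ±23 | 1.9K | 2.6% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 120 | 165 | Pixtral Large | 796 | ±26 | 610 | 7.6% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |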
View All (133 models)