Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

932
Qwen3 32B Fast
933
Qwen3 14B
937
OpenAI o3-mini-low
937
OpenAI o4-mini-high
938
GLM 4.6V
938
Nemotron 3 Nano (Thinking)
939
DeepSeek-R1
939
GPT-4o mini
939
Qwen3 30B A3B
943
Llama 3.2 11B Instruct
945
Devstral Medium
945
Ministral 14B 3.0
946
OpenAI o3-mini
949
Qwen3 VL 30B A3B Thinking
954
gpt-oss-20b

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81121Qwen3 32B Fast932±54.5K12.9%11.6%30 tps3.1s41K$0.10$0.25
82133Qwen3 14B933±112.7K17.1%1.7%109 tps0.8s41K$0.04$0.15
83175OpenAI o3-mini-low937±46.1K13.6%0.7%139 tps1.5s200K$1.10$4.40
84148OpenAI o4-mini-high937±66K14.9%1.9%117 tps15.9s200K$1.10$4.40
85139GLM 4.6V938±112.5K6.1%6.4%21 tps1.8s128K$0.38$0.90
8686Nemotron 3 Nano (Thinking)938±141.3K7.6%2.0%200 tps0.5s256K$0$0
87148DeepSeek-R1939±121.6K5.5%0.8%133 tps0.6s64K$0.91$3.07
88201GPT-4o mini939±91.4K9.2%2.1%71 tps1.7s128K$0.15$0.60
89126Qwen3 30B A3B939±53.7K12.1%5.1%163 tps1.0s41K$0.06$0.21
90194Llama 3.2 11B Instruct943±1474514.4%1.5%152 tps0.5s8K$0.16$0.16
91170Devstral Medium945±111.6K14.7%1.5%77 tps0.6s131K$0.40$2.00
92153Ministral 14B 3.0945±2849011.7%2.0%119 tps0.5s128K$0.20$0.20
93177OpenAI o3-mini946±66.7K12.3%0.8%143 tps3.3s200K$1.10$4.40
94165Qwen3 VL 30B A3B Thinking949±81.5K11.2%4.5%84 tps2.9s127K$0.20$1.47
95101gpt-oss-20b954±56.1K10.8%0.5%216 tps0.5s131K$0.06$0.26
96121QwQ 32B955±75K15.3%5.4%41 tps2.1s16K$0.43$0.56
97161Llama 4 Maverick956±411.2K8.2%1.2%88 tps2.4s1M$0.23$0.83
98161Mistral Small 3.1960±1691511.2%7.4%13 tps2.6s32K$0.17$0.28
99177Mistral Small 3.1 24B Instruct966±121K10.6%7.5%15 tps2.4s131K$0.06$0.18
100222Sky T1 32B Preview972±1680510.6%7.8%73 tps0.6s16K$0.12$0.18
101209Seed 1.6 Flash 250715974±169806.2%2.5%108 tps1.6s256K$0.07$0.30
102214Qwen 2.5 VL 32B Instruct977±208507.6%6.3%43 tps3.2s128K$0.35$0.62
103165DeepSeek R1T2 Chimera978±101.1K11.0%3.0%28 tps1.8s164K$0.13$0.45
104126Qwen3 VL 235B A22B Thinking979±63.5K11.5%4.3%47 tps3.0s127K$0.47$3.31
105201Gemma 3 27B IT983±1590510.4%2.0%60 tps0.8s128K$0.17$0.29
106133DeepSeek-R1 0528983±121.3K4.6%1.3%93 tps0.5s64K$1.60$3.67
107157Cogito v2.1 671B984±177155.9%0.8%85 tps0.5s128K$1.25$1.25
108129Qwen3 Max Thinking990±131.7K2.3%13.5%32 tps2.3s256K$1.20$6.00
109133Kimi K2 0905991±67.5K5.6%4.0%30 tps1.4s262K$0.63$2.39
110111LongCat Flash Chat996±149307.0%0.8%85 tps0.9s131K$0.14$0.68
111170Llama 3.1 8B Turbo998±121.1K2.8%2.1%650 tps0.5s128K$0.13$0.14
112139OpenAI o4-mini1000±54.8K10.2%1.4%97 tps7.0s128K$1.10$4.40
113121NVIDIA Llama 3.3 Nemotron Super 49B v1.51000±161K9.9%2.0%50 tps0.6s131K$0.09$0.33
114113GLM 4.51002±53.7K14.3%3.7%46 tps1.4s131K$0.43$1.63
115113Kimi K2 Fast1006±426.2K13.8%0.8%365 tps0.5s131K$1.00$3.00
116143Gemini 2.0 Flash Lite1011±65.7K6.9%<0.1%42 tps0.5s1M$0.08$0.30
117139Qwen3 VL 30B A3B Instruct1012±171K6.5%1.8%80 tps2.6s129K$0.18$0.67
118129DeepSeek V3.1 Thinking1014±73.9K14.0%7.1%18 tps1.8s131K$0.23$0.75
119148OpenAI o31016±111.3K4.6%0.9%85 tps6.8s128K$7.33$29.33
120124Qwen3 235B A22B Thinking 25071018±111.1K4.2%2.5%53 tps1.6s131K$0.59$5.70
View All (237 models)