Last updated about 1 month ago

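| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 121 | 139 | Qwen3 VL 30B A3B Instruct | 1012 | ±17 | 1K | 6.5% | 1.8% | 80 tps | 2.6s | 129K | $0.18 | $0.67 |
| 122 | 143 | Gemini 2.0 Flash Lite | 1011 | ±6 | 5.7K | 6.9% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 123 | 113 | Kimi K2 Fast | 1006 | ±4 | 26.2K | 13.8% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 124 | 113 | GLM 4.5 | 1002 | ±5 | 3.7K | 14.3% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 125 | 121 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 1000 | ±16 | 1K | 9.9% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 126 | 139 | OpenAI o4-mini | 1000 | ±5 | 4.8K | 10.2% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 127 | 170 | Llama 3.1 8B Turbo | 998 | ±12 | 1.1K | 2.8% | 2.1% | 650 tps | 0.5s | 128K | $0.13 | $0.14 |
| 128 | 111 | LongCat Flash Chat | 996 | ±14 | 930 | 7.0% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 129 | 133 | Kimi K2 0905 | 991 | ±6 | 7.5K | 5.6% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 130 | 129 | Qwen3 Max Thinking | 990 | ±13 | 1.7K | 2.3% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 131 | 157 | Cogito v2.1 671B | 984 | ±17 | 715 | 5.9% | 0.8% | 85 tps | 0.5s | 128K | $1.25 | $1.25 |
| 132 | 133 | DeepSeek-R1 0528 | 983 | ±12 | 1.3K | 4.6% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 133 | 201 | Gemma 3 27B IT | 983 | ±15 | 905 | 10.4% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 |
| 134 | 126 | Qwen3 VL 235B A22B Thinking | 979 | ±6 | 3.5K | 11.5% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 135 | 165 | DeepSeek R1T2 Chimera | 978 | ±10 | 1.1K | 11.0% | 3.0% | 28 tps | 1.8s | 164K | $0.13 | $0.45 |
| 136 | 214 | Qwen 2.5 VL 32B Instruct | 977 | ±20 | 850 | 7.6% | 6.3% | 43 tps | 3.2s | 128K | $0.35 | $0.62 |
| 137 | 209 | Seed 1.6 Flash 250715 | 974 | ±16 | 980 | 6.2% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 138 | 222 | Sky T1 32B Preview | 972 | ±16 | 805 | 10.6% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 139 | 177 | Mistral Small 3.1 24B Instruct | 966 | ±12 | 1K | 10.6% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 |
| 140 | 161 | Mistral Small 3.1 | 960 | ±16 | 915 | 11.2% | 7.4% | 13 tps | 2.6s | 32K | $0.17 | $0.28 |
| 141 | 161 | Llama 4 Maverick | 956 | ±4 | 11.2K | 8.2% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 142 | 121 | QwQ 32B | 955 | ±7 | 5K | 15.3% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 143 | 101 | gpt-oss-20b | 954 | ±5 | 6.1K | 10.8% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 144 | 165 | Qwen3 VL 30B A3B Thinking | 949 | ±8 | 1.5K | 11.2% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 145 | 177 | OpenAI o3-mini | 946 | ±6 | 6.7K | 12.3% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 146 | 153 | Ministral 14B 3.0 | 945 | ±28 | 490 | 11.7% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 147 | 170 | Devstral Medium | 945 | ±11 | 1.6K | 14.7% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 148 | 194 | Llama 3.2 11B Instruct | 943 | ±14 | 745 | 14.4% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 |
| 149 | 126 | Qwen3 30B A3B | 939 | ±5 | 3.7K | 12.1% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 150 | 201 | GPT-4o mini | 939 | ±9 | 1.4K | 9.2% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 151 | 148 | DeepSeek-R1 | 939 | ±12 | 1.6K | 5.5% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 152 | 86 | Nemotron 3 Nano (Thinking) | 938 | ±14 | 1.3K | 7.6% | 2.0% | 200 tps | 0.5s | 256K | $0 | $0 |
| 153 | 139 | GLM 4.6V | 938 | ±11 | 2.5K | 6.1% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 154 | 148 | OpenAI o4-mini-high | 937 | ±6 | 6K | 14.9% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 155 | 175 | OpenAI o3-mini-low | 937 | ±4 | 6.1K | 13.6% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 156 | 133 | Qwen3 14B | 933 | ±11 | 2.7K | 17.1% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 157 | 121 | Qwen3 32B Fast | 932 | ±5 | 4.5K | 12.9% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 158 | 265 | Qwen 2.5 VL 72B Instruct | 929 | ±12 | 1.2K | 7.9% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 159 | 209 | Qwen 2.5 14B Instruct | 928 | ±13 | 910 | 11.7% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 |
| 160 | 133 | DeepSeek V3.2 Speciale | 924 | ±12 | 1.6K | 6.3% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |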
View All (237 models)