Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

948
Amazon Nova 2 Lite
948
Gemini 2.0 Flash Lite
946
OpenAI o1
941
Pixtral Large
940
Qwen3 4B
932
GLM 4.7 Flash
932
Kimi K2 0711
929
Mistral Small 3.2 24B
927
DeepSeek V3.1 Terminus Thinking
924
Mistral Small 3.1
922
Grok 3 Fast
922
DeepSeek V3.2 Speciale
915
OpenAI o4-mini-high
904
DeepSeek-R1
900
Llama 4 Maverick

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
12186Amazon Nova 2 Lite948±219707.6%1.0%137 tps0.6s300K$0.35$2.95
122143Gemini 2.0 Flash Lite948±73.5K1.7%<0.1%42 tps0.5s1M$0.08$0.30
123153OpenAI o1946±113.6K1.0%4.2%92 tps5.5s200K$15.00$60.00
124165Pixtral Large941±189403.6%2.5%57 tps1.3s128K$1.50$4.50
125165Qwen3 4B940±141.7K5.2%1.9%94 tps1.5s128K$0.01$0.01
126179GLM 4.7 Flash932±266902.1%5.8%61 tps2.8s128K$0.07$0.39
127170Kimi K2 0711932±122.4K1.7%1.6%29 tps1.3s131K$0.72$2.60
128170Mistral Small 3.2 24B929±149852.5%2.8%141 tps0.7s33K$0.02$0.08
129106DeepSeek V3.1 Terminus Thinking927±178404.5%5.9%27 tps1.8s131K$0.56$1.68
130161Mistral Small 3.1924±366152.4%7.4%13 tps2.6s32K$0.17$0.28
131111Grok 3 Fast922±125300.9%1.7%52 tps2.4s131K$5.00$25.00
132133DeepSeek V3.2 Speciale922±257806.6%6.0%43 tps1.4s131K$0.84$1.52
133148OpenAI o4-mini-high915±104.7K1.5%1.9%117 tps15.9s200K$1.10$4.40
134148DeepSeek-R1904±161.7K2.8%0.8%133 tps0.6s64K$0.91$3.07
135161Llama 4 Maverick900±115.1K1.9%1.2%88 tps2.4s1M$0.23$0.83
136143Seed 1.6 250615888±235302.8%3.1%46 tps2.2s256K$0.25$2.00
137139GLM 4.6V880±338852.2%6.4%21 tps1.8s128K$0.38$0.90
138201Gemma 3 27B IT879±215601.8%2.0%60 tps0.8s128K$0.17$0.29
139157Cogito v2.1 671B876±304903.9%0.8%85 tps0.5s128K$1.25$1.25
140160Llama 4 Scout872±114.4K1.4%0.6%88 tps5.1s131K$0.18$0.46
141194Magistral Small 2506870±161K2.8%1.6%156 tps0.5s40K$0.37$1.10
142170Devstral Medium865±198051.8%1.5%77 tps0.6s131K$0.40$2.00
143209Llama 3.3 Swallow 70B Instruct865±198201.8%1.4%153 tps1.3s131K$0.13$0.39
144186GLM 4.6V Flash858±237502.6%3.7%64 tps2.1s128K$0.04$0.40
145175OpenAI o3-mini-low852±84.4K1.8%0.7%139 tps1.5s200K$1.10$4.40
146186Grok 3 Mini852±142.5K1.4%1.2%43 tps0.5s131K$0.30$0.50
147177OpenAI o3-mini851±74.7K1.8%0.8%143 tps3.3s200K$1.10$4.40
148179Inception Mercury847±131.4K1.0%0.4%257 tps1.1s32K$0.25$1.00
149177Mistral Small 3.1 24B Instruct839±226953.5%7.5%15 tps2.4s131K$0.06$0.18
150214OpenAI o3-mini-high833±132.9K1.0%2.4%231 tps10.5s200K$1.10$4.40
151222Sky T1 32B Preview821±186251.6%7.8%73 tps0.6s16K$0.12$0.18
152235GLM 4 32B820±167002.1%2.6%40 tps1.6s33K$0.14$0.14
153186Gemma 3n E4B814±171.6K3.6%2.0%30 tps0.5s8K$0.01$0.02
154186Grok 3 Mini Fast807±142.4K1.8%1.6%44 tps0.5s131K$0.60$4.00
155179Amazon Nova Pro 1.0803±131.2K2.0%0.9%96 tps0.7s300K$0.80$1.70
156225Command R 7B787±266602.9%1.1%76 tps0.4s128K$0.04$0.15
157229Magistral Medium 2509782±285507.6%4.0%58 tps0.9s131K$2.00$5.00
158186Jamba 1.6 Large761±157801.9%2.0%59 tps1.2s256K$1.33$5.33
159246DeepSeek-R1 Distill Llama 70B755±199602.5%3.6%27 tps1.6s32K$0.73$0.95
160222Jamba 1.5 Large744±247152.1%1.7%48 tps0.9s256K$1.50$6.00
View All (173 models)