Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

900
Llama 4 Maverick
899
R1 1776
888
Claude Sonnet 3.5
888
Seed 1.6 250615
886
DeepSeek-R1 0528 Qwen3 8B
883
NVIDIA Llama 3.3 Nemotron Super 49B v1
880
GLM 4.6V
879
Gemma 3 27B IT
876
Cogito v2.1 671B
872
Llama 4 Scout
870
Magistral Small 2506
870
GPT-5 Mini High
865
Claude Haiku 3.5
865
Devstral Medium
865
Llama 3.3 Swallow 70B Instruct

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
161161Llama 4 Maverick900±115.1K1.9%1.2%88 tps2.4s1M$0.23$0.83
162253R1 1776899±151.3K0.8%<0.1%61 tps1.0s128K$2.00$8.00
163200Claude Sonnet 3.5888±274952.9%1.0%40 tps2.7s200K$3.00$15.00
164143Seed 1.6 250615888±235302.8%3.1%46 tps2.2s256K$0.25$2.00
165314DeepSeek-R1 0528 Qwen3 8B886±191.1K5.7%<0.1%45 tps2.4s128K$0.05$0.09
166219NVIDIA Llama 3.3 Nemotron Super 49B v1883±179151.1%<0.1%13 tpsN/A131K$0.07$0.20
167139GLM 4.6V880±338852.2%6.4%21 tps1.8s128K$0.38$0.90
168201Gemma 3 27B IT879±215601.8%2.0%60 tps0.8s128K$0.17$0.29
169157Cogito v2.1 671B876±304903.9%0.8%85 tps0.5s128K$1.25$1.25
170160Llama 4 Scout872±114.4K1.4%0.6%88 tps5.1s131K$0.18$0.46
171194Magistral Small 2506870±161K2.8%1.6%156 tps0.5s40K$0.37$1.10
172241GPT-5 Mini High870±177703.1%<0.1%33 tps3.9s400K$0.25$2.00
173213Claude Haiku 3.5865±121.3K3.1%0.8%40 tps2.8s200K$0.80$4.00
174170Devstral Medium865±198051.8%1.5%77 tps0.6s131K$0.40$2.00
175209Llama 3.3 Swallow 70B Instruct865±198201.8%1.4%153 tps1.3s131K$0.13$0.39
176219Arcee AI Virtuoso-Large863±177101.4%<0.1%64 tps0.5s131K$0.75$1.20
177186GLM 4.6V Flash858±237502.6%3.7%64 tps2.1s128K$0.04$0.40
178175OpenAI o3-mini-low852±84.4K1.8%0.7%139 tps1.5s200K$1.10$4.40
179186Grok 3 Mini852±142.5K1.4%1.2%43 tps0.5s131K$0.30$0.50
180177OpenAI o3-mini851±74.7K1.8%0.8%143 tps3.3s200K$1.10$4.40
181179Inception Mercury847±131.4K1.0%0.4%257 tps1.1s32K$0.25$1.00
182177Mistral Small 3.1 24B Instruct839±226953.5%7.5%15 tps2.4s131K$0.06$0.18
183214OpenAI o3-mini-high833±132.9K1.0%2.4%231 tps10.5s200K$1.10$4.40
184314MAI-DS-R1823±198554.5%<0.1%73 tps3.2s64K$0.10$0.40
185399Magistral Medium (Thinking)822±246001.6%<0.1%67 tps0.8s41K$2.00$5.00
186277Wikipedia821±131.7K5.9%<0.1%47 tps2.1s32K$0$0
187222Sky T1 32B Preview821±186251.6%7.8%73 tps0.6s16K$0.12$0.18
188235GLM 4 32B820±167002.1%2.6%40 tps1.6s33K$0.14$0.14
189186Gemma 3n E4B814±171.6K3.6%2.0%30 tps0.5s8K$0.01$0.02
190186Grok 3 Mini Fast807±142.4K1.8%1.6%44 tps0.5s131K$0.60$4.00
191179Amazon Nova Pro 1.0803±131.2K2.0%0.9%96 tps0.7s300K$0.80$1.70
192277Grok 2798±185550.9%<0.1%55 tps1.1s131K$2.00$10.00
193270AFM 4.5B Preview797±386103.9%<0.1%32 tps0.0s66K$0$0
194292Arcee AI Spotlight788±151.1K1.3%<0.1%121 tps0.4s131K$0.18$0.18
195225Command R 7B787±266602.9%1.1%76 tps0.4s128K$0.04$0.15
196219Grok 3 Mini Beta783±176050.8%<0.1%75 tps0.5s131K$0.45$2.25
197200NVIDIA Llama 3.1 Nemotron 70B783±171.2K2.4%<0.1%9 tps0.1s128K$0.33$0.39
198229Magistral Medium 2509782±285507.6%4.0%58 tps0.9s131K$2.00$5.00
199270Solar Pro 2 250710 (Reasoning)782±226051.6%<0.1%9 tpsN/A66K$0.50$0.50
200200K2 Think763±246050.8%<0.1%418 tps2.8sN/A$0$0
View All (223 models)