Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

931
C4AI Aya Expanse 32B
931
Grok 3 Mini
933
Krutrim 2
934
Llama 3 70B
935
Mistral Small 24B Instruct
936
ERNIE 4.5 VL 424B A47B
936
Switchpoint Router
937
GLM 4.6V Flash
941
Llama 3 8B
941
Qwen 2.5 14B Instruct
943
Llama 3.3 70B
944
Qwen 2.5 7B Turbo
945
Magistral Small 2506
947
ERNIE 4.5 21B A3B Thinking
949
GLM 4.7 Flash

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81214C4AI Aya Expanse 32B931±317.1K0.8%1.5%43 tps0.5s128K$0.50$1.50
82186Grok 3 Mini931±423K1.4%1.2%43 tps0.5s131K$0.30$0.50
83214Krutrim 2933±310.7K0.7%12.5%33 tps2.1s128K$1.00$1.00
84194Llama 3 70B934±71.7K1.1%4.5%21 tps1.7s8K$1.08$1.38
85201Mistral Small 24B Instruct935±46.3K1.2%1.5%84 tps0.4s33K$0.80$0.80
86201ERNIE 4.5 VL 424B A47B936±128055.3%4.9%36 tps3.5s123K$0.42$1.25
87179Switchpoint Router936±48.2K1.0%1.7%71 tps4.9s131K$0.85$3.40
88186GLM 4.6V Flash937±45.8K2.0%3.7%64 tps2.1s128K$0.04$0.40
89201Llama 3 8B941±312.1K0.9%6.0%85 tps0.7s8K$0.12$0.16
90209Qwen 2.5 14B Instruct941±68.3K1.4%2.4%40 tps1.6s1M$0.40$1.61
91194Llama 3.3 70B943±49.1K2.7%0.3%500 tps0.5s8K$0.48$0.66
92201Qwen 2.5 7B Turbo944±92.4K1.5%0.5%125 tps0.4s131K$0.30$0.30
93194Magistral Small 2506945±415.6K1.0%1.6%156 tps0.5s40K$0.37$1.10
94229ERNIE 4.5 21B A3B Thinking947±91.7K2.3%1.8%87 tps1.5s120K$0.07$0.28
95179GLM 4.7 Flash949±64K1.9%5.8%61 tps2.8s128K$0.07$0.39
96153OpenAI o1954±43.9K1.4%4.2%92 tps5.5s200K$15.00$60.00
97194Llama 3.2 11B Instruct955±49.2K1.0%1.5%152 tps0.5s8K$0.16$0.16
98194Mistral Small 3 24B Instruct955±47.2K0.9%2.6%77 tps0.6s33K$0.07$0.14
99186Jamba 1.6 Large957±315.3K0.9%2.0%59 tps1.2s256K$1.33$5.33
100177Mistral Small 3.1 24B Instruct962±410.6K1.3%7.5%15 tps2.4s131K$0.06$0.18
101186Gemma 3n E4B965±422.2K1.2%2.0%30 tps0.5s8K$0.01$0.02
102179Inception Mercury967±324.4K0.9%0.4%257 tps1.1s32K$0.25$1.00
103179Llama 3.1 70B Instruct969±158451.7%6.3%30 tps0.8s128K$0.17$0.22
104179Qwen 2.5 72B970±45.3K1.0%1.2%96 tps1.2s131K$0.14$0.26
105179Amazon Nova Pro 1.0971±222.3K0.8%0.9%96 tps0.7s300K$0.80$1.70
106246DeepSeek-R1 Distill Llama 70B973±112.1K3.0%3.6%27 tps1.6s32K$0.73$0.95
107170Mistral Small 3.2 24B986±313K1.1%2.8%141 tps0.7s33K$0.02$0.08
108157Cogito v2.1 671B986±43.6K1.2%0.8%85 tps0.5s128K$1.25$1.25
10948Claude Sonnet 4 (Thinking)986±38.7K1.8%1.5%52 tps1.5s200K$3.00$13.67
110170Kimi K2 0711987±319K1.1%1.6%29 tps1.3s131K$0.72$2.60
111161Llama 4 Maverick987±259.4K1.5%1.2%88 tps2.4s1M$0.23$0.83
112157GPT-5 Nano989±36.6K3.1%3.2%113 tps20.9s400K$0.05$0.40
113170Llama 3.1 8B Turbo989±57.3K1.5%2.1%650 tps0.5s128K$0.13$0.14
114186Gemma 3 27B990±63.1K1.8%1.8%35 tps1.1s66K$0.06$0.10
115160Llama 4 Scout990±255K1.5%0.6%88 tps5.1s131K$0.18$0.46
116170Devstral Medium992±410.6K0.9%1.5%77 tps0.6s131K$0.40$2.00
117143Gemini 2.0 Flash Lite994±355.3K2.8%<0.1%42 tps0.5s1M$0.08$0.30
118186Jamba 1.7 Large994±92.1K2.5%1.3%58 tps1.0s256K$1.33$5.33
119165Pixtral Large997±57.6K1.8%2.5%57 tps1.3s128K$1.50$4.50
120165DeepSeek R1T2 Chimera998±55.6K1.7%3.0%28 tps1.8s164K$0.13$0.45
View All (283 models)