Last updated about 1 month ago

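| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 121 | 265 | Qwen 2.5 VL 72B Instruct | 929 | ±12 | 1.2K | 7.9% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 122 | 209 | Qwen 2.5 14B Instruct | 928 | ±13 | 910 | 11.7% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 |
| 123 | 133 | DeepSeek V3.2 Speciale | 924 | ±12 | 1.6K | 6.3% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 124 | 201 | Devstral Small | 924 | ±16 | 570 | 12.3% | 2.4% | 180 tps | 0.6s | 131K | $0.10 | $0.30 |
| 125 | 194 | Magistral Small 2506 | 920 | ±15 | 2K | 6.9% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 |
| 126 | 179 | Inception Mercury | 919 | ±10 | 2.8K | 11.5% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 |
| 127 | 148 | Qwen3 30B A3B Thinking 2507 | 919 | ±10 | 1.3K | 3.6% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 128 | 229 | Magistral Medium 2509 | 918 | ±8 | 2.1K | 11.3% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 129 | 179 | Amazon Nova Pro 1.0 | 916 | ±16 | 2.1K | 10.3% | 0.9% | 96 tps | 0.7s | 300K | $0.80 | $1.70 |
| 130 | 186 | Mistral Small 3.2 24B Instruct | 915 | ±22 | 525 | 9.5% | 1.9% | 113 tps | 1.1s | 131K | $0.02 | $0.08 |
| 131 | 160 | Llama 4 Scout | 911 | ±6 | 8K | 9.6% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 132 | 179 | Baichuan-M2-32B | 911 | ±25 | 505 | 13.7% | <0.1% | 32 tps | 3.3s | 131K | $0.07 | $0.07 |
| 133 | 170 | Kimi K2 0711 | 911 | ±8 | 3.2K | 9.2% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 134 | 170 | Mistral Small 3.2 24B | 911 | ±10 | 2K | 12.4% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 135 | 214 | OpenAI o3-mini-high | 908 | ±14 | 1.1K | 6.5% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 136 | 186 | Gemma 3n E4B | 905 | ±10 | 2.6K | 8.4% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 |
| 137 | 201 | ERNIE 4.5 VL 424B A47B | 905 | ±12 | 805 | 7.5% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 |
| 138 | 209 | Llama 3.3 Swallow 70B Instruct | 904 | ±8 | 1.6K | 15.2% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 139 | 186 | Grok 3 Mini | 903 | ±5 | 6K | 12.8% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 140 | 157 | Qwen3 Next 80B A3B Thinking | 903 | ±7 | 4.9K | 11.2% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 141 | 161 | Qwen3 8B | 902 | ±12 | 2K | 17.8% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 142 | 186 | Grok 3 Mini Fast | 897 | ±7 | 5.2K | 14.9% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 143 | 186 | Jamba 1.6 Large | 895 | ±11 | 880 | 9.7% | 2.0% | 59 tps | 1.2s | 256K | $1.33 | $5.33 |
| 144 | 194 | Llama 3.3 70B | 893 | ±10 | 2.2K | 9.2% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 145 | 235 | GLM 4 32B | 893 | ±12 | 1.2K | 11.1% | 2.6% | 40 tps | 1.6s | 33K | $0.14 | $0.14 |
| 146 | 179 | GLM 4.7 Flash | 887 | ±13 | 845 | 2.9% | 5.8% | 61 tps | 2.8s | 128K | $0.07 | $0.39 |
| 147 | 246 | Mixtral 8x22B | 880 | ±23 | 465 | 12.3% | 1.2% | 140 tps | 0.6s | 64K | $2.00 | $6.00 |
| 148 | 186 | Jamba 1.7 Large | 877 | ±17 | 620 | 15.1% | 1.3% | 58 tps | 1.0s | 256K | $1.33 | $5.33 |
| 149 | 214 | Gemma 3 12B | 874 | ±15 | 910 | 11.2% | 4.2% | 73 tps | 0.8s | 131K | $0.05 | $0.12 |
| 150 | 165 | Qwen3 4B | 871 | ±7 | 3.3K | 16.3% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 |
| 151 | 256 | Solar Mini 250422 | 869 | ±15 | 640 | 16.3% | 1.8% | 90 tps | 1.7s | 33K | $0.15 | $0.15 |
| 152 | 274 | Moonshot V1 128k Vision | 862 | ±17 | 585 | 7.1% | 3.1% | 44 tps | 3.8s | 131K | $2.00 | $5.00 |
| 153 | 179 | Qwen 2.5 72B | 854 | ±26 | 535 | 11.6% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 154 | 222 | Rnj-1 Instruct | 853 | ±24 | 590 | 7.8% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 |
| 155 | 240 | Hermes 4 405B FP8 | 850 | ±16 | 500 | 13.0% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 |
| 156 | 222 | Jamba 1.5 Large | 844 | ±13 | 915 | 12.0% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 157 | 265 | Magistral Small 2509 | 844 | ±26 | 745 | 8.0% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 158 | 201 | Llama 3 8B | 840 | ±19 | 835 | 15.7% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 159 | 214 | Krutrim 2 | 832 | ±14 | 560 | 2.6% | 12.5% | 33 tps | 2.1s | 128K | $1.00 | $1.00 |
| 160 | 225 | GPT-3.5 Turbo 16k | 807 | ±11 | 1.1K | 10.6% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |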
Showing ranks 121 to 160 of 170 models.