Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

850
Hermes 4 405B FP8
844
Jamba 1.5 Large
844
Hermes 4 405B Reasoning FP8
844
Magistral Small 2509
840
Llama 3 8B
837
GLM 4.6V Flash
832
Krutrim 2
828
Command R+
828
Pixtral 12B
827
Gemma 3 4B
823
Qwen 2.5 7B
818
Open Mistral Nemo
815
Mixtral 8x7B Instruct
808
Command R 7B
807
GPT-3.5 Turbo 16k

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
201240Hermes 4 405B FP8850±1650013.0%3.5%31 tps0.9s131K$0.52$1.73
202222Jamba 1.5 Large844±1391512.0%1.7%48 tps0.9s256K$1.50$6.00
203260Hermes 4 405B Reasoning FP8844±122.1K18.8%3.6%32 tps0.8s131K$1.00$3.00
204265Magistral Small 2509844±267458.0%2.7%116 tps0.6s131K$0.50$1.50
205201Llama 3 8B840±1983515.7%6.0%85 tps0.7s8K$0.12$0.16
206186GLM 4.6V Flash837±92K9.0%3.7%64 tps2.1s128K$0.04$0.40
207214Krutrim 2832±145602.6%12.5%33 tps2.1s128K$1.00$1.00
208235Command R+828±2166510.1%2.8%36 tps0.7s128K$2.08$9.45
209274Pixtral 12B828±112.2K6.8%2.2%101 tps1.2s131K$0.08$0.08
210235Gemma 3 4B827±111.2K9.9%1.3%138 tps0.7s131K$0.02$0.04
211214Qwen 2.5 7B823±1568511.6%3.7%40 tps1.9s131K$0.08$0.27
212225Open Mistral Nemo818±2459012.6%1.5%171 tps0.5s131K$0.15$0.15
213256Mixtral 8x7B Instruct815±2851012.1%0.2%79 tps0.7s33K$0.23$0.31
214225Command R 7B808±131.3K12.9%1.1%76 tps0.4s128K$0.04$0.15
215225GPT-3.5 Turbo 16k807±111.1K10.6%<0.1%22 tps0.6s16K$3.00$4.00
216235Mixtral 8x7B801±2952512.5%2.2%142 tps0.6s33K$0.23$0.23
217260Mistral Small799±2346512.3%1.7%142 tps0.6s32K$0.43$1.30
218246WizardLM-2 8x22B795±185157.2%11.6%11 tps2.5s66K$0.77$0.77
219271Hermes 3 405B Instruct792±2149010.1%2.3%20 tps1.1s131K$0.80$0.80
220265LFM2 2.6B787±1754515.5%6.7%184 tps0.4s33K$0.01$0.02
221256Phi 4780±1756511.7%5.1%28 tps1.3s128K$0.10$0.32
222246Ministral 3B772±1778512.3%0.8%248 tps0.4s131K$0.08$0.08
223260Open Mistral 7B770±2851012.8%0.7%176 tps0.4s33K$0.25$0.25
224284MiniMax M1769±161.1K17.9%<0.1%31 tps2.8s1M$0.55$2.20
225256Gemma 3 1B758±2396513.1%0.6%176 tps1.0s33K$0.06$0.10
226201Mistral Small 24B Instruct752±2248015.0%1.5%84 tps0.4s33K$0.80$0.80
227240GPT-3.5 Turbo Instruct751±2153010.9%<0.1%46 tps1.2s4K$1.50$2.00
228271Inflection 3 Pi738±2154511.4%1.1%33 tps3.4s8K$2.50$10.00
229253Gemma 2 27B727±1558511.4%1.4%44 tps1.4s8K$0.80$0.80
230274LFM2 8B A1B726±2657017.4%<0.1%142 tps0.3s33K$0.01$0.02
231265Inflection 3 Productivity713±1660011.8%0.6%50 tps3.2s8K$2.50$10.00
232271Mistral Large706±1750012.3%1.5%54 tps0.7s33K$2.00$6.00
233265Mixtral-8x7B Instruct v0.1675±3946512.3%1.3%54 tps0.4s33K$0.60$0.60
234288Qwen 2.5 VL 3B Instruct670±122.5K9.2%3.0%44 tps2.5s128K$0.21$0.63
235281MythoMax L2 13B569±3278515.1%1.2%22 tps1.1s4K$0.18$0.18
236285Hunyuan A13B Instruct485±2765021.2%2.3%67 tps2.0s33K$0.01$0.01
237291Phi 4 Mini Reasoning475±121.9K22.5%9.7%30 tps0.9s128K$0.07$0.30
View All (237 models)