
More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.


Last updated about 1 month ago

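| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 291 | Phi 4 Mini Reasoning | 475 | ±12 | 1.9K | 22.5% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |
| 2 | 285 | Hunyuan A13B Instruct | 485 | ±27 | 650 | 21.2% | 2.3% | 67 tps | 2.0s | 33K | $0.01 | $0.01 |
| 3 | 281 | MythoMax L2 13B | 569 | ±32 | 785 | 15.1% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |
| 4 | 288 | Qwen 2.5 VL 3B Instruct | 670 | ±12 | 2.5K | 9.2% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |
| 5 | 265 | Mixtral-8x7B Instruct v0.1 | 675 | ±39 | 465 | 12.3% | 1.3% | 54 tps | 0.4s | 33K | $0.60 | $0.60 |
| 6 | 271 | Mistral Large | 706 | ±17 | 500 | 12.3% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 |
| 7 | 265 | Inflection 3 Productivity | 713 | ±16 | 600 | 11.8% | 0.6% | 50 tps | 3.2s | 8K | $2.50 | $10.00 |
| 8 | 274 | LFM2 8B A1B | 726 | ±26 | 570 | 17.4% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 |
| 9 | 253 | Gemma 2 27B | 727 | ±15 | 585 | 11.4% | 1.4% | 44 tps | 1.4s | 8K | $0.80 | $0.80 |
| 10 | 271 | Inflection 3 Pi | 738 | ±21 | 545 | 11.4% | 1.1% | 33 tps | 3.4s | 8K | $2.50 | $10.00 |
| 11 | 240 | GPT-3.5 Turbo Instruct | 751 | ±21 | 530 | 10.9% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 |
| 12 | 201 | Mistral Small 24B Instruct | 752 | ±22 | 480 | 15.0% | 1.5% | 84 tps | 0.4s | 33K | $0.80 | $0.80 |
| 13 | 256 | Gemma 3 1B | 758 | ±23 | 965 | 13.1% | 0.6% | 176 tps | 1.0s | 33K | $0.06 | $0.10 |
| 14 | 284 | MiniMax M1 | 769 | ±16 | 1.1K | 17.9% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 |
| 15 | 260 | Open Mistral 7B | 770 | ±28 | 510 | 12.8% | 0.7% | 176 tps | 0.4s | 33K | $0.25 | $0.25 |
| 16 | 246 | Ministral 3B | 772 | ±17 | 785 | 12.3% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 |
| 17 | 256 | Phi 4 | 780 | ±17 | 565 | 11.7% | 5.1% | 28 tps | 1.3s | 128K | $0.10 | $0.32 |
| 18 | 265 | LFM2 2.6B | 787 | ±17 | 545 | 15.5% | 6.7% | 184 tps | 0.4s | 33K | $0.01 | $0.02 |
| 19 | 271 | Hermes 3 405B Instruct | 792 | ±21 | 490 | 10.1% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 |
| 20 | 246 | WizardLM-2 8x22B | 795 | ±18 | 515 | 7.2% | 11.6% | 11 tps | 2.5s | 66K | $0.77 | $0.77 |
| 21 | 260 | Mistral Small | 799 | ±23 | 465 | 12.3% | 1.7% | 142 tps | 0.6s | 32K | $0.43 | $1.30 |
| 22 | 235 | Mixtral 8x7B | 801 | ±29 | 525 | 12.5% | 2.2% | 142 tps | 0.6s | 33K | $0.23 | $0.23 |
| 23 | 225 | GPT-3.5 Turbo 16k | 807 | ±11 | 1.1K | 10.6% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |
| 24 | 225 | Command R 7B | 808 | ±13 | 1.3K | 12.9% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 |
| 25 | 256 | Mixtral 8x7B Instruct | 815 | ±28 | 510 | 12.1% | 0.2% | 79 tps | 0.7s | 33K | $0.23 | $0.31 |
| 26 | 225 | Open Mistral Nemo | 818 | ±24 | 590 | 12.6% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 |
| 27 | 214 | Qwen 2.5 7B | 823 | ±15 | 685 | 11.6% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 |
| 28 | 235 | Gemma 3 4B | 827 | ±11 | 1.2K | 9.9% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 |
| 29 | 274 | Pixtral 12B | 828 | ±11 | 2.2K | 6.8% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 30 | 235 | Command R+ | 828 | ±21 | 665 | 10.1% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 |
| 31 | 214 | Krutrim 2 | 832 | ±14 | 560 | 2.6% | 12.5% | 33 tps | 2.1s | 128K | $1.00 | $1.00 |
| 32 | 186 | GLM 4.6V Flash | 837 | ±9 | 2K | 9.0% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 |
| 33 | 201 | Llama 3 8B | 840 | ±19 | 835 | 15.7% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 34 | 265 | Magistral Small 2509 | 844 | ±26 | 745 | 8.0% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 35 | 260 | Hermes 4 405B Reasoning FP8 | 844 | ±12 | 2.1K | 18.8% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 36 | 222 | Jamba 1.5 Large | 844 | ±13 | 915 | 12.0% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 37 | 240 | Hermes 4 405B FP8 | 850 | ±16 | 500 | 13.0% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 |
| 38 | 222 | Rnj-1 Instruct | 853 | ±24 | 590 | 7.8% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 |
| 39 | 179 | Qwen 2.5 72B | 854 | ±26 | 535 | 11.6% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 40 | 194 | Mistral Small 3 24B Instruct | 858 | ±18 | 585 | 12.7% | 2.6% | 77 tps | 0.6s | 33K | $0.07 | $0.14 |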
Showing the first 40 of 237 models.
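The VIBE Score and Confidence Interval columns can be read together when comparing two models. The page does not say how the interval is computed, so the sketch below only assumes it is a symmetric margin around the score (score minus margin to score plus margin). The `Entry` class and `intervals_overlap` helper are hypothetical names introduced for illustration; the numbers come from the first three rows of the leaderboard.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row, reduced to the fields needed here (hypothetical type)."""
    name: str
    score: int   # VIBE Score column
    margin: int  # Confidence Interval column: the number after the ± sign

    def bounds(self) -> tuple[int, int]:
        # Assumption: the interval is symmetric around the score.
        return self.score - self.margin, self.score + self.margin


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score intervals share at least one point."""
    a_lo, a_hi = a.bounds()
    b_lo, b_hi = b.bounds()
    return a_lo <= b_hi and b_lo <= a_hi


# Values copied from the first three rows of the leaderboard.
phi = Entry("Phi 4 Mini Reasoning", 475, 12)
hunyuan = Entry("Hunyuan A13B Instruct", 485, 27)
mythomax = Entry("MythoMax L2 13B", 569, 32)

print(intervals_overlap(phi, hunyuan))       # True: 463..487 vs 458..512
print(intervals_overlap(hunyuan, mythomax))  # False: 458..512 vs 537..601
```

Under that assumption, overlapping intervals suggest the order of two adjacent models is not firmly established, while a clear gap (as between the second and third rows) suggests a more reliable difference.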