Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

344
Mistral Nemo 12B Inferor v0.0
508
LFM2.5 1.2B Thinking
569
Phi 3.5 Mini 128k Instruct
596
QwQ 32B RpR v1
624
Qwen 2.5 VL 3B Instruct
626
Phi 4 Mini Reasoning
633
OpenHands LM 32B V0.1
646
UI-TARS 1.5 7B
649
Moonshot V1 128k Vision
694
Kimi Dev 72B
707
Phi 4 Reasoning
741
Goliath 120B
742
Llema 7B
744
ERNIE 4.5 0.3B
748
Pixtral 12B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1439Mistral Nemo 12B Inferor v0.0344±72.2K0.9%<0.1%83 tps0.8s16K$0.80$1.20
2291LFM2.5 1.2B Thinking508±265455.2%2.6%258 tps0.4s33K$0$0
3430Phi 3.5 Mini 128k Instruct569±137452.6%<0.1%14 tps0.7s128K$0.10$0.10
4434QwQ 32B RpR v1596±91.9K4.1%<0.1%34 tps3.3s33K$0.02$0.07
5288Qwen 2.5 VL 3B Instruct624±172.2K7.8%3.0%44 tps2.5s128K$0.21$0.63
6291Phi 4 Mini Reasoning626±84.4K4.9%9.7%30 tps0.9s128K$0.07$0.30
7430OpenHands LM 32B V0.1633±71.8K0.8%<0.1%11 tpsN/A16K$2.60$3.40
8289UI-TARS 1.5 7B646±179154.2%4.0%75 tps0.9s128K$0.10$0.20
9274Moonshot V1 128k Vision649±315156.4%3.1%44 tps3.8s131K$2.00$5.00
10419Kimi Dev 72B694±198802.2%<0.1%17 tps13.5s131K$0.12$0.47
11287Phi 4 Reasoning707±101K2.8%21.0%29 tps1.0s33K$0.06$0.25
12281Goliath 120B741±92.6K2.1%2.7%21 tps2.2s6K$6.56$9.38
13421Llema 7B742±44.3K1.7%<0.1%1 tps15.0s4K$0.80$1.20
14424ERNIE 4.5 0.3B744±101.1K5.7%<0.1%85 tps2.2s120K$0$0
15274Pixtral 12B748±152.3K5.1%2.2%101 tps1.2s131K$0.08$0.08
16274LFM2 8B A1B757±91.9K4.8%<0.1%142 tps0.3s33K$0.01$0.02
17274MiniMax M2-her767±118801.7%<0.1%108 tps0.7s205K$0.30$1.20
18285Phi 4 Mini Instruct774±53.7K2.0%7.4%40 tps1.1s128K$0.07$0.30
19285Hunyuan A13B Instruct777±54.1K2.4%2.3%67 tps2.0s33K$0.01$0.01
20274C4AI Aya Expanse 8B778±158852.7%0.9%61 tps0.4s8K$0.50$1.50
21281Gemma 2 9B779±71.3K3.6%<0.1%100 tps0.4s8K$0.09$0.09
22265Magistral Small 2509786±111.7K3.7%2.7%116 tps0.6s131K$0.50$1.50
23399Phi 3 Medium 128k Instruct789±108952.2%<0.1%40 tps1.3s128K$0.58$0.84
24412Shisa V2 Llama 3.3 70B790±91.1K2.2%<0.1%8 tps2.0s33K$0.03$0.09
25412Dolphin 2.9.2 Mixtral 8x22B791±45.4K0.8%<0.1%20 tps1.5s16K$0.90$0.90
26412Dolphin 3.0 R1 Mistral 24B795±72.4K1.4%<0.1%13 tps0.1s33K$0.03$0.09
27281MythoMax L2 13B796±49K1.6%1.2%22 tps1.1s4K$0.18$0.18
28412ArliAI QwQ 32B Arliai RpR V1799±166555.1%<0.1%34 tps1.8s33K$0.02$0.07
29265Qwen 2.5 VL 72B Instruct808±121.3K5.5%5.3%25 tps3.7s128K$1.01$2.79
30271Hermes 3 405B Instruct814±45.5K1.2%2.3%20 tps1.1s131K$0.80$0.80
31265LFM2 2.6B816±81.7K5.2%6.7%184 tps0.4s33K$0.01$0.02
32271Mistral Large818±64.3K1.6%1.5%54 tps0.7s33K$2.00$6.00
33406Solar Pro 250422820±71.2K1.2%<0.1%13 tps0.6s33K$0$0
34406Command821±63.1K1.6%<0.1%25 tpsN/A4K$0.83$1.33
35392Phi 3 Mini 128k Instruct821±91K2.4%<0.1%16 tps0.5s128K$0.12$0.31
36274DeepHermes 3 Mistral 24B Preview824±99953.4%2.5%50 tps1.0s33K$0.06$0.25
37392Mistral Nemo 12B Celeste V1.9825±45.6K1.1%<0.1%6 tps10.2s8K$0.80$1.20
38392MiMo 7B RL825±36.7K1.1%<0.1%31 tps0.4s32K$0.49$0.49
39260Hermes 4 405B Reasoning FP8826±45K3.5%3.6%32 tps0.8s131K$1.00$3.00
40361Magistral Small 2507830±81.4K4.1%<0.1%148 tps0.4s41K$0.50$1.50
View All (410 models)