Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

389
MiniMax M1 (Extended)
413
Mistral Nemo 12B Inferor v0.0
472
ArliAI: QwQ 32B RpR v1
612
QwQ 32B RpR v1
625
DeepSeek-R1 Distill Qwen 1.5B
629
Qwen 2.5 VL 3B Instruct
630
LFM2.5 1.2B Thinking
635
Phi 4 Mini Reasoning
639
OpenHands LM 32B V0.1
646
Phi 3.5 Mini 128k Instruct
667
UI-TARS 1.5 7B
678
DeepSeek-R1 Distill Llama 8B
688
MiniMax M1
697
Phi 4 Reasoning
721
DeepSeek-R1 Distill Qwen 7B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1439MiniMax M1 (Extended)389±234951.0%<0.1%3 tpsN/A128K$0$0
2439Mistral Nemo 12B Inferor v0.0413±92.5K1.2%<0.1%83 tps0.8s16K$0.80$1.20
3438ArliAI: QwQ 32B RpR v1472±204857.6%<0.1%20 tps2.5s33K$0$0
4434QwQ 32B RpR v1612±102.4K7.0%<0.1%34 tps3.3s33K$0.02$0.07
5430DeepSeek-R1 Distill Qwen 1.5B625±111.5K3.9%<0.1%20 tps0.0s131K$0.18$0.18
6288Qwen 2.5 VL 3B Instruct629±75.1K6.8%3.0%44 tps2.5s128K$0.21$0.63
7291LFM2.5 1.2B Thinking630±227054.7%2.6%258 tps0.4s33K$0$0
8291Phi 4 Mini Reasoning635±47.9K7.9%9.7%30 tps0.9s128K$0.07$0.30
9430OpenHands LM 32B V0.1639±101.9K1.0%<0.1%11 tpsN/A16K$2.60$3.40
10430Phi 3.5 Mini 128k Instruct646±138352.9%<0.1%14 tps0.7s128K$0.10$0.10
11289UI-TARS 1.5 7B667±181.4K8.7%4.0%75 tps0.9s128K$0.10$0.20
12428DeepSeek-R1 Distill Llama 8B678±92.1K3.4%<0.1%17 tpsN/A32K$0.04$0.04
13284MiniMax M1688±47.8K4.0%<0.1%31 tps2.8s1M$0.55$2.20
14287Phi 4 Reasoning697±84K3.5%21.0%29 tps1.0s33K$0.06$0.25
15424DeepSeek-R1 Distill Qwen 7B721±91.1K3.4%<0.1%0 tpsN/A131K$0.05$0.10
16419Kimi Dev 72B735±101.1K3.9%<0.1%17 tps13.5s131K$0.12$0.47
17285Hunyuan A13B Instruct739±54.6K5.0%2.3%67 tps2.0s33K$0.01$0.01
18424ERNIE 4.5 0.3B741±131.5K8.5%<0.1%85 tps2.2s120K$0$0
19274Pixtral 12B755±124.6K5.7%2.2%101 tps1.2s131K$0.08$0.08
20412ArliAI QwQ 32B Arliai RpR V1764±111.1K6.6%<0.1%34 tps1.8s33K$0.02$0.07
21421Llema 7B766±34.7K1.4%<0.1%1 tps15.0s4K$0.80$1.20
22285Phi 4 Mini Instruct778±54.1K3.2%7.4%40 tps1.1s128K$0.07$0.30
23274Moonshot V1 128k Vision779±129555.0%3.1%44 tps3.8s131K$2.00$5.00
24399Magistral Medium (Thinking)780±63.7K3.7%<0.1%67 tps0.8s41K$2.00$5.00
25399Gemini 1.5 Flash 8B783±71.4K4.5%<0.1%11 tps0.0s1M$0.02$0.10
26392Phi 4 Reasoning Plus784±136506.5%<0.1%32 tps1.2s33K$0.04$0.17
27274MiniMax M2-her791±111.1K2.2%<0.1%108 tps0.7s205K$0.30$1.20
28281Goliath 120B794±53.1K2.5%2.7%21 tps2.2s6K$6.56$9.38
29274C4AI Aya Expanse 8B798±121.2K6.5%0.9%61 tps0.4s8K$0.50$1.50
30412Shisa V2 Llama 3.3 70B798±91.4K6.5%<0.1%8 tps2.0s33K$0.03$0.09
31406DeepSeek-R1 Distill Qwen 14B802±53.6K3.7%<0.1%44 tps1.7s64K$0.63$0.63
32274DeepSeek-R1 Distill Qwen 32B807±64K3.3%6.2%22 tps1.8s131K$0.37$0.39
33281MythoMax L2 13B814±49.8K2.5%1.2%22 tps1.1s4K$0.18$0.18
34412Dolphin 3.0 R1 Mistral 24B819±62.5K4.5%<0.1%13 tps0.1s33K$0.03$0.09
35412Dolphin 2.9.2 Mixtral 8x22B823±35.8K1.0%<0.1%20 tps1.5s16K$0.90$0.90
36406Command823±53.6K2.0%<0.1%25 tpsN/A4K$0.83$1.33
37374Cogito V2 671B825±102.3K4.2%<0.1%41 tps0.6s164K$1.25$1.25
38399Phi 3 Medium 128k Instruct827±91.1K3.1%<0.1%40 tps1.3s128K$0.58$0.84
39274LFM2 8B A1B831±72.3K6.7%<0.1%142 tps0.3s33K$0.01$0.02
40281Gemma 2 9B831±91.6K3.7%<0.1%100 tps0.4s8K$0.09$0.09
View All (432 models)