Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

807
DeepSeek-R1 Distill Qwen 32B
802
DeepSeek-R1 Distill Qwen 14B
798
Shisa V2 Llama 3.3 70B
798
C4AI Aya Expanse 8B
794
Goliath 120B
791
MiniMax M2-her
784
Phi 4 Reasoning Plus
783
Gemini 1.5 Flash 8B
780
Magistral Medium (Thinking)
779
Moonshot V1 128k Vision
778
Phi 4 Mini Instruct
766
Llema 7B
764
ArliAI QwQ 32B Arliai RpR V1
755
Pixtral 12B
741
ERNIE 4.5 0.3B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
401274DeepSeek-R1 Distill Qwen 32B807±64K3.3%6.2%22 tps1.8s131K$0.37$0.39
402406DeepSeek-R1 Distill Qwen 14B802±53.6K3.7%<0.1%44 tps1.7s64K$0.63$0.63
403412Shisa V2 Llama 3.3 70B798±91.4K6.5%<0.1%8 tps2.0s33K$0.03$0.09
404274C4AI Aya Expanse 8B798±121.2K6.5%0.9%61 tps0.4s8K$0.50$1.50
405281Goliath 120B794±53.1K2.5%2.7%21 tps2.2s6K$6.56$9.38
406274MiniMax M2-her791±111.1K2.2%<0.1%108 tps0.7s205K$0.30$1.20
407392Phi 4 Reasoning Plus784±136506.5%<0.1%32 tps1.2s33K$0.04$0.17
408399Gemini 1.5 Flash 8B783±71.4K4.5%<0.1%11 tps0.0s1M$0.02$0.10
409399Magistral Medium (Thinking)780±63.7K3.7%<0.1%67 tps0.8s41K$2.00$5.00
410274Moonshot V1 128k Vision779±129555.0%3.1%44 tps3.8s131K$2.00$5.00
411285Phi 4 Mini Instruct778±54.1K3.2%7.4%40 tps1.1s128K$0.07$0.30
412421Llema 7B766±34.7K1.4%<0.1%1 tps15.0s4K$0.80$1.20
413412ArliAI QwQ 32B Arliai RpR V1764±111.1K6.6%<0.1%34 tps1.8s33K$0.02$0.07
414274Pixtral 12B755±124.6K5.7%2.2%101 tps1.2s131K$0.08$0.08
415424ERNIE 4.5 0.3B741±131.5K8.5%<0.1%85 tps2.2s120K$0$0
416285Hunyuan A13B Instruct739±54.6K5.0%2.3%67 tps2.0s33K$0.01$0.01
417419Kimi Dev 72B735±101.1K3.9%<0.1%17 tps13.5s131K$0.12$0.47
418424DeepSeek-R1 Distill Qwen 7B721±91.1K3.4%<0.1%0 tpsN/A131K$0.05$0.10
419287Phi 4 Reasoning697±84K3.5%21.0%29 tps1.0s33K$0.06$0.25
420284MiniMax M1688±47.8K4.0%<0.1%31 tps2.8s1M$0.55$2.20
421428DeepSeek-R1 Distill Llama 8B678±92.1K3.4%<0.1%17 tpsN/A32K$0.04$0.04
422289UI-TARS 1.5 7B667±181.4K8.7%4.0%75 tps0.9s128K$0.10$0.20
423430Phi 3.5 Mini 128k Instruct646±138352.9%<0.1%14 tps0.7s128K$0.10$0.10
424430OpenHands LM 32B V0.1639±101.9K1.0%<0.1%11 tpsN/A16K$2.60$3.40
425291Phi 4 Mini Reasoning635±47.9K7.9%9.7%30 tps0.9s128K$0.07$0.30
426291LFM2.5 1.2B Thinking630±227054.7%2.6%258 tps0.4s33K$0$0
427288Qwen 2.5 VL 3B Instruct629±75.1K6.8%3.0%44 tps2.5s128K$0.21$0.63
428430DeepSeek-R1 Distill Qwen 1.5B625±111.5K3.9%<0.1%20 tps0.0s131K$0.18$0.18
429434QwQ 32B RpR v1612±102.4K7.0%<0.1%34 tps3.3s33K$0.02$0.07
430438ArliAI: QwQ 32B RpR v1472±204857.6%<0.1%20 tps2.5s33K$0$0
431439Mistral Nemo 12B Inferor v0.0413±92.5K1.2%<0.1%83 tps0.8s16K$0.80$1.20
432439MiniMax M1 (Extended)389±234951.0%<0.1%3 tpsN/A128K$0$0
View All (432 models)