Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

694
Kimi Dev 72B
649
Moonshot V1 128k Vision
646
UI-TARS 1.5 7B
633
OpenHands LM 32B V0.1
626
Phi 4 Mini Reasoning
624
Qwen 2.5 VL 3B Instruct
596
QwQ 32B RpR v1
569
Phi 3.5 Mini 128k Instruct
508
LFM2.5 1.2B Thinking
344
Mistral Nemo 12B Inferor v0.0

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
401419Kimi Dev 72B694±198802.2%<0.1%17 tps13.5s131K$0.12$0.47
402274Moonshot V1 128k Vision649±315156.4%3.1%44 tps3.8s131K$2.00$5.00
403289UI-TARS 1.5 7B646±179154.2%4.0%75 tps0.9s128K$0.10$0.20
404430OpenHands LM 32B V0.1633±71.8K0.8%<0.1%11 tpsN/A16K$2.60$3.40
405291Phi 4 Mini Reasoning626±84.4K4.9%9.7%30 tps0.9s128K$0.07$0.30
406288Qwen 2.5 VL 3B Instruct624±172.2K7.8%3.0%44 tps2.5s128K$0.21$0.63
407434QwQ 32B RpR v1596±91.9K4.1%<0.1%34 tps3.3s33K$0.02$0.07
408430Phi 3.5 Mini 128k Instruct569±137452.6%<0.1%14 tps0.7s128K$0.10$0.10
409291LFM2.5 1.2B Thinking508±265455.2%2.6%258 tps0.4s33K$0$0
410439Mistral Nemo 12B Inferor v0.0344±72.2K0.9%<0.1%83 tps0.8s16K$0.80$1.20
View All (410 models)