Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

329
Seed Coder 8B Reasoning
406
QwQ 32B RpR v1
447
Phi 4 Mini Reasoning
454
Mistral Nemo 12B Inferor v0.0
463
CodeLlama 7B Instruct Solidity
481
DeepSeek-R1 Distill Qwen 1.5B
523
Qwen 2.5 VL 3B Instruct
573
Phi 4 Reasoning
588
Hunyuan A13B Instruct
595
DeepSeek-R1 Distill Llama 8B
599
Phi 4 Mini Instruct
600
MythoMax L2 13B
601
Llema 7B
602
ERNIE 4.5 0.3B
610
UI-TARS 1.5 7B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1404Seed Coder 8B Reasoning329±417004.1%<0.1%25 tpsN/A32K$0.99$0.99
2402QwQ 32B RpR v1406±351K10.9%<0.1%34 tps3.3s33K$0.02$0.07
3286Phi 4 Mini Reasoning447±153.4K12.0%9.7%30 tps0.9s128K$0.07$0.30
4399Mistral Nemo 12B Inferor v0.0454±285651.7%<0.1%83 tps0.8s16K$0.80$1.20
5284CodeLlama 7B Instruct Solidity463±544858.5%3.6%33 tps0.7s16K$0.80$1.20
6399DeepSeek-R1 Distill Qwen 1.5B481±197305.2%<0.1%20 tps0.0s131K$0.18$0.18
7284Qwen 2.5 VL 3B Instruct523±254.1K6.1%3.0%44 tps2.5s128K$0.21$0.63
8279Phi 4 Reasoning573±172.1K5.5%21.0%29 tps1.0s33K$0.06$0.25
9279Hunyuan A13B Instruct588±221.6K9.2%2.3%67 tps2.0s33K$0.01$0.01
10390DeepSeek-R1 Distill Llama 8B595±191.2K5.6%<0.1%17 tpsN/A32K$0.04$0.04
11279Phi 4 Mini Instruct599±211K7.1%7.4%40 tps1.1s128K$0.07$0.30
12279MythoMax L2 13B600±212.3K5.8%1.2%22 tps1.1s4K$0.18$0.18
13390Llema 7B601±218504.5%<0.1%1 tps15.0s4K$0.80$1.20
14390ERNIE 4.5 0.3B602±4068511.0%<0.1%85 tps2.2s120K$0$0
15279UI-TARS 1.5 7B610±4053011.7%4.0%75 tps0.9s128K$0.10$0.20
16386Shisa V2 Llama 3.3 70B623±245859.3%<0.1%8 tps2.0s33K$0.03$0.09
17386DeepSeek-R1 Distill Qwen 7B633±195655.0%<0.1%0 tpsN/A131K$0.05$0.10
18386Dolphin 2.9.2 Mixtral 8x22B652±191.1K2.6%<0.1%20 tps1.5s16K$0.90$0.90
19386MiMo 7B RL655±131.2K3.5%<0.1%31 tps0.4s32K$0.49$0.49
20276MiniMax M1686±133.8K5.3%<0.1%31 tps2.8s1M$0.55$2.20
21383ArliAI QwQ 32B Arliai RpR V1686±406359.3%<0.1%34 tps1.8s33K$0.02$0.07
22276DeepSeek-R1 Distill Qwen 32B696±202K5.5%6.2%22 tps1.8s131K$0.37$0.39
23374Phi 4 Multimodal Instruct697±162.1K6.8%<0.1%17 tps1.4s128K$0.03$0.05
24374Dolphin 3.0 R1 Mistral 24B701±168907.8%<0.1%13 tps0.1s33K$0.03$0.09
25276Hermes 3 405B Instruct702±201.4K4.1%2.3%20 tps1.1s131K$0.80$0.80
26269DeepHermes 3 Mistral 24B Preview706±307155.9%2.5%50 tps1.0s33K$0.06$0.25
27374Mythalion 13B709±101.1K1.3%<0.1%63 tps0.5s4K$0.56$1.13
28269Inflection 3 Pi719±181.5K4.1%1.1%33 tps3.4s8K$2.50$10.00
29374Solar Pro 250422720±195306.2%<0.1%13 tps0.6s33K$0$0
30269Pixtral 12B722±213K6.3%2.2%101 tps1.2s131K$0.08$0.08
31374Mistral Nemo 12B Celeste V1.9725±181.1K3.5%<0.1%6 tps10.2s8K$0.80$1.20
32361Zenith730±288509.6%<0.1%36 tps1.8s131K$0$0
33361Meridian734±399659.8%<0.1%92 tps1.2s131K$0$0
34361Command734±187654.4%<0.1%25 tpsN/A4K$0.83$1.33
35269Inflection 3 Productivity737±241.5K5.0%0.6%50 tps3.2s8K$2.50$10.00
36269Command R+738±151.6K5.6%2.8%36 tps0.7s128K$2.08$9.45
37269Mixtral 8x22B Instruct738±171.4K5.6%1.8%142 tps0.7s66K$0.45$0.45
38269Gemma 3 4B742±103.3K4.7%1.3%138 tps0.7s131K$0.02$0.04
39262Qwen 2.5 VL 72B Instruct746±202.1K6.0%5.3%25 tps3.7s128K$1.01$2.79
40361Seed Coder 8B Instruct751±226052.4%<0.1%35 tpsN/A32K$0.99$0.99
View All (404 models)