Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

329
Seed Coder 8B Reasoning
406
QwQ 32B RpR v1
454
Mistral Nemo 12B Inferor v0.0
588
Hunyuan A13B Instruct
595
DeepSeek-R1 Distill Llama 8B
602
ERNIE 4.5 0.3B
610
UI-TARS 1.5 7B
623
Shisa V2 Llama 3.3 70B
652
Dolphin 2.9.2 Mixtral 8x22B
655
MiMo 7B RL
686
MiniMax M1
686
ArliAI QwQ 32B Arliai RpR V1
696
DeepSeek-R1 Distill Qwen 32B
701
Dolphin 3.0 R1 Mistral 24B
706
DeepHermes 3 Mistral 24B Preview

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1404Seed Coder 8B Reasoning329±417004.1%<0.1%25 tpsN/A32K$0.99$0.99
2402QwQ 32B RpR v1406±351K10.9%<0.1%34 tps3.3s33K$0.02$0.07
3399Mistral Nemo 12B Inferor v0.0454±285651.7%<0.1%83 tps0.8s16K$0.80$1.20
4279Hunyuan A13B Instruct588±221.6K9.2%2.3%67 tps2.0s33K$0.01$0.01
5390DeepSeek-R1 Distill Llama 8B595±191.2K5.6%<0.1%17 tpsN/A32K$0.04$0.04
6390ERNIE 4.5 0.3B602±4068511.0%<0.1%85 tps2.2s120K$0$0
7279UI-TARS 1.5 7B610±4053011.7%4.0%75 tps0.9s128K$0.10$0.20
8386Shisa V2 Llama 3.3 70B623±245859.3%<0.1%8 tps2.0s33K$0.03$0.09
9386Dolphin 2.9.2 Mixtral 8x22B652±191.1K2.6%<0.1%20 tps1.5s16K$0.90$0.90
10386MiMo 7B RL655±131.2K3.5%<0.1%31 tps0.4s32K$0.49$0.49
11276MiniMax M1686±133.8K5.3%<0.1%31 tps2.8s1M$0.55$2.20
12383ArliAI QwQ 32B Arliai RpR V1686±406359.3%<0.1%34 tps1.8s33K$0.02$0.07
13276DeepSeek-R1 Distill Qwen 32B696±202K5.5%6.2%22 tps1.8s131K$0.37$0.39
14374Dolphin 3.0 R1 Mistral 24B701±168907.8%<0.1%13 tps0.1s33K$0.03$0.09
15269DeepHermes 3 Mistral 24B Preview706±307155.9%2.5%50 tps1.0s33K$0.06$0.25
16269Inflection 3 Pi719±181.5K4.1%1.1%33 tps3.4s8K$2.50$10.00
17374Solar Pro 250422720±195306.2%<0.1%13 tps0.6s33K$0$0
18374Mistral Nemo 12B Celeste V1.9725±181.1K3.5%<0.1%6 tps10.2s8K$0.80$1.20
19361Zenith730±288509.6%<0.1%36 tps1.8s131K$0$0
20361Meridian734±399659.8%<0.1%92 tps1.2s131K$0$0
21269Inflection 3 Productivity737±241.5K5.0%0.6%50 tps3.2s8K$2.50$10.00
22262Qwen 2.5 VL 72B Instruct746±202.1K6.0%5.3%25 tps3.7s128K$1.01$2.79
23361Seed Coder 8B Instruct751±226052.4%<0.1%35 tpsN/A32K$0.99$0.99
24361DeepSeek-R1 Distill Qwen 14B756±161.9K6.3%<0.1%44 tps1.7s64K$0.63$0.63
25262Open Mistral 7B762±181.3K4.7%0.7%176 tps0.4s33K$0.25$0.25
26354OLMo 3 7B Think763±217707.8%4.2%77 tps0.4s66K$0.12$0.20
27262Baichuan-M2-32B770±3074010.8%<0.1%32 tps3.3s131K$0.07$0.07
28252Hermes 4 70B781±294608.9%1.1%67 tps0.6s131K$0.12$0.39
29252Mistral Large785±161.1K5.8%1.5%54 tps0.7s33K$2.00$6.00
30252GPT-3.5 Turbo Instruct787±92K2.7%<0.1%46 tps1.2s4K$1.50$2.00
31252Mercury Coder793±275103.8%<0.1%247 tps2.2s32K$0.25$1.00
32346Magistral Medium 2507795±2566514.2%<0.1%86 tps0.7s41K$2.00$5.00
33252Hermes 4 405B FP8797±218158.4%3.5%31 tps0.9s131K$0.52$1.73
34252WizardLM-2 8x22B801±121.9K3.1%11.6%11 tps2.5s66K$0.77$0.77
35252Magistral Small 2509802±181.8K7.5%2.7%116 tps0.6s131K$0.50$1.50
36346Magistral Medium (Thinking)804±102.2K5.7%<0.1%67 tps0.8s41K$2.00$5.00
37337Qwen 2 72B Instruct805±191K3.3%<0.1%3 tpsN/A33K$0.90$0.90
38337GLM 4.1V 9B Thinking813±161.1K4.2%<0.1%69 tps1.3s66K$0.04$0.14
39240LFM2 8B A1B818±1882511.3%<0.1%142 tps0.3s33K$0.01$0.02
40240Moonshot V1 32k820±179503.1%1.4%53 tps1.4s33K$1.00$3.00
View All (305 models)