Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

915
OpenAI o4-mini-high
922
DeepSeek V3.2 Speciale
922
Grok 3 Fast
924
Mistral Small 3.1
927
DeepSeek V3.1 Terminus Thinking
929
Mistral Small 3.2 24B
932
Kimi K2 0711
932
GLM 4.7 Flash
940
Qwen3 4B
941
Pixtral Large
946
OpenAI o1
948
Gemini 2.0 Flash Lite
948
Amazon Nova 2 Lite
955
GPT-5 Nano
958
GPT-4.1 nano

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
41148OpenAI o4-mini-high915±104.7K1.5%1.9%117 tps15.9s200K$1.10$4.40
42133DeepSeek V3.2 Speciale922±257806.6%6.0%43 tps1.4s131K$0.84$1.52
43111Grok 3 Fast922±125300.9%1.7%52 tps2.4s131K$5.00$25.00
44161Mistral Small 3.1924±366152.4%7.4%13 tps2.6s32K$0.17$0.28
45106DeepSeek V3.1 Terminus Thinking927±178404.5%5.9%27 tps1.8s131K$0.56$1.68
46170Mistral Small 3.2 24B929±149852.5%2.8%141 tps0.7s33K$0.02$0.08
47170Kimi K2 0711932±122.4K1.7%1.6%29 tps1.3s131K$0.72$2.60
48179GLM 4.7 Flash932±266902.1%5.8%61 tps2.8s128K$0.07$0.39
49165Qwen3 4B940±141.7K5.2%1.9%94 tps1.5s128K$0.01$0.01
50165Pixtral Large941±189403.6%2.5%57 tps1.3s128K$1.50$4.50
51153OpenAI o1946±113.6K1.0%4.2%92 tps5.5s200K$15.00$60.00
52143Gemini 2.0 Flash Lite948±73.5K1.7%<0.1%42 tps0.5s1M$0.08$0.30
5386Amazon Nova 2 Lite948±219707.6%1.0%137 tps0.6s300K$0.35$2.95
54157GPT-5 Nano955±181.2K4.1%3.2%113 tps20.9s400K$0.05$0.40
55133GPT-4.1 nano958±84K1.4%0.6%175 tps0.5s1M$0.10$0.40
56129Command A959±86.2K1.2%2.2%42 tps0.8s256K$2.00$7.33
5771DeepSeek V3.1962±207503.2%0.8%197 tps0.4s164K$0.55$1.60
58143Gemini 2.0 Flash965±121.8K1.3%<0.1%76 tps0.5s1M$0.14$0.56
59148OpenAI o3968±121.6K1.6%0.9%85 tps6.8s128K$7.33$29.33
60139OpenAI o4-mini971±82.2K2.6%1.4%97 tps7.0s128K$1.10$4.40
6165Mistral Large 3974±201.2K5.5%2.1%51 tps1.0s256K$0.50$1.50
62126DeepSeek V3975±105.5K0.5%0.9%69 tps1.1s64K$0.59$1.49
63129DeepSeek V3.1 Thinking984±131.4K3.7%7.1%18 tps1.8s131K$0.23$0.75
64126Qwen3 30B A3B986±161.9K3.4%5.1%163 tps1.0s41K$0.06$0.21
65113GLM 4.5986±151.5K2.0%3.7%46 tps1.4s131K$0.43$1.63
66161Qwen3 8B991±111.4K2.5%2.4%61 tps1.4s41K$0.02$0.07
6771Seed 1.8 251228997±132.4K1.3%3.7%41 tps2.1s256K$0.25$2.00
6886Qwen3 235B A22B998±181.4K3.2%5.3%71 tps0.9s41K$0.23$0.63
69118GPT-4.1 mini999±105.1K1.4%1.1%67 tps0.9s1M$0.34$1.60
7065GLM 4.61001±151.3K4.4%5.4%39 tps1.5s200K$0.42$1.66
71153Qwen 2.5 32B Instruct1004±141.2K1.7%2.5%48 tps1.0s131K$0.21$0.25
7286Claude Sonnet 41013±710.4K1.2%1.8%49 tps1.3s200K$3.00$15.00
73119ERNIE 4.5 300B A47B1014±114K1.1%4.7%23 tps2.3s123K$0.28$1.10
74121QwQ 32B1015±74.6K1.4%5.4%41 tps2.1s16K$0.43$0.56
75124Kimi K2 0905 Turbo1017±122.1K2.3%0.7%373 tps0.5s262K$1.70$6.50
7671Qwen3.5 397B A17B1021±229101.1%4.3%57 tps1.4s256K$0.52$3.00
77129Qwen3 Max Thinking1022±121.3K1.1%13.5%32 tps2.3s256K$1.20$6.00
7862MiniMax M21027±92.5K5.2%2.2%39 tps2.3s205K$0.21$0.85
79126Qwen3 VL 235B A22B Thinking1027±139354.1%4.3%47 tps3.0s127K$0.47$3.31
8048Claude Sonnet 4 (Thinking)1028±153.7K2.9%1.5%52 tps1.5s200K$3.00$13.67
View All (173 models)