Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1017
Kimi K2 0905 Turbo
1014
ERNIE 4.5 300B A47B
1013
Claude Sonnet 4
1001
GLM 4.6
999
GPT-4.1 mini
997
Seed 1.8 251228
991
Qwen3 8B
986
GLM 4.5
984
DeepSeek V3.1 Thinking
971
OpenAI o4-mini
968
OpenAI o3
965
Gemini 2.0 Flash
962
DeepSeek V3.1
958
GPT-4.1 nano
955
GPT-5 Nano

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81124Kimi K2 0905 Turbo1017±122.1K2.3%0.7%373 tps0.5s262K$1.70$6.50
82119ERNIE 4.5 300B A47B1014±114K1.1%4.7%23 tps2.3s123K$0.28$1.10
8386Claude Sonnet 41013±710.4K1.2%1.8%49 tps1.3s200K$3.00$15.00
8465GLM 4.61001±151.3K4.4%5.4%39 tps1.5s200K$0.42$1.66
85118GPT-4.1 mini999±105.1K1.4%1.1%67 tps0.9s1M$0.34$1.60
8671Seed 1.8 251228997±132.4K1.3%3.7%41 tps2.1s256K$0.25$2.00
87161Qwen3 8B991±111.4K2.5%2.4%61 tps1.4s41K$0.02$0.07
88113GLM 4.5986±151.5K2.0%3.7%46 tps1.4s131K$0.43$1.63
89129DeepSeek V3.1 Thinking984±131.4K3.7%7.1%18 tps1.8s131K$0.23$0.75
90139OpenAI o4-mini971±82.2K2.6%1.4%97 tps7.0s128K$1.10$4.40
91148OpenAI o3968±121.6K1.6%0.9%85 tps6.8s128K$7.33$29.33
92143Gemini 2.0 Flash965±121.8K1.3%<0.1%76 tps0.5s1M$0.14$0.56
9371DeepSeek V3.1962±207503.2%0.8%197 tps0.4s164K$0.55$1.60
94133GPT-4.1 nano958±84K1.4%0.6%175 tps0.5s1M$0.10$0.40
95157GPT-5 Nano955±181.2K4.1%3.2%113 tps20.9s400K$0.05$0.40
9686Amazon Nova 2 Lite948±219707.6%1.0%137 tps0.6s300K$0.35$2.95
97143Gemini 2.0 Flash Lite948±73.5K1.7%<0.1%42 tps0.5s1M$0.08$0.30
98153OpenAI o1946±113.6K1.0%4.2%92 tps5.5s200K$15.00$60.00
99165Qwen3 4B940±141.7K5.2%1.9%94 tps1.5s128K$0.01$0.01
100179GLM 4.7 Flash932±266902.1%5.8%61 tps2.8s128K$0.07$0.39
101170Kimi K2 0711932±122.4K1.7%1.6%29 tps1.3s131K$0.72$2.60
102170Mistral Small 3.2 24B929±149852.5%2.8%141 tps0.7s33K$0.02$0.08
103111Grok 3 Fast922±125300.9%1.7%52 tps2.4s131K$5.00$25.00
104133DeepSeek V3.2 Speciale922±257806.6%6.0%43 tps1.4s131K$0.84$1.52
105148OpenAI o4-mini-high915±104.7K1.5%1.9%117 tps15.9s200K$1.10$4.40
106143Seed 1.6 250615888±235302.8%3.1%46 tps2.2s256K$0.25$2.00
107139GLM 4.6V880±338852.2%6.4%21 tps1.8s128K$0.38$0.90
108157Cogito v2.1 671B876±304903.9%0.8%85 tps0.5s128K$1.25$1.25
109160Llama 4 Scout872±114.4K1.4%0.6%88 tps5.1s131K$0.18$0.46
110194Magistral Small 2506870±161K2.8%1.6%156 tps0.5s40K$0.37$1.10
111170Devstral Medium865±198051.8%1.5%77 tps0.6s131K$0.40$2.00
112209Llama 3.3 Swallow 70B Instruct865±198201.8%1.4%153 tps1.3s131K$0.13$0.39
113175OpenAI o3-mini-low852±84.4K1.8%0.7%139 tps1.5s200K$1.10$4.40
114186Grok 3 Mini852±142.5K1.4%1.2%43 tps0.5s131K$0.30$0.50
115177OpenAI o3-mini851±74.7K1.8%0.8%143 tps3.3s200K$1.10$4.40
116179Inception Mercury847±131.4K1.0%0.4%257 tps1.1s32K$0.25$1.00
117214OpenAI o3-mini-high833±132.9K1.0%2.4%231 tps10.5s200K$1.10$4.40
118235GLM 4 32B820±167002.1%2.6%40 tps1.6s33K$0.14$0.14
119186Gemma 3n E4B814±171.6K3.6%2.0%30 tps0.5s8K$0.01$0.02
120186Grok 3 Mini Fast807±142.4K1.8%1.6%44 tps0.5s131K$0.60$4.00
View All (131 models)