
Last updated about 1 month ago

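| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 161 | 147 | Arcee AI Maestro Reasoning | 1033 | ±4 | 11.8K | 1.6% | <0.1% | 85 tps | 0.3s | 131K | $0.90 | $3.30 |
| 162 | 159 | Llama 3.1 405B Instruct | 1032 | ±8 | 1.3K | 1.5% | <0.1% | 52 tps | 0.5s | 128K | $2.60 | $4.27 |
| 163 | 177 | Llama 3 70B Turbo | 1031 | ±3 | 15.7K | 1.3% | <0.1% | 31 tps | 0.0s | 8K | $0.73 | $0.83 |
| 164 | 118 | GPT-4.1 mini | 1031 | ±2 | 57.7K | 1.3% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 165 | 153 | Ministral 14B 3.0 | 1031 | ±6 | 2.3K | 3.1% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 166 | 129 | Command A | 1030 | ±2 | 67.4K | 1.3% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 167 | 21 | Claude Opus 4 (Thinking) | 1030 | ±6 | 1.3K | 3.0% | <0.1% | 28 tps | 1.3s | 200K | $15.00 | $75.00 |
| 168 | 111 | Grok 3 Fast | 1030 | ±3 | 12K | 1.1% | 1.7% | 52 tps | 2.4s | 131K | $5.00 | $25.00 |
| 169 | 126 | Qwen3 VL 235B A22B Thinking | 1027 | ±4 | 7.3K | 2.9% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 170 | 133 | DeepSeek V3.2 Speciale | 1027 | ±5 | 5.9K | 2.2% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 171 | 165 | Qwen3 4B | 1027 | ±4 | 9.4K | 3.3% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 |
| 172 | 126 | DeepSeek V3 | 1027 | ±2 | 41.8K | 1.0% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 173 | 165 | Qwen3 VL 30B A3B Thinking | 1027 | ±7 | 2.3K | 4.6% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 174 | 86 | Claude Sonnet 4 | 1026 | ±2 | 88.9K | 1.5% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 175 | 182 | GLM 4.5 Turbo | 1025 | ±11 | 1K | 2.9% | <0.1% | 46 tps | 1.6s | 131K | $1.00 | $3.00 |
| 176 | 159 | Qwen Turbo | 1025 | ±3 | 32.8K | 1.3% | <0.1% | 53 tps | 1.1s | 1M | $0.05 | $0.20 |
| 177 | 143 | Mistral Medium 3 | 1023 | ±9 | 1.2K | 1.7% | 2.4% | 47 tps | 0.8s | 33K | $0.40 | $2.00 |
| 178 | 182 | GLM 4.6 FP8 | 1022 | ±5 | 2.2K | 3.7% | <0.1% | 56 tps | 1.8s | 200K | $0.40 | $1.75 |
| 179 | 77 | Claude Opus 4.1 | 1022 | ±3 | 6.5K | 2.5% | 3.0% | 17 tps | 3.7s | 200K | $15.00 | $75.00 |
| 180 | 161 | Qwen3 8B | 1020 | ±5 | 6.1K | 2.6% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 181 | 143 | Seed 1.6 250615 | 1018 | ±4 | 3.6K | 1.6% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 182 | 182 | Fauna Fox | 1018 | ±4 | 10.7K | 2.4% | <0.1% | 194 tps | 0.3s | 128K | $0.04 | $0.15 |
| 183 | 148 | OpenAI o3 | 1018 | ±6 | 4.2K | 1.8% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 184 | 133 | Kimi K2 0905 | 1016 | ±4 | 9.2K | 2.1% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 185 | 157 | Qwen3 Next 80B A3B Thinking | 1015 | ±3 | 12.3K | 2.3% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 186 | 133 | GPT-4.1 nano | 1014 | ±2 | 52.1K | 1.3% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 187 | 101 | Qwen3.5 35B A3B | 1011 | ±15 | 1.1K | 1.4% | 2.1% | 116 tps | 2.1s | 256K | $0.63 | $1.13 |
| 188 | 179 | Baichuan-M2-32B | 1011 | ±8 | 1.4K | 2.7% | <0.1% | 32 tps | 3.3s | 131K | $0.07 | $0.07 |
| 189 | 139 | Seed 2.0 Mini (Medium) | 1010 | ±10 | 1.3K | 2.6% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 190 | 148 | OpenAI o4-mini-high | 1009 | ±2 | 19.5K | 2.1% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 191 | 175 | MiMo V2 Flash | 1009 | ±10 | 645 | 3.7% | 7.2% | 24 tps | 1.9s | 262K | $0.07 | $0.23 |
| 192 | 139 | Qwen3 VL 30B A3B Instruct | 1009 | ±9 | 1.2K | 4.5% | 1.8% | 80 tps | 2.6s | 129K | $0.18 | $0.67 |
| 193 | 193 | GPT-5 Nano High | 1008 | ±10 | 600 | 1.6% | <0.1% | 23 tps | 25.7s | 400K | $0.05 | $0.40 |
| 194 | 129 | Qwen3 Max Thinking | 1007 | ±5 | 6.9K | 1.4% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 195 | 139 | OpenAI o4-mini | 1007 | ±3 | 13.6K | 2.2% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 196 | 165 | ERNIE 4.5 21B A3B | 1006 | ±7 | 1.4K | 1.8% | 2.3% | 78 tps | 1.5s | 120K | $0.05 | $0.19 |
| 197 | 186 | Mistral Small 3.2 24B Instruct | 1006 | ±7 | 1.6K | 3.4% | 1.9% | 113 tps | 1.1s | 131K | $0.02 | $0.08 |
| 198 | 233 | TNG Tech DeepSeek R1T Chimera | 1006 | ±12 | 600 | 0.8% | <0.1% | 78 tps | 1.5s | 164K | $0.11 | $0.44 |
| 199 | 139 | GLM 4.6V | 1005 | ±3 | 7.4K | 1.4% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 200 | 153 | Qwen 2.5 32B Instruct | 1005 | ±3 | 15.7K | 1.0% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |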
410 models in total.