
Last updated about 1 month ago

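| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | 177 | Mistral Small 3.1 24B Instruct | 982 | ±2 | 11.2K | 1.8% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 |
| 242 | 179 | Amazon Nova Pro 1.0 | 982 | ±2 | 24.5K | 1.6% | 0.9% | 96 tps | 0.7s | 300K | $0.80 | $1.70 |
| 243 | 241 | Claude Haiku 3 | 979 | ±3 | 11.5K | 1.6% | 0.4% | 62 tps | 0.5s | 200K | $0.25 | $1.25 |
| 244 | 233 | Cogito V2 Preview Llama 70B | 979 | ±10 | 1.2K | 4.8% | <0.1% | 44 tps | 1.6s | 33K | $0.44 | $0.44 |
| 245 | 270 | AFM 4.5B Preview | 979 | ±6 | 11.5K | 1.3% | <0.1% | 32 tps | 0.0s | 66K | $0 | $0 |
| 246 | 179 | Inception Mercury | 979 | ±2 | 28K | 1.8% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 |
| 247 | 182 | GLM 4.6 FP8 | 977 | ±6 | 2.8K | 8.2% | <0.1% | 56 tps | 1.8s | 200K | $0.40 | $1.75 |
| 248 | 186 | Jamba 1.6 Large | 977 | ±2 | 15.8K | 1.3% | 2.0% | 59 tps | 1.2s | 256K | $1.33 | $5.33 |
| 249 | 194 | Llama 3.3 70B | 976 | ±3 | 10.8K | 4.1% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 250 | 179 | Llama 3.1 70B Instruct | 976 | ±14 | 925 | 2.6% | 6.3% | 30 tps | 0.8s | 128K | $0.17 | $0.22 |
| 251 | 186 | Gemma 3n E4B | 976 | ±2 | 25.5K | 1.8% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 |
| 252 | 302 | Cogito V2 Preview Llama 109B | 975 | ±10 | 980 | 5.3% | <0.1% | 84 tps | 1.4s | 33K | $0.18 | $0.59 |
| 253 | 157 | GPT-5 Nano | 974 | ±3 | 10.1K | 6.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 254 | 194 | Mistral Small 3 24B Instruct | 972 | ±4 | 7.7K | 1.5% | 2.6% | 77 tps | 0.6s | 33K | $0.07 | $0.14 |
| 255 | 179 | Qwen 2.5 72B | 972 | ±4 | 5.6K | 2.1% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 256 | 175 | MiMo V2 Flash | 971 | ±13 | 900 | 4.3% | 7.2% | 24 tps | 1.9s | 262K | $0.07 | $0.23 |
| 257 | 193 | GPT-5 Nano High | 969 | ±9 | 875 | 2.2% | <0.1% | 23 tps | 25.7s | 400K | $0.05 | $0.40 |
| 258 | 339 | OLMo 3 7B Instruct | 969 | ±15 | 685 | 2.8% | 1.6% | 72 tps | 0.6s | 66K | $0.10 | $0.20 |
| 259 | 194 | Llama 3.2 11B Instruct | 967 | ±2 | 9.6K | 1.9% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 |
| 260 | 165 | DeepSeek R1T2 Chimera | 967 | ±4 | 5.9K | 3.3% | 3.0% | 28 tps | 1.8s | 164K | $0.13 | $0.45 |
| 261 | 175 | OpenAI o3-mini-low | 966 | ±2 | 30.5K | 4.6% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 262 | 194 | Magistral Small 2506 | 966 | ±3 | 17.5K | 1.5% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 |
| 263 | 159 | Sherlock Think Alpha | 964 | ±16 | 650 | 4.4% | <0.1% | 50 tps | 5.4s | 2M | $0 | $0 |
| 264 | 302 | OLMo 3 32B Think | 963 | ±9 | 1.8K | 2.7% | <0.1% | 84 tps | 0.6s | 66K | $0.15 | $0.50 |
| 265 | 265 | Llama 3.1 405B Instruct Turbo | 962 | ±4 | 8.3K | 1.7% | <0.1% | 26 tps | 0.8s | 131K | $3.50 | $3.50 |
| 266 | 177 | OpenAI o3-mini | 962 | ±2 | 33.6K | 4.2% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 267 | 201 | ERNIE 4.5 VL 424B A47B | 961 | ±10 | 1.5K | 5.7% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 |
| 268 | 161 | DeepSeek Prover v2 | 961 | ±6 | 3.3K | 1.8% | 5.2% | 14 tps | 1.3s | 164K | $0.40 | $1.56 |
| 269 | 201 | Llama 3 8B | 960 | ±2 | 13.1K | 1.8% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 270 | 209 | Seed 1.6 Flash 250715 | 960 | ±5 | 3.6K | 3.1% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 271 | 194 | GLM 4.5 Flash | 960 | ±16 | 1.4K | 4.8% | 12.2% | 15 tps | 2.2s | 131K | $0 | $0 |
| 272 | 186 | Grok 3 Mini Fast | 958 | ±2 | 26.4K | 4.4% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 273 | 214 | Qwen 2.5 VL 32B Instruct | 958 | ±12 | 1.6K | 5.4% | 6.3% | 43 tps | 3.2s | 128K | $0.35 | $0.62 |
| 274 | 201 | Mistral Small 24B Instruct | 958 | ±4 | 6.8K | 2.1% | 1.5% | 84 tps | 0.4s | 33K | $0.80 | $0.80 |
| 275 | 253 | Magistral Medium | 958 | ±7 | 2.8K | 8.2% | <0.1% | 95 tps | 0.5s | 41K | $2.00 | $5.00 |
| 276 | 170 | Kimi K2 0711 | 957 | ±2 | 23.3K | 2.3% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 277 | 277 | Wikipedia | 957 | ±2 | 65.3K | 1.5% | <0.1% | 47 tps | 2.1s | 32K | $0 | $0 |
| 278 | 314 | Cogito V2 Preview Llama 405B | 957 | ±10 | 1K | 5.1% | <0.1% | 23 tps | 2.1s | 33K | $1.17 | $1.17 |
| 279 | 179 | Switchpoint Router | 957 | ±4 | 8.5K | 2.0% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 280 | 214 | Gemma 3 12B | 956 | ±3 | 9.8K | 1.9% | 4.2% | 73 tps | 0.8s | 131K | $0.05 | $0.12 |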
Showing ranks 241 to 280 of 432 models.