Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

969
Llama 3.1 70B Instruct Turbo
969
Llama 3.1 70B Instruct
967
Arcee AI Blitz
967
Inception Mercury
965
Magistral Medium
965
Gemma 3n E4B
962
Mistral Small 3.1 24B Instruct
961
AFM 4.5B Preview
960
Claude Haiku 3
957
Jamba 1.6 Large
956
Gemini 2.5 Flash Preview Thinking
956
Grok 3 Mini Beta
955
Mistral Small 3 24B Instruct
955
Llama 3.2 11B Instruct
954
OpenAI o1

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
241233Llama 3.1 70B Instruct Turbo969±316.5K0.9%<0.1%110 tps0.8s128K$0.88$0.88
242179Llama 3.1 70B Instruct969±158451.7%6.3%30 tps0.8s128K$0.17$0.22
243241Arcee AI Blitz967±314.9K0.7%<0.1%6 tpsN/A33K$0.45$0.75
244179Inception Mercury967±324.4K0.9%0.4%257 tps1.1s32K$0.25$1.00
245253Magistral Medium965±62K3.8%<0.1%95 tps0.5s41K$2.00$5.00
246186Gemma 3n E4B965±422.2K1.2%2.0%30 tps0.5s8K$0.01$0.02
247177Mistral Small 3.1 24B Instruct962±410.6K1.3%7.5%15 tps2.4s131K$0.06$0.18
248270AFM 4.5B Preview961±59.7K2.1%<0.1%32 tps0.0s66K$0$0
249241Claude Haiku 3960±310.8K1.0%0.4%62 tps0.5s200K$0.25$1.25
250186Jamba 1.6 Large957±315.3K0.9%2.0%59 tps1.2s256K$1.33$5.33
251182Gemini 2.5 Flash Preview Thinking956±109601.5%<0.1%26 tps1.8s1M$0.15$1.76
252219Grok 3 Mini Beta956±37.7K0.6%<0.1%75 tps0.5s131K$0.45$2.25
253194Mistral Small 3 24B Instruct955±47.2K0.9%2.6%77 tps0.6s33K$0.07$0.14
254194Llama 3.2 11B Instruct955±49.2K1.0%1.5%152 tps0.5s8K$0.16$0.16
255153OpenAI o1954±43.9K1.4%4.2%92 tps5.5s200K$15.00$60.00
256277Dobby Unhinged Llama 3.3 70B951±53.9K1.4%<0.1%41 tps0.4s128K$0.90$0.90
257179GLM 4.7 Flash949±64K1.9%5.8%61 tps2.8s128K$0.07$0.39
258229ERNIE 4.5 21B A3B Thinking947±91.7K2.3%1.8%87 tps1.5s120K$0.07$0.28
259194Magistral Small 2506945±415.6K1.0%1.6%156 tps0.5s40K$0.37$1.10
260292GPT-5 Nano Minimal944±92.4K4.6%<0.1%88 tps0.8s400K$0.05$0.40
261201Qwen 2.5 7B Turbo944±92.4K1.5%0.5%125 tps0.4s131K$0.30$0.30
262194Llama 3.3 70B943±49.1K2.7%0.3%500 tps0.5s8K$0.48$0.66
263253Grok 4 (Low Reasoning)942±51.5K1.3%<0.1%18 tps9.5s256K$0$0
264314GLM 4 32B 0414 128K942±136854.9%<0.1%48 tps3.5s131K$0.10$0.10
265209Qwen 2.5 14B Instruct941±68.3K1.4%2.4%40 tps1.6s1M$0.40$1.61
266201Llama 3 8B941±312.1K0.9%6.0%85 tps0.7s8K$0.12$0.16
267302OLMo 3 32B Think939±121.4K1.4%<0.1%84 tps0.6s66K$0.15$0.50
268270Arcee AI Virtuoso-Medium939±310.1K0.7%<0.1%3 tpsN/A131K$0.50$0.80
269302Cogito V2 Preview Llama 109B938±147452.0%<0.1%84 tps1.4s33K$0.18$0.59
270265Llama 3.1 405B Instruct Turbo938±48K0.9%<0.1%26 tps0.8s131K$3.50$3.50
271186GLM 4.6V Flash937±45.8K2.0%3.7%64 tps2.1s128K$0.04$0.40
272292Arcee AI Spotlight937±318.1K0.9%<0.1%121 tps0.4s131K$0.18$0.18
273179Switchpoint Router936±48.2K1.0%1.7%71 tps4.9s131K$0.85$3.40
274201ERNIE 4.5 VL 424B A47B936±128055.3%4.9%36 tps3.5s123K$0.42$1.25
275201Mistral Small 24B Instruct935±46.3K1.2%1.5%84 tps0.4s33K$0.80$0.80
276194Llama 3 70B934±71.7K1.1%4.5%21 tps1.7s8K$1.08$1.38
277214Krutrim 2933±310.7K0.7%12.5%33 tps2.1s128K$1.00$1.00
278277Cypher Alpha932±43K2.9%<0.1%4 tpsN/A1M$0$0
279277Grok 2931±39.2K0.9%<0.1%55 tps1.1s131K$2.00$10.00
280186Grok 3 Mini931±423K1.4%1.2%43 tps0.5s131K$0.30$0.50
View All (410 models)