Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Language

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

466

MythoMax L2 13B

545

Phi 4 Mini Reasoning

570

MiniMax M1

624

DeepSeek-R1 Distill Llama 8B

632

Phi 4 Reasoning

638

DeepSeek-R1 Distill Qwen 32B

648

DeepSeek-R1 Distill Qwen 1.5B

664

DeepSeek-R1 Distill Qwen 7B

664

DeepSeek-R1 Distill Qwen 14B

699

GPT-3.5 Turbo 16k

702

C4AI Aya Expanse 32B

733

Gemma 3 1B

733

DeepSeek-R1 Distill Llama 70B

748

Arcee AI Blitz

751

GLM 4 32B

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
1	281	MythoMax L2 13B	466	±30	520	4.6%	1.2%	22 tps	1.1s	4K	$0.18	$0.18
2	291	Phi 4 Mini Reasoning	545	±12	2.3K	3.1%	9.7%	30 tps	0.9s	128K	$0.07	$0.30
3	284	MiniMax M1	570	±10	3K	1.3%	<0.1%	31 tps	2.8s	1M	$0.55	$2.20
4	428	DeepSeek-R1 Distill Llama 8B	624	±20	1.1K	3.1%	<0.1%	17 tps	N/A	32K	$0.04	$0.04
5	287	Phi 4 Reasoning	632	±13	2K	2.2%	21.0%	29 tps	1.0s	33K	$0.06	$0.25
6	274	DeepSeek-R1 Distill Qwen 32B	638	±9	1.8K	2.7%	6.2%	22 tps	1.8s	131K	$0.37	$0.39
7	430	DeepSeek-R1 Distill Qwen 1.5B	648	±17	915	2.1%	<0.1%	20 tps	0.0s	131K	$0.18	$0.18
8	424	DeepSeek-R1 Distill Qwen 7B	664	±21	640	1.5%	<0.1%	0 tps	N/A	131K	$0.05	$0.10
9	406	DeepSeek-R1 Distill Qwen 14B	664	±14	1.7K	2.5%	<0.1%	44 tps	1.7s	64K	$0.63	$0.63
10	225	GPT-3.5 Turbo 16k	699	±17	690	0.7%	<0.1%	22 tps	0.6s	16K	$3.00	$4.00
11	214	C4AI Aya Expanse 32B	702	±22	825	1.8%	1.5%	43 tps	0.5s	128K	$0.50	$1.50
12	256	Gemma 3 1B	733	±21	670	2.9%	0.6%	176 tps	1.0s	33K	$0.06	$0.10
13	246	DeepSeek-R1 Distill Llama 70B	733	±10	2.6K	2.3%	3.6%	27 tps	1.6s	32K	$0.73	$0.95
14	241	Arcee AI Blitz	748	±15	710	1.4%	<0.1%	6 tps	N/A	33K	$0.45	$0.75
15	235	GLM 4 32B	751	±19	740	2.0%	2.6%	40 tps	1.6s	33K	$0.14	$0.14
16	225	Command R	754	±22	540	2.7%	5.8%	54 tps	0.6s	128K	$0.30	$0.99
17	229	Ministral 8B	763	±23	525	3.7%	1.4%	177 tps	0.4s	128K	$0.14	$0.14
18	246	Ministral 3B	766	±21	575	1.7%	0.8%	248 tps	0.4s	131K	$0.08	$0.08
19	339	Refuel LLM 2 Small	768	±20	800	2.4%	<0.1%	116 tps	0.5s	8K	$0.20	$0.20
20	399	Magistral Medium (Thinking)	774	±7	2.3K	2.7%	<0.1%	67 tps	0.8s	41K	$2.00	$5.00
21	225	Command R 7B	775	±18	870	1.7%	1.1%	76 tps	0.4s	128K	$0.04	$0.15
22	314	MAI-DS-R1	778	±12	1.7K	3.4%	<0.1%	73 tps	3.2s	64K	$0.10	$0.40
23	219	Arcee AI Virtuoso-Large	791	±11	840	1.8%	<0.1%	64 tps	0.5s	131K	$0.75	$1.20
24	292	Arcee AI Spotlight	796	±15	1.4K	1.8%	<0.1%	121 tps	0.4s	131K	$0.18	$0.18
25	222	Sky T1 32B Preview	797	±17	625	1.6%	7.8%	73 tps	0.6s	16K	$0.12	$0.18
26	201	GPT-4o mini	803	±24	545	4.4%	2.1%	71 tps	1.7s	128K	$0.15	$0.60
27	179	Amazon Nova Pro 1.0	807	±19	1.4K	1.7%	0.9%	96 tps	0.7s	300K	$0.80	$1.70
28	270	Arcee AI Virtuoso-Medium	809	±21	540	0.9%	<0.1%	3 tps	N/A	131K	$0.50	$0.80
29	241	Claude Haiku 3	813	±18	645	2.3%	0.4%	62 tps	0.5s	200K	$0.25	$1.25
30	194	Llama 3.2 11B Instruct	816	±22	525	2.8%	1.5%	152 tps	0.5s	8K	$0.16	$0.16
31	222	Jamba 1.5 Large	819	±15	690	1.4%	1.7%	48 tps	0.9s	256K	$1.50	$6.00
32	235	Gemma 3 4B	825	±14	755	3.2%	1.3%	138 tps	0.7s	131K	$0.02	$0.04
33	201	Llama 3 8B	826	±17	720	1.4%	6.0%	85 tps	0.7s	8K	$0.12	$0.16
34	260	Hermes 4 405B Reasoning FP8	828	±11	1.3K	3.7%	3.6%	32 tps	0.8s	131K	$1.00	$3.00
35	179	Inception Mercury	829	±10	2K	1.5%	0.4%	257 tps	1.1s	32K	$0.25	$1.00
36	265	Magistral Small 2509	830	±23	825	5.7%	2.7%	116 tps	0.6s	131K	$0.50	$1.50
37	270	AFM 4.5B Preview	830	±22	875	2.2%	<0.1%	32 tps	0.0s	66K	$0	$0
38	209	Qwen 2.5 14B Instruct	830	±24	570	1.7%	2.4%	40 tps	1.6s	1M	$0.40	$1.61
39	161	Mistral Small 3.1	835	±17	675	1.5%	7.4%	13 tps	2.6s	32K	$0.17	$0.28
40	374	Cogito V2 671B	839	±13	1.3K	3.0%	<0.1%	41 tps	0.6s	164K	$1.25	$1.25

1of7

View All (260 models)