Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Language

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

570

MiniMax M1

638

DeepSeek-R1 Distill Qwen 32B

699

GPT-3.5 Turbo 16k

751

GLM 4 32B

803

GPT-4o mini

807

Amazon Nova Pro 1.0

819

Jamba 1.5 Large

826

Llama 3 8B

829

Inception Mercury

830

Magistral Small 2509

830

Qwen 2.5 14B Instruct

847

Magistral Small 2506

849

Magistral Medium 2509

854

Llama 3.3 Swallow 70B Instruct

866

ERNIE 4.5 21B A3B Thinking

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
1	284	MiniMax M1	570	±10	3K	1.3%	<0.1%	31 tps	2.8s	1M	$0.55	$2.20
2	274	DeepSeek-R1 Distill Qwen 32B	638	±9	1.8K	2.7%	6.2%	22 tps	1.8s	131K	$0.37	$0.39
3	225	GPT-3.5 Turbo 16k	699	±17	690	0.7%	<0.1%	22 tps	0.6s	16K	$3.00	$4.00
4	235	GLM 4 32B	751	±19	740	2.0%	2.6%	40 tps	1.6s	33K	$0.14	$0.14
5	201	GPT-4o mini	803	±24	545	4.4%	2.1%	71 tps	1.7s	128K	$0.15	$0.60
6	179	Amazon Nova Pro 1.0	807	±19	1.4K	1.7%	0.9%	96 tps	0.7s	300K	$0.80	$1.70
7	222	Jamba 1.5 Large	819	±15	690	1.4%	1.7%	48 tps	0.9s	256K	$1.50	$6.00
8	201	Llama 3 8B	826	±17	720	1.4%	6.0%	85 tps	0.7s	8K	$0.12	$0.16
9	179	Inception Mercury	829	±10	2K	1.5%	0.4%	257 tps	1.1s	32K	$0.25	$1.00
10	265	Magistral Small 2509	830	±23	825	5.7%	2.7%	116 tps	0.6s	131K	$0.50	$1.50
11	209	Qwen 2.5 14B Instruct	830	±24	570	1.7%	2.4%	40 tps	1.6s	1M	$0.40	$1.61
12	194	Magistral Small 2506	847	±14	1.3K	1.9%	1.6%	156 tps	0.5s	40K	$0.37	$1.10
13	229	Magistral Medium 2509	849	±16	990	3.9%	4.0%	58 tps	0.9s	131K	$2.00	$5.00
14	209	Llama 3.3 Swallow 70B Instruct	854	±18	890	1.1%	1.4%	153 tps	1.3s	131K	$0.13	$0.39
15	229	ERNIE 4.5 21B A3B Thinking	866	±16	685	2.8%	1.8%	87 tps	1.5s	120K	$0.07	$0.28
16	179	Switchpoint Router	876	±16	675	1.5%	1.7%	71 tps	4.9s	131K	$0.85	$3.40
17	186	Gemma 3n E4B	892	±10	2K	2.6%	2.0%	30 tps	0.5s	8K	$0.01	$0.02
18	170	Devstral Medium	899	±10	995	1.5%	1.5%	77 tps	0.6s	131K	$0.40	$2.00
19	194	Llama 3.3 70B	900	±12	1.1K	3.0%	0.3%	500 tps	0.5s	8K	$0.48	$0.66
20	186	Grok 3 Mini	908	±9	3.8K	2.3%	1.2%	43 tps	0.5s	131K	$0.30	$0.50
21	113	GLM 4.5 AirX	913	±23	540	1.8%	3.3%	75 tps	1.2s	131K	$1.10	$4.50
22	186	Jamba 1.6 Large	915	±18	660	1.5%	2.0%	59 tps	1.2s	256K	$1.33	$5.33
23	214	Gemma 3 12B	921	±18	635	3.1%	4.2%	73 tps	0.8s	131K	$0.05	$0.12
24	177	OpenAI o3-mini	921	±6	9K	1.9%	0.8%	143 tps	3.3s	200K	$1.10	$4.40
25	214	OpenAI o3-mini-high	922	±6	7.5K	2.0%	2.4%	231 tps	10.5s	200K	$1.10	$4.40
26	209	Seed 1.6 Flash 250715	922	±16	580	2.5%	2.5%	108 tps	1.6s	256K	$0.07	$0.30
27	139	OpenAI o4-mini	925	±8	4K	2.3%	1.4%	97 tps	7.0s	128K	$1.10	$4.40
28	170	Kimi K2 0711	928	±9	3.2K	2.2%	1.6%	29 tps	1.3s	131K	$0.72	$2.60
29	157	GPT-5 Nano	928	±10	1.8K	3.0%	3.2%	113 tps	20.9s	400K	$0.05	$0.40
30	148	OpenAI o3	935	±5	4.3K	1.7%	0.9%	85 tps	6.8s	128K	$7.33	$29.33
31	157	Cogito v2.1 671B	937	±15	885	1.7%	0.8%	85 tps	0.5s	128K	$1.25	$1.25
32	186	Grok 3 Mini Fast	939	±11	3.9K	2.3%	1.6%	44 tps	0.5s	131K	$0.60	$4.00
33	179	GLM 4.7 Flash	939	±11	1.1K	1.3%	5.8%	61 tps	2.8s	128K	$0.07	$0.39
34	160	Llama 4 Scout	941	±7	6.9K	1.5%	0.6%	88 tps	5.1s	131K	$0.18	$0.46
35	165	DeepSeek R1T2 Chimera	947	±20	620	2.4%	3.0%	28 tps	1.8s	164K	$0.13	$0.45
36	170	Mistral Small 3.2 24B	953	±9	1.4K	1.8%	2.8%	141 tps	0.7s	33K	$0.02	$0.08
37	129	DeepSeek V3.1 Thinking	956	±10	2.2K	2.4%	7.1%	18 tps	1.8s	131K	$0.23	$0.75
38	175	OpenAI o3-mini-low	963	±8	8.8K	1.9%	0.7%	139 tps	1.5s	200K	$1.10	$4.40
39	86	Amazon Nova 2 Lite	966	±9	1.6K	3.0%	1.0%	137 tps	0.6s	300K	$0.35	$2.95
40	148	OpenAI o4-mini-high	966	±4	9.3K	1.8%	1.9%	117 tps	15.9s	200K	$1.10	$4.40

1of4

View All (142 models)