Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Language

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

305

AFM 4.5B

546

Qwen 2.5 VL 3B Instruct

552

DeepSeek-R1 0528 Qwen3 8B

621

GPT-5 Nano Minimal

682

Wikipedia

715

Fauna Fox

726

GLM 4.6V Flash

739

Grok 3 Mini

744

Pixtral 12B

745

Llama 3.3 70B

778

Pixtral Large

779

Qwen Turbo

781

Gemma 3n E4B

790

Magistral Small 2509

796

GPT-5 Mini Low

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
1	292	AFM 4.5B	305	±46	585	5.6%	<0.1%	81 tps	0.3s	66K	$0.05	$0.20
2	288	Qwen 2.5 VL 3B Instruct	546	±31	1K	9.1%	3.0%	44 tps	2.5s	128K	$0.21	$0.63
3	314	DeepSeek-R1 0528 Qwen3 8B	552	±23	565	5.8%	<0.1%	45 tps	2.4s	128K	$0.05	$0.09
4	292	GPT-5 Nano Minimal	621	±18	590	7.8%	<0.1%	88 tps	0.8s	400K	$0.05	$0.40
5	277	Wikipedia	682	±34	550	9.8%	<0.1%	47 tps	2.1s	32K	$0	$0
6	182	Fauna Fox	715	±28	490	4.9%	<0.1%	194 tps	0.3s	128K	$0.04	$0.15
7	186	GLM 4.6V Flash	726	±29	575	2.5%	3.7%	64 tps	2.1s	128K	$0.04	$0.40
8	186	Grok 3 Mini	739	±20	1.4K	2.5%	1.2%	43 tps	0.5s	131K	$0.30	$0.50
9	274	Pixtral 12B	744	±33	940	9.6%	2.2%	101 tps	1.2s	131K	$0.08	$0.08
10	194	Llama 3.3 70B	745	±30	525	4.5%	0.3%	500 tps	0.5s	8K	$0.48	$0.66
11	165	Pixtral Large	778	±18	1.1K	7.2%	2.5%	57 tps	1.3s	128K	$1.50	$4.50
12	159	Qwen Turbo	779	±15	860	2.8%	<0.1%	53 tps	1.1s	1M	$0.05	$0.20
13	186	Gemma 3n E4B	781	±27	535	4.5%	2.0%	30 tps	0.5s	8K	$0.01	$0.02
14	265	Magistral Small 2509	790	±29	530	6.2%	2.7%	116 tps	0.6s	131K	$0.50	$1.50
15	108	GPT-5 Mini Low	796	±16	865	7.0%	<0.1%	69 tps	3.2s	400K	$0.25	$2.00
16	229	Magistral Medium 2509	797	±17	570	5.0%	4.0%	58 tps	0.9s	131K	$2.00	$5.00
17	147	Arcee AI Maestro Reasoning	802	±17	480	4.0%	<0.1%	85 tps	0.3s	131K	$0.90	$3.30
18	265	Qwen 2.5 VL 72B Instruct	804	±29	715	6.5%	5.3%	25 tps	3.7s	128K	$1.01	$2.79
19	213	DeepSeek R1T Chimera	805	±25	510	3.8%	<0.1%	46 tps	1.1s	164K	$0.09	$0.36
20	148	Qwen3 30B A3B Thinking 2507	818	±18	795	3.0%	0.5%	124 tps	1.2s	131K	$0.16	$1.70
21	86	Nemotron 3 Nano (Thinking)	821	±23	540	4.4%	2.0%	200 tps	0.5s	256K	$0	$0
22	201	GPT-4o mini	826	±18	645	6.5%	2.1%	71 tps	1.7s	128K	$0.15	$0.60
23	161	Qwen3 8B	827	±36	600	4.0%	2.4%	61 tps	1.4s	41K	$0.02	$0.07
24	133	DeepSeek V3.2 Speciale	830	±28	540	3.6%	6.0%	43 tps	1.4s	131K	$0.84	$1.52
25	186	Grok 3 Mini Fast	832	±23	1K	3.3%	1.6%	44 tps	0.5s	131K	$0.60	$4.00
26	84	GPT-5 Mini Minimal	835	±13	1.1K	6.6%	1.2%	63 tps	1.4s	400K	$0.25	$2.00
27	175	OpenAI o3-mini-low	838	±21	1.7K	2.6%	0.7%	139 tps	1.5s	200K	$1.10	$4.40
28	157	GPT-5 Nano	843	±14	2K	6.0%	3.2%	113 tps	20.9s	400K	$0.05	$0.40
29	157	Qwen3 Next 80B A3B Thinking	846	±15	1.3K	3.9%	0.6%	175 tps	1.3s	256K	$0.21	$2.26
30	121	Qwen3 32B Fast	853	±13	1.8K	4.2%	11.6%	30 tps	3.1s	41K	$0.10	$0.25
31	133	Qwen3 14B	856	±24	745	2.6%	1.7%	109 tps	0.8s	41K	$0.04	$0.15
32	170	Kimi K2 0711	858	±24	890	4.3%	1.6%	29 tps	1.3s	131K	$0.72	$2.60
33	139	GLM 4.6V	865	±24	890	2.7%	6.4%	21 tps	1.8s	128K	$0.38	$0.90
34	129	Qwen3 Max Thinking	866	±14	1.5K	1.7%	13.5%	32 tps	2.3s	256K	$1.20	$6.00
35	214	OpenAI o3-mini-high	868	±13	1.4K	3.8%	2.4%	231 tps	10.5s	200K	$1.10	$4.40
36	246	DeepSeek-R1 Distill Llama 70B	869	±28	590	4.8%	3.6%	27 tps	1.6s	32K	$0.73	$0.95
37	179	GLM 4.7 Flash	874	±24	855	2.8%	5.8%	61 tps	2.8s	128K	$0.07	$0.39
38	160	Llama 4 Scout	875	±15	2.3K	2.9%	0.6%	88 tps	5.1s	131K	$0.18	$0.46
39	270	Solar Pro 2 250710 (Reasoning)	878	±25	505	3.8%	<0.1%	9 tps	N/A	66K	$0.50	$0.50
40	165	Qwen3 4B	878	±23	735	3.9%	1.9%	94 tps	1.5s	128K	$0.01	$0.01

1of5

View All (188 models)