Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Topics

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

AFM 4.5B

373

Qwen 2.5 VL 3B Instruct

518

Fauna Fox

613

Inception Mercury

635

Qwen 2.5 VL 72B Instruct

674

Pixtral 12B

703

Wikipedia

719

Llama 3.3 70B

724

Grok 3 Mini Fast

752

OpenAI o3-mini-low

765

Magistral Medium 2509

777

OpenAI o3-mini

778

OpenAI o3-mini-high

782

Llama 4 Scout

787

Qwen3 30B A3B Thinking 2507

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
1	292	AFM 4.5B	95	±54	605	9.7%	<0.1%	81 tps	0.3s	66K	$0.05	$0.20
2	288	Qwen 2.5 VL 3B Instruct	373	±46	955	7.7%	3.0%	44 tps	2.5s	128K	$0.21	$0.63
3	182	Fauna Fox	518	±29	625	6.0%	<0.1%	194 tps	0.3s	128K	$0.04	$0.15
4	179	Inception Mercury	613	±25	500	6.5%	0.4%	257 tps	1.1s	32K	$0.25	$1.00
5	265	Qwen 2.5 VL 72B Instruct	635	±34	505	5.6%	5.3%	25 tps	3.7s	128K	$1.01	$2.79
6	274	Pixtral 12B	674	±33	720	5.9%	2.2%	101 tps	1.2s	131K	$0.08	$0.08
7	277	Wikipedia	703	±16	710	4.1%	<0.1%	47 tps	2.1s	32K	$0	$0
8	194	Llama 3.3 70B	719	±18	550	3.5%	0.3%	500 tps	0.5s	8K	$0.48	$0.66
9	186	Grok 3 Mini Fast	724	±15	1.3K	4.3%	1.6%	44 tps	0.5s	131K	$0.60	$4.00
10	175	OpenAI o3-mini-low	752	±17	1.4K	4.3%	0.7%	139 tps	1.5s	200K	$1.10	$4.40
11	229	Magistral Medium 2509	765	±18	610	3.9%	4.0%	58 tps	0.9s	131K	$2.00	$5.00
12	177	OpenAI o3-mini	777	±12	2K	3.6%	0.8%	143 tps	3.3s	200K	$1.10	$4.40
13	214	OpenAI o3-mini-high	778	±17	630	3.1%	2.4%	231 tps	10.5s	200K	$1.10	$4.40
14	160	Llama 4 Scout	782	±15	1.6K	4.1%	0.6%	88 tps	5.1s	131K	$0.18	$0.46
15	148	Qwen3 30B A3B Thinking 2507	787	±14	655	4.4%	0.5%	124 tps	1.2s	131K	$0.16	$1.70
16	170	Mistral Small 3.2 24B	793	±28	475	5.9%	2.8%	141 tps	0.7s	33K	$0.02	$0.08
17	165	Pixtral Large	796	±26	610	7.6%	2.5%	57 tps	1.3s	128K	$1.50	$4.50
18	302	YouTube	797	±20	1.1K	4.0%	<0.1%	34 tps	2.7s	32K	$0.99	$0.99
19	186	Grok 3 Mini	799	±23	1.9K	2.6%	1.2%	43 tps	0.5s	131K	$0.30	$0.50
20	213	Claude Haiku 3.5	801	±15	1.2K	5.9%	0.8%	40 tps	2.8s	200K	$0.80	$4.00
21	314	MAI-DS-R1	810	±20	565	5.8%	<0.1%	73 tps	3.2s	64K	$0.10	$0.40
22	161	Qwen3 8B	813	±23	610	5.4%	2.4%	61 tps	1.4s	41K	$0.02	$0.07
23	165	Qwen3 4B	818	±20	870	4.9%	1.9%	94 tps	1.5s	128K	$0.01	$0.01
24	121	QwQ 32B	825	±16	1.3K	5.5%	5.4%	41 tps	2.1s	16K	$0.43	$0.56
25	126	Qwen3 30B A3B	832	±20	950	4.0%	5.1%	163 tps	1.0s	41K	$0.06	$0.21
26	139	GLM 4.6V	837	±23	640	5.2%	6.4%	21 tps	1.8s	128K	$0.38	$0.90
27	161	Llama 4 Maverick	838	±10	2.4K	4.3%	1.2%	88 tps	2.4s	1M	$0.23	$0.83
28	133	DeepSeek V3.2 Speciale	842	±38	485	4.9%	6.0%	43 tps	1.4s	131K	$0.84	$1.52
29	159	Qwen Turbo	847	±15	1.1K	4.3%	<0.1%	53 tps	1.1s	1M	$0.05	$0.20
30	121	Qwen3 32B Fast	863	±13	2K	4.5%	11.6%	30 tps	3.1s	41K	$0.10	$0.25
31	119	ERNIE 4.5 300B A47B	873	±12	1.1K	3.4%	4.7%	23 tps	2.3s	123K	$0.28	$1.10
32	143	Gemini 2.0 Flash	885	±16	870	5.9%	<0.1%	76 tps	0.5s	1M	$0.14	$0.56
33	157	Qwen3 Next 80B A3B Thinking	890	±16	1.1K	3.4%	0.6%	175 tps	1.3s	256K	$0.21	$2.26
34	241	GPT-5 Mini High	891	±16	810	4.1%	<0.1%	33 tps	3.9s	400K	$0.25	$2.00
35	133	GPT-4.1 nano	896	±11	1.8K	3.2%	0.6%	175 tps	0.5s	1M	$0.10	$0.40
36	157	GPT-5 Nano	901	±9	1.2K	5.0%	3.2%	113 tps	20.9s	400K	$0.05	$0.40
37	143	Gemini 2.0 Flash Lite	902	±14	990	7.9%	<0.1%	42 tps	0.5s	1M	$0.08	$0.30
38	133	DeepSeek-R1 0528	904	±19	640	4.5%	1.3%	93 tps	0.5s	64K	$1.60	$3.67
39	124	Qwen3 235B A22B Thinking 2507	905	±16	695	2.1%	2.5%	53 tps	1.6s	131K	$0.59	$5.70
40	65	DeepSeek V3.2 Exp Chat	909	±11	1.3K	2.6%	2.6%	29 tps	1.5s	131K	$0.27	$0.39

1of4

View All (159 models)