Leaderboard | Text

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
241	177	Mistral Small 3.1 24B Instruct	982	±2	11.2K	1.8%	7.5%	15 tps	2.4s	131K	$0.06	$0.18
242	179	Amazon Nova Pro 1.0	982	±2	24.5K	1.6%	0.9%	96 tps	0.7s	300K	$0.80	$1.70
243	241	Claude Haiku 3	979	±3	11.5K	1.6%	0.4%	62 tps	0.5s	200K	$0.25	$1.25
244	233	Cogito V2 Preview Llama 70B	979	±10	1.2K	4.8%	<0.1%	44 tps	1.6s	33K	$0.44	$0.44
245	270	AFM 4.5B Preview	979	±6	11.5K	1.3%	<0.1%	32 tps	0.0s	66K	$0	$0
246	179	Inception Mercury	979	±2	28K	1.8%	0.4%	257 tps	1.1s	32K	$0.25	$1.00
247	182	GLM 4.6 FP8	977	±6	2.8K	8.2%	<0.1%	56 tps	1.8s	200K	$0.40	$1.75
248	186	Jamba 1.6 Large	977	±2	15.8K	1.3%	2.0%	59 tps	1.2s	256K	$1.33	$5.33
249	194	Llama 3.3 70B	976	±3	10.8K	4.1%	0.3%	500 tps	0.5s	8K	$0.48	$0.66
250	179	Llama 3.1 70B Instruct	976	±14	925	2.6%	6.3%	30 tps	0.8s	128K	$0.17	$0.22
251	186	Gemma 3n E4B	976	±2	25.5K	1.8%	2.0%	30 tps	0.5s	8K	$0.01	$0.02
252	302	Cogito V2 Preview Llama 109B	975	±10	980	5.3%	<0.1%	84 tps	1.4s	33K	$0.18	$0.59
253	157	GPT-5 Nano	974	±3	10.1K	6.0%	3.2%	113 tps	20.9s	400K	$0.05	$0.40
254	194	Mistral Small 3 24B Instruct	972	±4	7.7K	1.5%	2.6%	77 tps	0.6s	33K	$0.07	$0.14
255	179	Qwen 2.5 72B	972	±4	5.6K	2.1%	1.2%	96 tps	1.2s	131K	$0.14	$0.26
256	175	MiMo V2 Flash	971	±13	900	4.3%	7.2%	24 tps	1.9s	262K	$0.07	$0.23
257	193	GPT-5 Nano High	969	±9	875	2.2%	<0.1%	23 tps	25.7s	400K	$0.05	$0.40
258	339	OLMo 3 7B Instruct	969	±15	685	2.8%	1.6%	72 tps	0.6s	66K	$0.10	$0.20
259	194	Llama 3.2 11B Instruct	967	±2	9.6K	1.9%	1.5%	152 tps	0.5s	8K	$0.16	$0.16
260	165	DeepSeek R1T2 Chimera	967	±4	5.9K	3.3%	3.0%	28 tps	1.8s	164K	$0.13	$0.45
261	175	OpenAI o3-mini-low	966	±2	30.5K	4.6%	0.7%	139 tps	1.5s	200K	$1.10	$4.40
262	194	Magistral Small 2506	966	±3	17.5K	1.5%	1.6%	156 tps	0.5s	40K	$0.37	$1.10
263	159	Sherlock Think Alpha	964	±16	650	4.4%	<0.1%	50 tps	5.4s	2M	$0	$0
264	302	OLMo 3 32B Think	963	±9	1.8K	2.7%	<0.1%	84 tps	0.6s	66K	$0.15	$0.50
265	265	Llama 3.1 405B Instruct Turbo	962	±4	8.3K	1.7%	<0.1%	26 tps	0.8s	131K	$3.50	$3.50
266	177	OpenAI o3-mini	962	±2	33.6K	4.2%	0.8%	143 tps	3.3s	200K	$1.10	$4.40
267	201	ERNIE 4.5 VL 424B A47B	961	±10	1.5K	5.7%	4.9%	36 tps	3.5s	123K	$0.42	$1.25
268	161	DeepSeek Prover v2	961	±6	3.3K	1.8%	5.2%	14 tps	1.3s	164K	$0.40	$1.56
269	201	Llama 3 8B	960	±2	13.1K	1.8%	6.0%	85 tps	0.7s	8K	$0.12	$0.16
270	209	Seed 1.6 Flash 250715	960	±5	3.6K	3.1%	2.5%	108 tps	1.6s	256K	$0.07	$0.30
271	194	GLM 4.5 Flash	960	±16	1.4K	4.8%	12.2%	15 tps	2.2s	131K	$0	$0
272	186	Grok 3 Mini Fast	958	±2	26.4K	4.4%	1.6%	44 tps	0.5s	131K	$0.60	$4.00
273	214	Qwen 2.5 VL 32B Instruct	958	±12	1.6K	5.4%	6.3%	43 tps	3.2s	128K	$0.35	$0.62
274	201	Mistral Small 24B Instruct	958	±4	6.8K	2.1%	1.5%	84 tps	0.4s	33K	$0.80	$0.80
275	253	Magistral Medium	958	±7	2.8K	8.2%	<0.1%	95 tps	0.5s	41K	$2.00	$5.00
276	170	Kimi K2 0711	957	±2	23.3K	2.3%	1.6%	29 tps	1.3s	131K	$0.72	$2.60
277	277	Wikipedia	957	±2	65.3K	1.5%	<0.1%	47 tps	2.1s	32K	$0	$0
278	314	Cogito V2 Preview Llama 405B	957	±10	1K	5.1%	<0.1%	23 tps	2.1s	33K	$1.17	$1.17
279	179	Switchpoint Router	957	±4	8.5K	2.0%	1.7%	71 tps	4.9s	131K	$0.85	$3.40
280	214	Gemma 3 12B	956	±3	9.8K	1.9%	4.2%	73 tps	0.8s	131K	$0.05	$0.12

View All (432 models)