Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Topics

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

954

Devstral Small

954

Rnj-1 Instruct

956

GLM 4.7 Flash

956

Gemma 3 12B

957

Switchpoint Router

957

Kimi K2 0711

958

Mistral Small 24B Instruct

958

Qwen 2.5 VL 32B Instruct

958

Grok 3 Mini Fast

960

GLM 4.5 Flash

960

Seed 1.6 Flash 250715

960

Llama 3 8B

961

DeepSeek Prover v2

961

ERNIE 4.5 VL 424B A47B

962

OpenAI o3-mini

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
81	201	Devstral Small	954	±6	5.4K	2.4%	2.4%	180 tps	0.6s	131K	$0.10	$0.30
82	222	Rnj-1 Instruct	954	±6	3K	3.7%	0.6%	103 tps	0.3s	33K	$0.15	$0.15
83	179	GLM 4.7 Flash	956	±8	4.8K	1.8%	5.8%	61 tps	2.8s	128K	$0.07	$0.39
84	214	Gemma 3 12B	956	±3	9.8K	1.9%	4.2%	73 tps	0.8s	131K	$0.05	$0.12
85	179	Switchpoint Router	957	±4	8.5K	2.0%	1.7%	71 tps	4.9s	131K	$0.85	$3.40
86	170	Kimi K2 0711	957	±2	23.3K	2.3%	1.6%	29 tps	1.3s	131K	$0.72	$2.60
87	201	Mistral Small 24B Instruct	958	±4	6.8K	2.1%	1.5%	84 tps	0.4s	33K	$0.80	$0.80
88	214	Qwen 2.5 VL 32B Instruct	958	±12	1.6K	5.4%	6.3%	43 tps	3.2s	128K	$0.35	$0.62
89	186	Grok 3 Mini Fast	958	±2	26.4K	4.4%	1.6%	44 tps	0.5s	131K	$0.60	$4.00
90	194	GLM 4.5 Flash	960	±16	1.4K	4.8%	12.2%	15 tps	2.2s	131K	$0	$0
91	209	Seed 1.6 Flash 250715	960	±5	3.6K	3.1%	2.5%	108 tps	1.6s	256K	$0.07	$0.30
92	201	Llama 3 8B	960	±2	13.1K	1.8%	6.0%	85 tps	0.7s	8K	$0.12	$0.16
93	161	DeepSeek Prover v2	961	±6	3.3K	1.8%	5.2%	14 tps	1.3s	164K	$0.40	$1.56
94	201	ERNIE 4.5 VL 424B A47B	961	±10	1.5K	5.7%	4.9%	36 tps	3.5s	123K	$0.42	$1.25
95	177	OpenAI o3-mini	962	±2	33.6K	4.2%	0.8%	143 tps	3.3s	200K	$1.10	$4.40
96	194	Magistral Small 2506	966	±3	17.5K	1.5%	1.6%	156 tps	0.5s	40K	$0.37	$1.10
97	175	OpenAI o3-mini-low	966	±2	30.5K	4.6%	0.7%	139 tps	1.5s	200K	$1.10	$4.40
98	165	DeepSeek R1T2 Chimera	967	±4	5.9K	3.3%	3.0%	28 tps	1.8s	164K	$0.13	$0.45
99	194	Llama 3.2 11B Instruct	967	±2	9.6K	1.9%	1.5%	152 tps	0.5s	8K	$0.16	$0.16
100	175	MiMo V2 Flash	971	±13	900	4.3%	7.2%	24 tps	1.9s	262K	$0.07	$0.23
101	179	Qwen 2.5 72B	972	±4	5.6K	2.1%	1.2%	96 tps	1.2s	131K	$0.14	$0.26
102	194	Mistral Small 3 24B Instruct	972	±4	7.7K	1.5%	2.6%	77 tps	0.6s	33K	$0.07	$0.14
103	157	GPT-5 Nano	974	±3	10.1K	6.0%	3.2%	113 tps	20.9s	400K	$0.05	$0.40
104	186	Gemma 3n E4B	976	±2	25.5K	1.8%	2.0%	30 tps	0.5s	8K	$0.01	$0.02
105	179	Llama 3.1 70B Instruct	976	±14	925	2.6%	6.3%	30 tps	0.8s	128K	$0.17	$0.22
106	194	Llama 3.3 70B	976	±3	10.8K	4.1%	0.3%	500 tps	0.5s	8K	$0.48	$0.66
107	186	Jamba 1.6 Large	977	±2	15.8K	1.3%	2.0%	59 tps	1.2s	256K	$1.33	$5.33
108	179	Inception Mercury	979	±2	28K	1.8%	0.4%	257 tps	1.1s	32K	$0.25	$1.00
109	179	Amazon Nova Pro 1.0	982	±2	24.5K	1.6%	0.9%	96 tps	0.7s	300K	$0.80	$1.70
110	177	Mistral Small 3.1 24B Instruct	982	±2	11.2K	1.8%	7.5%	15 tps	2.4s	131K	$0.06	$0.18
111	153	OpenAI o1	982	±4	18.6K	2.5%	4.2%	92 tps	5.5s	200K	$15.00	$60.00
112	186	Gemma 3 27B	983	±6	3.5K	3.7%	1.8%	35 tps	1.1s	66K	$0.06	$0.10
113	179	Baichuan-M2-32B	983	±7	1.9K	5.9%	<0.1%	32 tps	3.3s	131K	$0.07	$0.07
114	148	OpenAI o3	987	±3	12K	2.6%	0.9%	85 tps	6.8s	128K	$7.33	$29.33
115	133	Kimi K2 0905	988	±3	16.2K	3.9%	4.0%	30 tps	1.4s	262K	$0.63	$2.39
116	148	OpenAI o4-mini-high	988	±2	33.5K	4.5%	1.9%	117 tps	15.9s	200K	$1.10	$4.40
117	201	Qwen 2.5 7B Turbo	992	±7	2.6K	2.8%	0.5%	125 tps	0.4s	131K	$0.30	$0.30
118	194	Llama 3 70B	993	±9	1.9K	1.3%	4.5%	21 tps	1.7s	8K	$1.08	$1.38
119	165	Pixtral Large	994	±4	9.9K	2.6%	2.5%	57 tps	1.3s	128K	$1.50	$4.50
120	170	Mistral Small 3.2 24B	994	±3	15.2K	2.5%	2.8%	141 tps	0.7s	33K	$0.02	$0.08

3of8

View All (288 models)