Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Topics

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

1207

MiniMax M2.7-highspeed

1171

Kimi K2.5 Instant

1164

Step 3.5 Flash

1161

Qwen3 Next 80B A3B Instruct

1151

Qwen3.5 122B A17B

1151

Kimi K2.5

1144

gpt-oss-120b

1137

Kimi K2 Thinking Turbo

1127

DeepSeek V3.2 Thinking

1127

Nemotron 3 Nano (Thinking)

1124

DeepSeek V3.2 Exp Chat

1122

Mistral Large 3

1121

MiniMax M2.5 Lightning

1097

Qwen3.5 27B

1093

gpt-oss-20b

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
1	22	MiniMax M2.7-highspeed	1207	±10	1.1K	2.1%	2.3%	50 tps	2.1s	205K	$0.60	$2.40
2	37	Kimi K2.5 Instant	1171	±4	6.2K	1.8%	2.9%	32 tps	3.0s	262K	$0.50	$3.00
3	48	Step 3.5 Flash	1164	±5	4K	1.5%	2.2%	109 tps	0.6s	256K	$0.05	$0.15
4	33	Qwen3 Next 80B A3B Instruct	1161	±2	24.9K	3.8%	0.6%	84 tps	1.1s	256K	$0.20	$1.42
5	52	Qwen3.5 122B A17B	1151	±4	4.7K	1.6%	1.5%	82 tps	1.4s	256K	$0.40	$3.20
6	33	Kimi K2.5	1151	±3	32.5K	1.8%	6.5%	33 tps	1.7s	262K	$0.34	$2.57
7	48	gpt-oss-120b	1144	±2	40.7K	3.7%	0.7%	213 tps	0.5s	131K	$0.11	$0.50
8	44	Kimi K2 Thinking Turbo	1137	±3	29.8K	2.5%	2.0%	75 tps	1.4s	262K	$1.15	$8.00
9	56	DeepSeek V3.2 Thinking	1127	±3	37.6K	2.6%	9.0%	30 tps	2.6s	131K	$0.28	$0.42
10	86	Nemotron 3 Nano (Thinking)	1127	±3	7.5K	2.4%	2.0%	200 tps	0.5s	256K	$0	$0
11	65	DeepSeek V3.2 Exp Chat	1124	±3	14.3K	4.0%	2.6%	29 tps	1.5s	131K	$0.27	$0.39
12	65	Mistral Large 3	1122	±3	14.3K	3.3%	2.1%	51 tps	1.0s	256K	$0.50	$1.50
13	79	MiniMax M2.5 Lightning	1121	±4	5.6K	1.3%	1.5%	51 tps	2.0s	205K	$0.60	$2.40
14	81	Qwen3.5 27B	1097	±6	2.3K	2.4%	3.7%	55 tps	2.6s	256K	$0.30	$2.40
15	101	gpt-oss-20b	1093	±2	20.3K	4.6%	0.5%	216 tps	0.5s	131K	$0.06	$0.26
16	104	Grok 3 Beta	1092	±3	8.1K	0.6%	<0.1%	58 tps	0.8s	131K	$3.00	$15.00
17	86	Qwen3 235B A22B	1090	±3	11.9K	5.1%	5.3%	71 tps	0.9s	41K	$0.23	$0.63
18	159	Llama 3.1 405B Instruct	1072	±8	1.7K	2.0%	<0.1%	52 tps	0.5s	128K	$2.60	$4.27
19	106	DeepSeek V3.1 Terminus Thinking	1071	±3	8.5K	4.9%	5.9%	27 tps	1.8s	131K	$0.56	$1.68
20	95	DeepSeek-R1 Turbo	1069	±3	7.3K	2.9%	2.6%	29 tps	1.8s	64K	$2.85	$4.75
21	95	DeepSeek V3.2 Exp Thinking	1068	±3	9.9K	2.7%	7.2%	26 tps	3.0s	131K	$0.28	$0.42
22	121	QwQ 32B	1068	±2	28.3K	4.4%	5.4%	41 tps	2.1s	16K	$0.43	$0.56
23	200	Llama 3 8B Turbo	1066	±6	2.7K	1.3%	<0.1%	97 tps	0.1s	8K	$0.12	$0.13
24	121	Qwen3 32B Fast	1059	±2	25K	3.8%	11.6%	30 tps	3.1s	41K	$0.10	$0.25
25	177	Llama 3 70B Turbo	1058	±2	18.1K	0.8%	<0.1%	31 tps	0.0s	8K	$0.73	$0.83
26	126	Qwen3 30B A3B	1051	±2	15.1K	4.4%	5.1%	163 tps	1.0s	41K	$0.06	$0.21
27	200	K2 Think	1047	±4	4.7K	2.1%	<0.1%	418 tps	2.8s	N/A	$0	$0
28	182	Qwen 2.5 72B Turbo	1044	±7	3K	2.6%	<0.1%	84 tps	0.8s	131K	$0.60	$0.60
29	129	Command A	1039	±2	82.1K	2.2%	2.2%	42 tps	0.8s	256K	$2.00	$7.33
30	133	Qwen3 14B	1037	±2	12.5K	5.7%	1.7%	109 tps	0.8s	41K	$0.04	$0.15
31	113	Kimi K2 Fast	1037	±2	107.2K	4.5%	0.8%	365 tps	0.5s	131K	$1.00	$3.00
32	126	DeepSeek V3	1035	±2	57.9K	1.7%	0.9%	69 tps	1.1s	64K	$0.59	$1.49
33	170	Llama 3.1 8B Turbo	1027	±3	8.2K	1.4%	2.1%	650 tps	0.5s	128K	$0.13	$0.14
34	139	Qwen3 VL 30B A3B Instruct	1022	±7	2.1K	4.8%	1.8%	80 tps	2.6s	129K	$0.18	$0.67
35	133	DeepSeek-R1 0528	1020	±3	12.3K	2.1%	1.3%	93 tps	0.5s	64K	$1.60	$3.67
36	165	Qwen3 VL 30B A3B Thinking	1020	±4	3.5K	6.5%	4.5%	84 tps	2.9s	127K	$0.20	$1.47
37	200	NVIDIA Llama 3.1 Nemotron 70B	1016	±2	23.2K	1.0%	<0.1%	9 tps	0.1s	128K	$0.33	$0.39
38	219	NVIDIA Llama 3.3 Nemotron Super 49B v1	1012	±2	15.5K	1.1%	<0.1%	13 tps	N/A	131K	$0.07	$0.20
39	153	Qwen 2.5 32B Instruct	1011	±3	16.8K	3.1%	2.5%	48 tps	1.0s	131K	$0.21	$0.25
40	161	Mistral Small 3.1	1006	±3	9.7K	2.0%	7.4%	13 tps	2.6s	32K	$0.17	$0.28

1of3

View All (111 models)