Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Topics

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

1079

Mistral Medium

1078

Claude Haiku 4.5

1076

Gemini 2.5 Flash Preview

1074

Qwen Turbo

1073

DeepSeek-R1 Turbo

1072

Kimi K2 Thinking

1071

MiniMax M2.1

1069

Kimi K2 Fast

1069

DeepSeek V3.1 Chat

1068

DeepSeek V3 0324 Turbo

1062

Grok 4

1061

Gemini 2.5 Flash

1061

Qwen3 32B Fast

1060

DeepSeek V3.1 Terminus Chat

1059

Nemotron 3 Nano (Thinking)

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
81	113	Mistral Medium	1079	±9	2.7K	1.8%	1.8%	48 tps	0.6s	33K	$1.48	$4.55
82	52	Claude Haiku 4.5	1078	±11	2.6K	3.7%	1.1%	100 tps	0.9s	200K	$1.00	$5.00
83	100	Gemini 2.5 Flash Preview	1076	±14	640	0.8%	<0.1%	138 tps	6.9s	1M	$0.15	$0.60
84	159	Qwen Turbo	1074	±7	2.9K	1.2%	<0.1%	53 tps	1.1s	1M	$0.05	$0.20
85	95	DeepSeek-R1 Turbo	1073	±16	780	3.7%	2.6%	29 tps	1.8s	64K	$2.85	$4.75
86	95	Kimi K2 Thinking	1072	±30	880	8.8%	4.2%	61 tps	5.9s	262K	$0.24	$1.03
87	60	MiniMax M2.1	1071	±11	2.6K	1.1%	2.1%	66 tps	2.6s	205K	$0.30	$1.20
88	113	Kimi K2 Fast	1069	±7	8.5K	1.0%	0.8%	365 tps	0.5s	131K	$1.00	$3.00
89	86	DeepSeek V3.1 Chat	1069	±11	1.3K	3.0%	2.8%	21 tps	1.6s	131K	$0.38	$1.00
90	93	DeepSeek V3 0324 Turbo	1068	±7	4.2K	0.8%	6.3%	12 tps	2.4s	164K	$0.73	$1.79
91	68	Grok 4	1062	±6	10.5K	1.3%	3.9%	29 tps	11.1s	256K	$3.00	$15.00
92	95	Gemini 2.5 Flash	1061	±7	10K	1.0%	1.3%	2 tps	3.7s	1M	$0.30	$2.50
93	121	Qwen3 32B Fast	1061	±9	3K	2.4%	11.6%	30 tps	3.1s	41K	$0.10	$0.25
94	44	DeepSeek V3.1 Terminus Chat	1060	±13	1.4K	3.3%	3.4%	27 tps	1.5s	131K	$0.86	$1.80
95	86	Nemotron 3 Nano (Thinking)	1059	±18	825	1.8%	2.0%	200 tps	0.5s	256K	$0	$0
96	84	GPT-5 Mini Minimal	1057	±11	745	5.7%	1.2%	63 tps	1.4s	400K	$0.25	$2.00
97	71	Gemini 2.5 Flash Thinking	1055	±11	2.8K	2.3%	2.2%	88 tps	6.4s	1M	$0.30	$2.50
98	65	DeepSeek V3.2 Exp Chat	1054	±14	1.2K	3.7%	2.6%	29 tps	1.5s	131K	$0.27	$0.39
99	52	Qwen3.5 122B A17B	1053	±25	580	3.3%	1.5%	82 tps	1.4s	256K	$0.40	$3.20
100	133	Qwen3 14B	1053	±13	1.7K	2.9%	1.7%	109 tps	0.8s	41K	$0.04	$0.15
101	95	Gemini 2.5 Flash Lite Thinking Preview 0925	1051	±15	1.5K	4.2%	1.5%	152 tps	3.0s	1M	$0.10	$0.40
102	56	MiniMax M2.1 Lightning	1050	±23	615	1.6%	1.7%	52 tps	2.1s	205K	$0.30	$2.40
103	157	Qwen3 Next 80B A3B Thinking	1049	±9	2K	2.6%	0.6%	175 tps	1.3s	256K	$0.21	$2.26
104	147	GLM 4.5 Air	1047	±7	1.8K	2.2%	<0.1%	22 tps	1.4s	131K	$0.10	$0.38
105	68	GLM 4.7	1047	±15	2.4K	1.2%	5.8%	40 tps	1.5s	200K	$0.77	$1.73
106	71	GPT-5 Mini	1047	±9	2.2K	2.7%	2.6%	66 tps	14.2s	400K	$0.25	$2.00
107	37	Kimi K2.5 Instant	1046	±16	620	2.4%	2.9%	32 tps	3.0s	262K	$0.50	$3.00
108	95	DeepSeek V3.2 Exp Thinking	1046	±18	735	4.5%	7.2%	26 tps	3.0s	131K	$0.28	$0.42
109	106	DeepSeek V3 0324	1045	±8	4.1K	1.0%	5.8%	12 tps	2.7s	164K	$0.38	$0.93
110	106	Grok 3	1044	±7	6K	1.1%	1.5%	53 tps	0.6s	1M	$3.67	$18.33
111	113	Gemini 2.5 Flash Lite Thinking	1041	±9	2.2K	1.8%	1.0%	118 tps	4.4s	1M	$0.03	$0.13
112	133	DeepSeek-R1 0528	1038	±13	1.7K	2.0%	1.3%	93 tps	0.5s	64K	$1.60	$3.67
113	81	OpenAI o3-pro	1037	±18	950	2.6%	5.2%	22 tps	70.8s	200K	$20.00	$80.00
114	165	DeepSeek R1T2 Chimera	1031	±17	575	3.4%	3.0%	28 tps	1.8s	164K	$0.13	$0.45
115	48	Claude Sonnet 4 (Thinking)	1028	±15	3.7K	2.9%	1.5%	52 tps	1.5s	200K	$3.00	$13.67
116	126	Qwen3 VL 235B A22B Thinking	1027	±13	935	4.1%	4.3%	47 tps	3.0s	127K	$0.47	$3.31
117	62	MiniMax M2	1027	±9	2.5K	5.2%	2.2%	39 tps	2.3s	205K	$0.21	$0.85
118	129	Qwen3 Max Thinking	1022	±12	1.3K	1.1%	13.5%	32 tps	2.3s	256K	$1.20	$6.00
119	71	Qwen3.5 397B A17B	1021	±22	910	1.1%	4.3%	57 tps	1.4s	256K	$0.52	$3.00
120	124	Kimi K2 0905 Turbo	1017	±12	2.1K	2.3%	0.7%	373 tps	0.5s	262K	$1.70	$6.50

3of6

View All (223 models)