Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Topics

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

1016

Grok 4.1 Fast Reasoning

1022

Grok 4 Fast Reasoning

1022

Grok 4

1023

Grok 4 Fast Non-Reasoning

1023

Grok 4.1 Fast Non-Reasoning

1025

Gemini 2.5 Flash Preview 0925

1036

MiniMax M2.1

1038

Qwen3 Next 80B A3B Instruct

1040

Claude Sonnet 4.5

1040

GPT-5.1 Instant

1048

Qwen Plus (Aug'24)

1049

Gemini 2.5 Flash

1053

Qwen3 235B A22B Instruct 2507

1056

Qwen3 30B A3B Instruct 2507

1059

GLM 4.6

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
41	44	Grok 4.1 Fast Reasoning	1016	±18	1.4K	2.0%	1.5%	58 tps	7.3s	2M	$0.20	$0.50
42	48	Grok 4 Fast Reasoning	1022	±14	1.2K	2.0%	2.1%	102 tps	3.1s	2M	$0.30	$0.75
43	68	Grok 4	1022	±10	2.1K	2.5%	3.9%	29 tps	11.1s	256K	$3.00	$15.00
44	52	Grok 4 Fast Non-Reasoning	1023	±16	870	1.7%	1.5%	93 tps	0.6s	2M	$0.27	$0.67
45	26	Grok 4.1 Fast Non-Reasoning	1023	±21	820	1.8%	0.9%	101 tps	0.5s	2M	$0.20	$0.50
46	60	Gemini 2.5 Flash Preview 0925	1025	±13	1.2K	2.0%	1.2%	5 tps	0.9s	1M	$0.13	$0.97
47	60	MiniMax M2.1	1036	±22	695	1.4%	2.1%	66 tps	2.6s	205K	$0.30	$1.20
48	33	Qwen3 Next 80B A3B Instruct	1038	±15	920	2.6%	0.6%	84 tps	1.1s	256K	$0.20	$1.42
49	37	Claude Sonnet 4.5	1040	±8	2K	3.2%	1.4%	41 tps	1.3s	200K	$1.80	$9.00
50	62	GPT-5.1 Instant	1040	±13	915	2.7%	1.3%	50 tps	1.9s	400K	$1.25	$10.00
51	68	Qwen Plus (Aug'24)	1048	±22	730	2.0%	1.4%	53 tps	1.3s	30K	$0.40	$1.20
52	95	Gemini 2.5 Flash	1049	±18	2.1K	1.9%	1.3%	2 tps	3.7s	1M	$0.30	$2.50
53	40	Qwen3 235B A22B Instruct 2507	1053	±19	680	0.7%	6.8%	13 tps	1.9s	262K	$0.13	$0.52
54	33	Qwen3 30B A3B Instruct 2507	1056	±18	810	2.4%	1.2%	55 tps	1.3s	131K	$0.13	$0.72
55	65	GLM 4.6	1059	±25	640	2.3%	5.4%	39 tps	1.5s	200K	$0.42	$1.66
56	52	Claude Haiku 4.5	1060	±13	1.6K	3.1%	1.1%	100 tps	0.9s	200K	$1.00	$5.00
57	26	GPT-5 (High)	1061	±9	2.5K	2.7%	4.5%	81 tps	35.9s	400K	$1.25	$10.00
58	44	DeepSeek V3.1 Terminus Chat	1078	±12	580	1.7%	3.4%	27 tps	1.5s	131K	$0.86	$1.80
59	42	Qwen3 Max Instruct Preview	1083	±17	1.1K	1.7%	1.1%	31 tps	1.7s	256K	$1.43	$6.61
60	48	Claude Sonnet 4 (Thinking)	1093	±14	1.6K	2.4%	1.5%	52 tps	1.5s	200K	$3.00	$13.67
61	29	Qwen3 VL 235B A22B Instruct	1094	±15	675	2.2%	3.1%	75 tps	1.9s	129K	$0.37	$1.81
62	10	Claude Sonnet 4.5 (Thinking)	1102	±13	3.2K	3.6%	1.9%	44 tps	1.1s	200K	$3.00	$15.00
63	42	GPT-5.2 (Extra High)	1107	±24	890	2.7%	13.2%	17 tps	20.5s	400K	$1.75	$14.00
64	33	Kimi K2.5	1110	±26	720	2.0%	6.5%	33 tps	1.7s	262K	$0.34	$2.57
65	13	GPT-5.3 Instant	1110	±33	515	1.0%	0.9%	63 tps	0.8s	400K	$1.75	$14.00
66	32	Gemini 2.5 Pro High	1119	±10	2.5K	2.4%	1.5%	48 tps	2.3s	1M	$1.25	$10.00
67	26	Claude Haiku 4.5 (Extended Thinking)	1123	±19	1.1K	1.9%	1.4%	115 tps	0.7s	200K	$1.00	$5.00
68	44	Gemini 2.5 Pro	1125	±6	3.1K	3.7%	2.3%	45 tps	2.6s	1M	$1.25	$10.00
69	17	Claude Opus 4.5	1135	±21	1.1K	1.4%	1.5%	45 tps	1.5s	200K	$5.00	$25.00
70	17	GPT-5.2 (High)	1145	±15	2.2K	1.6%	6.7%	18 tps	16.3s	400K	$1.75	$14.00
71	16	GPT-5.2	1162	±18	785	1.9%	4.1%	18 tps	2.7s	400K	$1.75	$14.00
72	22	GPT-5 Chat	1164	±12	3.5K	1.6%	1.3%	95 tps	0.9s	400K	$1.25	$10.00
73	17	Gemini 3 Flash Preview	1165	±21	675	1.5%	1.3%	138 tps	1.4s	1M	$0.50	$3.00
74	14	Gemini 3 Flash Preview Thinking	1167	±17	1.4K	1.7%	1.6%	3 tps	6.2s	1M	$0.50	$3.00
75	5	Claude Sonnet 4.6 (Thinking)	1222	±23	630	1.6%	4.7%	57 tps	1.1s	200K	$3.00	$15.00
76	8	GPT-5.1	1230	±13	1.3K	1.9%	2.3%	71 tps	1.4s	400K	$1.42	$11.33
77	8	GPT-5.1 (High)	1231	±15	1.8K	1.7%	3.2%	76 tps	6.9s	400K	$1.25	$10.00
78	14	Gemini 3 Pro (Low)	1240	±19	1.2K	0.8%	2.4%	51 tps	3.5s	1M	$2.00	$12.00
79	6	Gemini 3.1 Pro	1244	±23	1.4K	1.7%	3.5%	35 tps	4.1s	1M	$2.00	$12.00
80	10	Gemini 3 Pro	1248	±16	3.5K	1.5%	2.1%	50 tps	3.6s	1M	$2.00	$12.00

2of3

View All (85 models)