Leaderboard | Text

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

Topics

Choose topic

All topics Facts and Information Creative Writing and Ideation Logic and Problem-Solving Task Completion Coding

Choose language

All languages English Chinese Arabic Spanish Indonesian Japanese

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

1485

Claude Opus 4.6 (Thinking)

1483

Claude Opus 4.6

1344

Gemini 3.1 Pro

1295

Claude Sonnet 4.6

1274

GPT-5.2 Instant

1274

GPT-5.1 (High)

1270

Gemini 3 Pro (Low)

1268

GPT-5.1

1265

Gemini 3 Pro

1260

Claude Sonnet 4.5 (Thinking)

1248

Claude Opus 4.5 (Thinking)

1235

Claude Sonnet 4.6 (Thinking)

1228

Claude Opus 4.5

1220

GPT-5 Chat

1213

GPT-5.2

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
1	1	Claude Opus 4.6 (Thinking)	1485	±11	2.3K	1.3%	2.5%	56 tps	1.6s	200K	$5.00	$25.00
2	2	Claude Opus 4.6	1483	±11	3.2K	0.9%	2.1%	48 tps	1.7s	200K	$5.00	$25.00
3	6	Gemini 3.1 Pro	1344	±16	2.8K	2.4%	3.5%	35 tps	4.1s	1M	$2.00	$12.00
4	4	Claude Sonnet 4.6	1295	±16	1.8K	0.8%	1.6%	47 tps	1.2s	200K	$3.00	$15.00
5	10	GPT-5.2 Instant	1274	±8	3.9K	2.2%	1.7%	52 tps	2.0s	400K	$1.75	$14.00
6	8	GPT-5.1 (High)	1274	±6	4.2K	2.5%	3.2%	76 tps	6.9s	400K	$1.25	$10.00
7	14	Gemini 3 Pro (Low)	1270	±12	3.5K	2.4%	2.4%	51 tps	3.5s	1M	$2.00	$12.00
8	8	GPT-5.1	1268	±6	3.3K	1.5%	2.3%	71 tps	1.4s	400K	$1.42	$11.33
9	10	Gemini 3 Pro	1265	±5	12K	1.3%	2.1%	50 tps	3.6s	1M	$2.00	$12.00
10	10	Claude Sonnet 4.5 (Thinking)	1260	±4	8.3K	1.8%	1.9%	44 tps	1.1s	200K	$3.00	$15.00
11	7	Claude Opus 4.5 (Thinking)	1248	±6	9.3K	2.3%	1.8%	49 tps	1.4s	200K	$5.00	$25.00
12	5	Claude Sonnet 4.6 (Thinking)	1235	±18	1.7K	2.0%	4.7%	57 tps	1.1s	200K	$3.00	$15.00
13	17	Claude Opus 4.5	1228	±9	2.9K	1.8%	1.5%	45 tps	1.5s	200K	$5.00	$25.00
14	22	GPT-5 Chat	1220	±6	10.4K	2.1%	1.3%	95 tps	0.9s	400K	$1.25	$10.00
15	16	GPT-5.2	1213	±12	2.7K	2.2%	4.1%	18 tps	2.7s	400K	$1.75	$14.00
16	14	Gemini 3 Flash Preview Thinking	1212	±8	4.2K	1.8%	1.6%	3 tps	6.2s	1M	$0.50	$3.00
17	17	Gemini 3 Flash Preview	1198	±14	2K	2.0%	1.3%	138 tps	1.4s	1M	$0.50	$3.00
18	32	Gemini 2.5 Pro High	1189	±6	4.5K	2.5%	1.5%	48 tps	2.3s	1M	$1.25	$10.00
19	22	GLM 5	1184	±22	795	2.5%	3.4%	36 tps	2.7s	200K	$0.72	$2.55
20	17	GPT-5.2 (High)	1175	±10	6.4K	1.7%	6.7%	18 tps	16.3s	400K	$1.75	$14.00
21	42	GPT-5.2 (Extra High)	1168	±12	2.4K	1.8%	13.2%	17 tps	20.5s	400K	$1.75	$14.00
22	13	GPT-5.3 Instant	1160	±19	1.5K	1.9%	0.9%	63 tps	0.8s	400K	$1.75	$14.00
23	44	Gemini 2.5 Pro	1158	±5	7.8K	3.3%	2.3%	45 tps	2.6s	1M	$1.25	$10.00
24	52	GPT-5	1157	±7	5.3K	2.9%	3.1%	78 tps	23.1s	400K	$1.25	$9.67
25	37	Claude Sonnet 4.5	1156	±5	5.1K	2.8%	1.4%	41 tps	1.3s	200K	$1.80	$9.00
26	71	Gemini 2.5 Flash Thinking	1148	±7	2.6K	4.2%	2.2%	88 tps	6.4s	1M	$0.30	$2.50
27	48	Claude Sonnet 4 (Thinking)	1144	±6	4.2K	3.5%	1.5%	52 tps	1.5s	200K	$3.00	$13.67
28	29	Nova Experimental Chat 12-10	1135	±19	720	2.0%	2.4%	84 tps	12.9s	98K	$0	$0
29	60	Gemini 2.5 Flash Preview 0925	1131	±10	2.1K	2.7%	1.2%	5 tps	0.9s	1M	$0.13	$0.97
30	26	Grok 4.1 Fast Non-Reasoning	1130	±15	2K	4.3%	0.9%	101 tps	0.5s	2M	$0.20	$0.50
31	33	Qwen3 30B A3B Instruct 2507	1127	±9	2.5K	2.9%	1.2%	55 tps	1.3s	131K	$0.13	$0.72
32	81	OpenAI o3-pro	1126	±10	2.3K	3.1%	5.2%	22 tps	70.8s	200K	$20.00	$80.00
33	40	Qwen3 235B A22B Instruct 2507	1118	±11	2.5K	2.1%	6.8%	13 tps	1.9s	262K	$0.13	$0.52
34	68	Qwen Plus (Aug'24)	1115	±8	1.9K	2.6%	1.4%	53 tps	1.3s	30K	$0.40	$1.20
35	26	Claude Haiku 4.5 (Extended Thinking)	1115	±12	2.2K	2.7%	1.4%	115 tps	0.7s	200K	$1.00	$5.00
36	29	Qwen3 VL 235B A22B Instruct	1114	±8	1.3K	2.5%	3.1%	75 tps	1.9s	129K	$0.37	$1.81
37	71	Qwen3.5 397B A17B	1112	±24	580	2.5%	4.3%	57 tps	1.4s	256K	$0.52	$3.00
38	26	GPT-5 (High)	1110	±7	4.3K	3.1%	4.5%	81 tps	35.9s	400K	$1.25	$10.00
39	95	Gemini 2.5 Flash	1098	±9	4.8K	2.7%	1.3%	2 tps	3.7s	1M	$0.30	$2.50
40	68	Grok 4	1093	±5	5.7K	4.0%	3.9%	29 tps	11.1s	256K	$3.00	$15.00

1of3

View All (107 models)