Leaderboard | Text

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
1	1	Claude Opus 4.6 (Thinking)	1569	±10	1.5K	1.7%	2.5%	56 tps	1.6s	200K	$5.00	$25.00
2	2	GPT-5.4	1493	±14	695	1.4%	2.6%	55 tps	0.8s	1M	$2.50	$15.00
3	4	GPT-5.4 (High)	1471	±16	805	1.2%	4.6%	68 tps	7.9s	1M	$2.50	$15.00
4	2	Claude Opus 4.6	1469	±16	1.6K	2.4%	2.1%	48 tps	1.7s	200K	$5.00	$25.00
5	6	Gemini 3.1 Pro	1418	±11	4.9K	1.0%	3.5%	35 tps	4.1s	1M	$2.00	$12.00
6	8	GPT-5.1 (High)	1368	±10	4.9K	2.2%	3.2%	76 tps	6.9s	400K	$1.25	$10.00
7	8	GPT-5.1 (Medium)	1364	±12	995	4.3%	<0.1%	86 tps	3.8s	400K	$0.83	$6.67
8	4	Claude Sonnet 4.6	1364	±19	1.9K	1.3%	1.6%	47 tps	1.2s	200K	$3.00	$15.00
9	10	GPT-5.2 Instant	1361	±11	4K	1.1%	1.7%	52 tps	2.0s	400K	$1.75	$14.00
10	8	GPT-5.1	1360	±9	2.6K	2.4%	2.3%	71 tps	1.4s	400K	$1.42	$11.33
11	33	Qwen3 30B A3B Instruct 2507	1345	±6	3.4K	1.4%	1.2%	55 tps	1.3s	131K	$0.13	$0.72
12	10	Gemini 3 Pro	1343	±7	16.2K	1.4%	2.1%	50 tps	3.6s	1M	$2.00	$12.00
13	16	GPT-5.2	1329	±9	3.2K	1.2%	4.1%	18 tps	2.7s	400K	$1.75	$14.00
14	19	Mistral Medium 3.1	1328	±8	3.1K	1.9%	<0.1%	77 tps	0.7s	128K	$0.40	$2.00
15	7	Claude Opus 4.5 (Thinking)	1313	±9	6K	2.5%	1.8%	49 tps	1.4s	200K	$5.00	$25.00
16	37	Nova Experimental Chat 10-20	1313	±11	1.4K	7.9%	<0.1%	30 tps	0.5s	98K	$0	$0
17	14	Gemini 3 Pro (Low)	1300	±8	4.1K	1.8%	2.4%	51 tps	3.5s	1M	$2.00	$12.00
18	213	DeepSeek R1T Chimera	1289	±15	770	3.1%	<0.1%	46 tps	1.1s	164K	$0.09	$0.36
19	17	Gemini 3 Flash Preview	1281	±11	2K	1.2%	1.3%	138 tps	1.4s	1M	$0.50	$3.00
20	5	Claude Sonnet 4.6 (Thinking)	1280	±14	1.4K	2.2%	4.7%	57 tps	1.1s	200K	$3.00	$15.00
21	17	GPT-5.2 (High)	1275	±12	6.5K	1.4%	6.7%	18 tps	16.3s	400K	$1.75	$14.00
22	29	Qwen3 VL 235B A22B Instruct	1273	±14	1.1K	2.3%	3.1%	75 tps	1.9s	129K	$0.37	$1.81
23	22	GPT-5 Chat	1269	±5	7.9K	1.6%	1.3%	95 tps	0.9s	400K	$1.25	$10.00
24	40	Qwen3 235B A22B Instruct 2507	1261	±8	3.1K	1.4%	6.8%	13 tps	1.9s	262K	$0.13	$0.52
25	10	Claude Sonnet 4.5 (Thinking)	1261	±7	5.5K	1.9%	1.9%	44 tps	1.1s	200K	$3.00	$15.00
26	26	Grok 4.1 Fast Non-Reasoning	1260	±16	2.5K	3.7%	0.9%	101 tps	0.5s	2M	$0.20	$0.50
27	56	Gemini 2.5 Pro Low	1259	±9	2.4K	2.4%	<0.1%	89 tps	2.4s	1M	$1.25	$10.00
28	16	Nova Experimental Chat 11-10	1252	±14	1.7K	3.6%	0.4%	84 tps	8.9s	98K	$0	$0
29	14	Gemini 3 Flash Preview Thinking	1248	±10	3.7K	1.3%	1.6%	3 tps	6.2s	1M	$0.50	$3.00
30	32	Gemini 2.5 Pro High	1234	±6	4.6K	2.4%	1.5%	48 tps	2.3s	1M	$1.25	$10.00
31	62	GPT-5.1 Instant	1233	±12	2.6K	2.7%	1.3%	50 tps	1.9s	400K	$1.25	$10.00
32	81	GPT-4o	1228	±11	2.9K	1.7%	1.0%	49 tps	2.4s	128K	$3.71	$12.57
33	13	GPT-5.3 Instant	1225	±13	2.3K	1.3%	0.9%	63 tps	0.8s	400K	$1.75	$14.00
34	42	Qwen3 Max Instruct Preview	1192	±9	2.8K	3.0%	1.1%	31 tps	1.7s	256K	$1.43	$6.61
35	29	Nova Experimental Chat 12-10	1192	±11	1.4K	0.7%	2.4%	84 tps	12.9s	98K	$0	$0
36	56	Claude Opus 4.1 (Thinking)	1177	±11	1.1K	3.8%	<0.1%	20 tps	3.9s	200K	$15.00	$75.00
37	17	Claude Opus 4.5	1177	±15	1.9K	4.7%	1.5%	45 tps	1.5s	200K	$5.00	$25.00
38	77	GPT-4.5 Preview	1173	±13	495	2.9%	<0.1%	36 tps	3.0s	200K	$75.00	$150.00
39	42	GPT-5.2 (Extra High)	1172	±13	3K	1.6%	13.2%	17 tps	20.5s	400K	$1.75	$14.00
40	80	GPT-5 (Minimal)	1169	±10	1.9K	2.6%	<0.1%	67 tps	1.4s	400K	$1.25	$10.00

View All (175 models)