Leaderboard | Coding

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

1089

DeepSeek V3.1

1090

OpenAI o3-pro

1093

DeepSeek V3 0324 Turbo

1098

Grok 3

1098

Gemini 2.5 Flash

1099

Qwen3 Coder 480B A35B Instruct

1100

DeepSeek V3 0324

1102

GPT-4o

1102

Grok 3 Fast

1103

Gemini 2.5 Flash Lite

1107

Qwen Max

1110

Qwen3 Omni 30B A3B Thinking

1110

DeepSeek V3.1 Chat

1113

GPT-5.2 Codex (Low)

1114

GPT-5 Mini Minimal

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
121	98	DeepSeek V3.1	1089	±12	2.3K	4.7%	0.8%	197 tps	0.4s	164K	$0.55	$1.60
122	98	OpenAI o3-pro	1090	±8	5.4K	4.3%	5.2%	22 tps	70.8s	200K	$20.00	$80.00
123	98	DeepSeek V3 0324 Turbo	1093	±5	15.5K	5.7%	6.3%	12 tps	2.4s	164K	$0.73	$1.79
124	98	Grok 3	1098	±4	19.1K	5.5%	1.5%	53 tps	0.6s	1M	$3.67	$18.33
125	98	Gemini 2.5 Flash	1098	±4	35.9K	3.2%	1.3%	2 tps	3.7s	1M	$0.30	$2.50
126	90	Qwen3 Coder 480B A35B Instruct	1099	±8	3.1K	4.5%	3.3%	61 tps	2.0s	262K	$0.71	$1.34
127	90	DeepSeek V3 0324	1100	±4	15.1K	4.3%	5.8%	12 tps	2.7s	164K	$0.38	$0.93
128	90	GPT-4o	1102	±5	8.5K	3.7%	1.0%	49 tps	2.4s	128K	$3.71	$12.57
129	90	Grok 3 Fast	1102	±14	2.5K	4.7%	1.7%	52 tps	2.4s	131K	$5.00	$25.00
130	90	Gemini 2.5 Flash Lite	1103	±5	21.3K	6.2%	1.3%	210 tps	0.7s	1M	$0.10	$0.40
131	90	Qwen Max	1107	±4	18.3K	4.2%	1.5%	49 tps	1.5s	33K	$1.60	$6.40
132	85	Qwen3 Omni 30B A3B Thinking	1110	±10	2.3K	6.0%	3.7%	67 tps	1.2s	66K	$0.97	$1.79
133	85	DeepSeek V3.1 Chat	1110	±7	4.9K	6.6%	2.8%	21 tps	1.6s	131K	$0.38	$1.00
134	85	GPT-5.2 Codex (Low)	1113	±19	1.2K	3.2%	4.5%	41 tps	5.0s	400K	$1.75	$14.00
135	85	GPT-5 Mini Minimal	1114	±12	3.2K	8.5%	1.2%	63 tps	1.4s	400K	$0.25	$2.00
136	85	Gemini 2.5 Flash Thinking	1118	±4	13.7K	3.6%	2.2%	88 tps	6.4s	1M	$0.30	$2.50
137	77	Gemini 2.5 Flash Lite Preview 0925	1122	±7	8.5K	6.6%	1.2%	209 tps	0.7s	1M	$0.25	$0.35
138	77	GPT-4.1	1123	±5	32.8K	5.2%	3.7%	112 tps	1.3s	1M	$2.00	$8.00
139	77	Grok 4	1125	±3	39.6K	4.4%	3.9%	29 tps	11.1s	256K	$3.00	$15.00
140	77	Qwen3 Max Thinking Preview	1127	±10	6.3K	5.7%	3.1%	40 tps	2.1s	256K	$1.20	$6.00
141	77	Grok 4.20 Multi Agent Beta	1129	±19	945	3.6%	1.2%	56 tps	8.8s	2M	$2.00	$6.00
142	77	DeepSeek V3.1 Turbo	1130	±7	4.8K	5.3%	0.9%	173 tps	1.3s	164K	$2.00	$3.75
143	77	GPT-5 Mini	1131	±5	8.6K	5.4%	2.6%	66 tps	14.2s	400K	$0.25	$2.00
144	74	Gemini 2.5 Flash Preview 0925	1140	±6	7.6K	6.0%	1.2%	5 tps	0.9s	1M	$0.13	$0.97
145	74	Qwen3.5 397B A17B	1142	±14	2.5K	2.9%	4.3%	57 tps	1.4s	256K	$0.52	$3.00
146	74	Qwen Plus (Aug'24)	1146	±5	17.2K	4.7%	1.4%	53 tps	1.3s	30K	$0.40	$1.20
147	69	DeepSeek V3.1 Terminus Chat	1158	±5	6.5K	6.9%	3.4%	27 tps	1.5s	131K	$0.86	$1.80
148	69	GLM 4.7	1161	±7	16.8K	3.7%	5.8%	40 tps	1.5s	200K	$0.77	$1.73
149	69	GPT-5 Codex (Low)	1163	±10	5K	4.1%	2.7%	112 tps	3.5s	400K	$1.25	$10.00
150	69	Qwen3.5 35B A3B	1164	±25	865	3.9%	2.1%	116 tps	2.1s	256K	$0.63	$1.13
151	60	Grok 4.20 Beta Reasoning	1167	±22	1.2K	4.1%	1.1%	77 tps	4.5s	2M	$2.00	$5.50
152	60	GPT-5.1 Instant	1171	±8	8.3K	4.1%	1.3%	50 tps	1.9s	400K	$1.25	$10.00
153	60	GPT-5.1 Codex (Medium)	1171	±14	3K	3.2%	4.6%	71 tps	3.7s	400K	$1.25	$10.00
154	60	Claude Sonnet 3.5 v2	1171	±6	5.5K	3.4%	<0.1%	46 tps	1.4s	200K	$3.00	$15.00
155	60	Qwen3 235B A22B Instruct 2507	1172	±4	12.6K	6.4%	6.8%	13 tps	1.9s	262K	$0.13	$0.52
156	60	Gemini 2.5 Pro	1176	±3	37.9K	4.8%	2.3%	45 tps	2.6s	1M	$1.25	$10.00
157	60	Grok 4 Fast Reasoning	1177	±3	14.5K	5.0%	2.1%	102 tps	3.1s	2M	$0.30	$0.75
158	60	Grok 4.1 Fast Reasoning	1178	±7	39.5K	4.4%	1.5%	58 tps	7.3s	2M	$0.20	$0.50
159	49	GPT-5.3 Codex (Low)	1178	±28	510	1.0%	1.8%	61 tps	4.3s	400K	$1.75	$14.00
160	49	GLM 4.6	1182	±7	17.2K	4.4%	5.4%	39 tps	1.5s	200K	$0.42	$1.66

4of6

View All (210 models)