Leaderboard | Coding

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

1098

Grok 3

1098

Gemini 2.5 Flash

1099

Qwen3 Coder 480B A35B Instruct

1100

DeepSeek V3 0324

1102

Step 3.5 Flash

1102

GPT-4o

1102

Grok 3 Fast

1103

Gemini 2.5 Flash Lite

1104

GPT-5 Mini Low

1107

Qwen Max

1107

DeepSeek V3.2 Exp Chat

1110

Qwen3 Omni 30B A3B Thinking

1110

DeepSeek V3.1 Chat

1111

Gemini 2.5 Pro Preview 0325

1113

GPT-5.2 Codex (Low)

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
281	98	Grok 3	1098	±4	19.1K	5.5%	1.5%	53 tps	0.6s	1M	$3.67	$18.33
282	98	Gemini 2.5 Flash	1098	±4	35.9K	3.2%	1.3%	2 tps	3.7s	1M	$0.30	$2.50
283	90	Qwen3 Coder 480B A35B Instruct	1099	±8	3.1K	4.5%	3.3%	61 tps	2.0s	262K	$0.71	$1.34
284	90	DeepSeek V3 0324	1100	±4	15.1K	4.3%	5.8%	12 tps	2.7s	164K	$0.38	$0.93
285	90	Step 3.5 Flash	1102	±24	810	3.6%	2.2%	109 tps	0.6s	256K	$0.05	$0.15
286	90	GPT-4o	1102	±5	8.5K	3.7%	1.0%	49 tps	2.4s	128K	$3.71	$12.57
287	90	Grok 3 Fast	1102	±14	2.5K	4.7%	1.7%	52 tps	2.4s	131K	$5.00	$25.00
288	90	Gemini 2.5 Flash Lite	1103	±5	21.3K	6.2%	1.3%	210 tps	0.7s	1M	$0.10	$0.40
289	114	GPT-5 Mini Low	1104	±8	2.8K	7.2%	<0.1%	69 tps	3.2s	400K	$0.25	$2.00
290	90	Qwen Max	1107	±4	18.3K	4.2%	1.5%	49 tps	1.5s	33K	$1.60	$6.40
291	90	DeepSeek V3.2 Exp Chat	1107	±4	5.5K	6.1%	2.6%	29 tps	1.5s	131K	$0.27	$0.39
292	85	Qwen3 Omni 30B A3B Thinking	1110	±10	2.3K	6.0%	3.7%	67 tps	1.2s	66K	$0.97	$1.79
293	85	DeepSeek V3.1 Chat	1110	±7	4.9K	6.6%	2.8%	21 tps	1.6s	131K	$0.38	$1.00
294	108	Gemini 2.5 Pro Preview 0325	1111	±11	1.5K	3.2%	<0.1%	3 tps	16.6s	1M	$1.25	$10.00
295	85	GPT-5.2 Codex (Low)	1113	±19	1.2K	3.2%	4.5%	41 tps	5.0s	400K	$1.75	$14.00
296	85	GPT-5 Mini Minimal	1114	±12	3.2K	8.5%	1.2%	63 tps	1.4s	400K	$0.25	$2.00
297	85	Gemini 2.5 Flash Thinking	1118	±4	13.7K	3.6%	2.2%	88 tps	6.4s	1M	$0.30	$2.50
298	97	Gemini 2.5 Pro Preview 0605	1121	±10	1.7K	2.3%	<0.1%	0 tps	3.7s	1M	$1.25	$10.00
299	77	Gemini 2.5 Flash Lite Preview 0925	1122	±7	8.5K	6.6%	1.2%	209 tps	0.7s	1M	$0.25	$0.35
300	77	GPT-4.1	1123	±5	32.8K	5.2%	3.7%	112 tps	1.3s	1M	$2.00	$8.00
301	97	Ministral 8B 2512	1125	±15	510	7.3%	<0.1%	174 tps	0.5s	128K	$0.15	$0.15
302	77	Grok 4	1125	±3	39.6K	4.4%	3.9%	29 tps	11.1s	256K	$3.00	$15.00
303	77	Qwen3 Max Thinking Preview	1127	±10	6.3K	5.7%	3.1%	40 tps	2.1s	256K	$1.20	$6.00
304	77	Grok 4.20 Multi Agent Beta	1129	±19	945	3.6%	1.2%	56 tps	8.8s	2M	$2.00	$6.00
305	77	DeepSeek V3.1 Turbo	1130	±7	4.8K	5.3%	0.9%	173 tps	1.3s	164K	$2.00	$3.75
306	77	GPT-5 Mini	1131	±5	8.6K	5.4%	2.6%	66 tps	14.2s	400K	$0.25	$2.00
307	77	Mistral Large 3	1131	±8	5.4K	5.8%	2.1%	51 tps	1.0s	256K	$0.50	$1.50
308	97	Grok 3 Beta	1134	±9	2K	0.8%	<0.1%	58 tps	0.8s	131K	$3.00	$15.00
309	93	Gemini 2.5 Flash Preview Thinking	1136	±10	1.4K	1.8%	<0.1%	26 tps	1.8s	1M	$0.15	$1.76
310	74	Gemini 2.5 Flash Preview 0925	1140	±6	7.6K	6.0%	1.2%	5 tps	0.9s	1M	$0.13	$0.97
311	74	Qwen3.5 397B A17B	1142	±14	2.5K	2.9%	4.3%	57 tps	1.4s	256K	$0.52	$3.00
312	74	Qwen Plus (Aug'24)	1146	±5	17.2K	4.7%	1.4%	53 tps	1.3s	30K	$0.40	$1.20
313	69	DeepSeek V3.1 Terminus Chat	1158	±5	6.5K	6.9%	3.4%	27 tps	1.5s	131K	$0.86	$1.80
314	86	GPT-5 (Minimal)	1158	±5	8.3K	7.4%	<0.1%	67 tps	1.4s	400K	$1.25	$10.00
315	86	Gemini 2.5 Flash Preview	1161	±8	3K	1.1%	<0.1%	138 tps	6.9s	1M	$0.15	$0.60
316	69	GLM 4.7	1161	±7	16.8K	3.7%	5.8%	40 tps	1.5s	200K	$0.77	$1.73
317	69	GPT-5 Codex (Low)	1163	±10	5K	4.1%	2.7%	112 tps	3.5s	400K	$1.25	$10.00
318	69	Qwen3.5 35B A3B	1164	±25	865	3.9%	2.1%	116 tps	2.1s	256K	$0.63	$1.13
319	69	gpt-oss-120b	1165	±5	19.2K	5.0%	0.7%	213 tps	0.5s	131K	$0.11	$0.50
320	60	Grok 4.20 Beta Reasoning	1167	±22	1.2K	4.1%	1.1%	77 tps	4.5s	2M	$2.00	$5.50

8of11

View All (404 models)