Leaderboard | Coding

Models

Choose model family

Claude by Anthropic

Mistral by Mistral AI

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

All Single turn Multiple turns

Open license models

Filter the leaderboard to only show models that have an open license.

All selected Open license Proprietary license

1038

GLM 4.5

1035

Gemini 2.5 Flash Lite Thinking Preview 0925

1034

Gemini 3.1 Flash Lite Preview

1034

Gemini 2.0 Flash

1032

DeepSeek V3

1032

LongCat Flash Chat

1030

QwQ 32B

1027

OpenAI o4-mini

1026

DeepSeek V3.2 Speciale

1024

Qwen3 235B A22B Thinking 2507

1021

Command A

1020

DeepSeek V3 (Turbo)

1019

OpenAI o1

1018

Grok Code Fast 1

1018

GPT-5 (Low)

Last updated about 1 month ago

Rank	Overall	Name	VIBE Score	Confidence Interval	Votes	Downvote %	Abort %	Speed	Latency	Context	Cost (Input)	Cost (Output)
121	112	GLM 4.5	1038	±9	3.2K	8.8%	3.7%	46 tps	1.4s	131K	$0.43	$1.63
122	135	Gemini 2.5 Flash Lite Thinking Preview 0925	1035	±10	4.6K	7.3%	1.5%	152 tps	3.0s	1M	$0.10	$0.40
123	135	Gemini 3.1 Flash Lite Preview	1034	±30	775	4.3%	1.0%	8 tps	1.2s	1M	$0.25	$1.50
124	144	Gemini 2.0 Flash	1034	±7	7.7K	3.6%	<0.1%	76 tps	0.5s	1M	$0.14	$0.56
125	135	DeepSeek V3	1032	±6	8.4K	2.7%	0.9%	69 tps	1.1s	64K	$0.59	$1.49
126	119	LongCat Flash Chat	1032	±12	2.2K	5.9%	0.8%	85 tps	0.9s	131K	$0.14	$0.68
127	135	QwQ 32B	1030	±9	6.4K	7.9%	5.4%	41 tps	2.1s	16K	$0.43	$0.56
128	128	OpenAI o4-mini	1027	±11	4.1K	8.7%	1.4%	97 tps	7.0s	128K	$1.10	$4.40
129	135	DeepSeek V3.2 Speciale	1026	±18	1.6K	6.9%	6.0%	43 tps	1.4s	131K	$0.84	$1.52
130	148	Qwen3 235B A22B Thinking 2507	1024	±22	1.9K	5.1%	2.5%	53 tps	1.6s	131K	$0.59	$5.70
131	144	Command A	1021	±7	11.9K	4.1%	2.2%	42 tps	0.8s	256K	$2.00	$7.33
132	105	DeepSeek V3 (Turbo)	1020	±25	865	6.0%	1.5%	32 tps	1.5s	64K	$0.40	$1.30
133	119	OpenAI o1	1019	±13	4.3K	4.2%	4.2%	92 tps	5.5s	200K	$15.00	$60.00
134	159	Grok Code Fast 1	1018	±10	2K	5.6%	5.9%	294 tps	0.5s	256K	$0.20	$1.50
135	112	GPT-5 (Low)	1018	±28	480	5.0%	1.8%	75 tps	8.2s	400K	$1.25	$10.00
136	148	Nemotron 3 Nano (Thinking)	1018	±17	1.4K	6.3%	2.0%	200 tps	0.5s	256K	$0	$0
137	119	DeepSeek V3.1 Terminus Thinking	1016	±15	2.1K	10.7%	5.9%	27 tps	1.8s	131K	$0.56	$1.68
138	128	Kimi K2 Thinking	1013	±13	2.5K	5.4%	4.2%	61 tps	5.9s	262K	$0.24	$1.03
139	105	Seed 1.8 251228	1010	±19	2.1K	3.0%	3.7%	41 tps	2.1s	256K	$0.25	$2.00
140	148	DeepSeek-R1 Turbo	1005	±15	1.6K	6.3%	2.6%	29 tps	1.8s	64K	$2.85	$4.75
141	159	GPT-5 Nano	1001	±12	3.6K	8.3%	3.2%	113 tps	20.9s	400K	$0.05	$0.40
142	167	Llama 3.1 8B Turbo	999	±16	1.6K	1.5%	2.1%	650 tps	0.5s	128K	$0.13	$0.14
143	128	GLM 4.5 AirX	999	±39	635	8.6%	3.3%	75 tps	1.2s	131K	$1.10	$4.50
144	144	OpenAI o3	991	±13	4.7K	3.7%	0.9%	85 tps	6.8s	128K	$7.33	$29.33
145	167	Mistral Small 3.2 24B	990	±11	4.4K	4.7%	2.8%	141 tps	0.7s	33K	$0.02	$0.08
146	167	Pixtral Large	988	±20	2.3K	3.6%	2.5%	57 tps	1.3s	128K	$1.50	$4.50
147	148	Seed 1.6 250615	988	±22	1.2K	6.0%	3.1%	46 tps	2.2s	256K	$0.25	$2.00
148	148	Qwen3 30B A3B	987	±10	4.5K	8.3%	5.1%	163 tps	1.0s	41K	$0.06	$0.21
149	112	gpt-oss-20b	983	±10	4.1K	10.1%	0.5%	216 tps	0.5s	131K	$0.06	$0.26
150	148	Qwen3 Coder Plus	981	±33	500	3.8%	5.1%	56 tps	2.3s	128K	$1.80	$9.80
151	90	Step 3.5 Flash	981	±44	510	3.8%	2.2%	109 tps	0.6s	256K	$0.05	$0.15
152	167	Devstral Medium	978	±17	3.4K	5.0%	1.5%	77 tps	0.6s	131K	$0.40	$2.00
153	167	Qwen 2.5 32B Instruct	976	±11	2.9K	6.2%	2.5%	48 tps	1.0s	131K	$0.21	$0.25
154	167	DeepSeek V3.1 Thinking	974	±11	3.7K	11.1%	7.1%	18 tps	1.8s	131K	$0.23	$0.75
155	167	Qwen3 VL 30B A3B Thinking	973	±17	1.5K	9.3%	4.5%	84 tps	2.9s	127K	$0.20	$1.47
156	148	OpenAI o3-mini-high	972	±8	3.9K	5.1%	2.4%	231 tps	10.5s	200K	$1.10	$4.40
157	159	GLM 4.6V	972	±18	2.3K	5.8%	6.4%	21 tps	1.8s	128K	$0.38	$0.90
158	159	Mistral Small 3.1 24B Instruct	970	±15	2.8K	4.2%	7.5%	15 tps	2.4s	131K	$0.06	$0.18
159	167	Qwen 2.5 72B	969	±22	1.3K	4.0%	1.2%	96 tps	1.2s	131K	$0.14	$0.26
160	167	Llama 4 Scout	958	±7	9K	4.2%	0.6%	88 tps	5.1s	131K	$0.18	$0.46

4of7

View All (273 models)