Models

Last updated about 1 month ago

Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output)
1404Seed Coder 8B Reasoning329±417004.1%<0.1%25 tpsN/A32K$0.99$0.99
2402QwQ 32B RpR v1406±351K10.9%<0.1%34 tps3.3s33K$0.02$0.07
3286Phi 4 Mini Reasoning447±153.4K12.0%9.7%30 tps0.9s128K$0.07$0.30
4399Mistral Nemo 12B Inferor v0.0454±285651.7%<0.1%83 tps0.8s16K$0.80$1.20
5284CodeLlama 7B Instruct Solidity463±544858.5%3.6%33 tps0.7s16K$0.80$1.20
6399DeepSeek-R1 Distill Qwen 1.5B481±197305.2%<0.1%20 tps0.0s131K$0.18$0.18
7284Qwen 2.5 VL 3B Instruct523±254.1K6.1%3.0%44 tps2.5s128K$0.21$0.63
8279Phi 4 Reasoning573±172.1K5.5%21.0%29 tps1.0s33K$0.06$0.25
9279Hunyuan A13B Instruct588±221.6K9.2%2.3%67 tps2.0s33K$0.01$0.01
10390DeepSeek-R1 Distill Llama 8B595±191.2K5.6%<0.1%17 tpsN/A32K$0.04$0.04
11279Phi 4 Mini Instruct599±211K7.1%7.4%40 tps1.1s128K$0.07$0.30
12279MythoMax L2 13B600±212.3K5.8%1.2%22 tps1.1s4K$0.18$0.18
13390Llema 7B601±218504.5%<0.1%1 tps15.0s4K$0.80$1.20
14390ERNIE 4.5 0.3B602±4068511.0%<0.1%85 tps2.2s120K$0$0
15279UI-TARS 1.5 7B610±4053011.7%4.0%75 tps0.9s128K$0.10$0.20
16386Shisa V2 Llama 3.3 70B623±245859.3%<0.1%8 tps2.0s33K$0.03$0.09
17386DeepSeek-R1 Distill Qwen 7B633±195655.0%<0.1%0 tpsN/A131K$0.05$0.10
18386Dolphin 2.9.2 Mixtral 8x22B652±191.1K2.6%<0.1%20 tps1.5s16K$0.90$0.90
19386MiMo 7B RL655±131.2K3.5%<0.1%31 tps0.4s32K$0.49$0.49
20276MiniMax M1686±133.8K5.3%<0.1%31 tps2.8s1M$0.55$2.20
21383ArliAI QwQ 32B Arliai RpR V1686±406359.3%<0.1%34 tps1.8s33K$0.02$0.07
22276DeepSeek-R1 Distill Qwen 32B696±202K5.5%6.2%22 tps1.8s131K$0.37$0.39
23374Phi 4 Multimodal Instruct697±162.1K6.8%<0.1%17 tps1.4s128K$0.03$0.05
24374Dolphin 3.0 R1 Mistral 24B701±168907.8%<0.1%13 tps0.1s33K$0.03$0.09
25276Hermes 3 405B Instruct702±201.4K4.1%2.3%20 tps1.1s131K$0.80$0.80
26269DeepHermes 3 Mistral 24B Preview706±307155.9%2.5%50 tps1.0s33K$0.06$0.25
27374Mythalion 13B709±101.1K1.3%<0.1%63 tps0.5s4K$0.56$1.13
28269Inflection 3 Pi719±181.5K4.1%1.1%33 tps3.4s8K$2.50$10.00
29374Solar Pro 250422720±195306.2%<0.1%13 tps0.6s33K$0$0
30269Pixtral 12B722±213K6.3%2.2%101 tps1.2s131K$0.08$0.08
31374Mistral Nemo 12B Celeste V1.9725±181.1K3.5%<0.1%6 tps10.2s8K$0.80$1.20
32361Zenith730±288509.6%<0.1%36 tps1.8s131K$0$0
33361Meridian734±399659.8%<0.1%92 tps1.2s131K$0$0
34361Command734±187654.4%<0.1%25 tpsN/A4K$0.83$1.33
35269Inflection 3 Productivity737±241.5K5.0%0.6%50 tps3.2s8K$2.50$10.00
36269Command R+738±151.6K5.6%2.8%36 tps0.7s128K$2.08$9.45
37269Mixtral 8x22B Instruct738±171.4K5.6%1.8%142 tps0.7s66K$0.45$0.45
38269Gemma 3 4B742±103.3K4.7%1.3%138 tps0.7s131K$0.02$0.04
39262Qwen 2.5 VL 72B Instruct746±202.1K6.0%5.3%25 tps3.7s128K$1.01$2.79
40361Seed Coder 8B Instruct751±226052.4%<0.1%35 tpsN/A32K$0.99$0.99
41262Goliath 120B754±247455.7%2.7%21 tps2.2s6K$6.56$9.38
42361DeepSeek-R1 Distill Qwen 14B756±161.9K6.3%<0.1%44 tps1.7s64K$0.63$0.63
43262Hermes 4 405B Reasoning FP8759±112.7K12.8%3.6%32 tps0.8s131K$1.00$3.00
44262Open Mistral 7B762±181.3K4.7%0.7%176 tps0.4s33K$0.25$0.25
45354OLMo 3 7B Think763±217707.8%4.2%77 tps0.4s66K$0.12$0.20
46262Mistral Small770±121.2K4.5%1.7%142 tps0.6s32K$0.43$1.30
47262Baichuan-M2-32B770±3074010.8%<0.1%32 tps3.3s131K$0.07$0.07
48262Command R778±182.2K4.9%5.8%54 tps0.6s128K$0.30$0.99
49252Hermes 4 70B781±294608.9%1.1%67 tps0.6s131K$0.12$0.39
50252Mistral Large785±161.1K5.8%1.5%54 tps0.7s33K$2.00$6.00
51252GPT-3.5 Turbo Instruct787±92K2.7%<0.1%46 tps1.2s4K$1.50$2.00
52252Mercury Coder793±275103.8%<0.1%247 tps2.2s32K$0.25$1.00
53346Magistral Medium 2507795±2566514.2%<0.1%86 tps0.7s41K$2.00$5.00
54252Hermes 4 405B FP8797±218158.4%3.5%31 tps0.9s131K$0.52$1.73
55252Phi 4798±161.7K3.4%5.1%28 tps1.3s128K$0.10$0.32
56252WizardLM-2 8x22B801±121.9K3.1%11.6%11 tps2.5s66K$0.77$0.77
57252Gemma 3 1B802±112K6.1%0.6%176 tps1.0s33K$0.06$0.10
58252Magistral Small 2509802±181.8K7.5%2.7%116 tps0.6s131K$0.50$1.50
59346Magistral Medium (Thinking)804±102.2K5.7%<0.1%67 tps0.8s41K$2.00$5.00
60337Qwen 2 72B Instruct805±191K3.3%<0.1%3 tpsN/A33K$0.90$0.90
61252Ministral 3B806±162.3K5.1%0.8%248 tps0.4s131K$0.08$0.08
62337GLM 4.1V 9B Thinking813±161.1K4.2%<0.1%69 tps1.3s66K$0.04$0.14
63240Gemma 2 27B815±171.5K4.1%1.4%44 tps1.4s8K$0.80$0.80
64240LFM2 8B A1B818±1882511.3%<0.1%142 tps0.3s33K$0.01$0.02
65240Moonshot V1 32k820±179503.1%1.4%53 tps1.4s33K$1.00$3.00
66240C4AI Aya Expanse 32B821±73.8K4.0%1.5%43 tps0.5s128K$0.50$1.50
67240Ministral 8B825±172.2K5.5%1.4%177 tps0.4s128K$0.14$0.14
68240Krutrim 2825±102.3K2.3%12.5%33 tps2.1s128K$1.00$1.00
69240LFM2 2.6B826±2681010.0%6.7%184 tps0.4s33K$0.01$0.02
70240Sky T1 32B Preview829±142.4K4.5%7.8%73 tps0.6s16K$0.12$0.18
71240Qwen 2.5 7B831±172K5.1%3.7%40 tps1.9s131K$0.08$0.27
72240Mixtral-8x7B Instruct v0.1832±231.3K4.6%1.3%54 tps0.4s33K$0.60$0.60
73324NVIDIA Llama 3.1 Nemotron Ultra 253B v1832±162.2K4.1%<0.1%40 tps0.8s128K$0.30$0.90
74324OLMo 2 0425 1B Instruct833±215602.6%<0.1%68 tps0.0s4K$0$0
75240GLM 4.5 Flash834±375208.8%12.2%15 tps2.2s131K$0$0
76324Typhoon 2 70B Instruct835±151.4K4.0%<0.1%19 tps0.1s8K$0.88$0.88
77240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95
78234ERNIE 4.5 21B A3B Thinking838±231.1K6.9%1.8%87 tps1.5s120K$0.07$0.28
79324Cogito V2 671B838±171.6K5.9%<0.1%41 tps0.6s164K$1.25$1.25
80234GPT-3.5 Turbo 16k838±102.7K3.6%<0.1%22 tps0.6s16K$3.00$4.00
81324MAI-DS-R1842±73.5K11.7%<0.1%73 tps3.2s64K$0.10$0.40
82312Wikipedia846±79.8K4.9%<0.1%47 tps2.1s32K$0$0
83234Command R 7B849±153.3K4.8%1.1%76 tps0.4s128K$0.04$0.15
84234Llama 3.3 70B Instruct Turbo851±191.2K6.0%2.0%78 tps1.0s131K$0.88$0.88
85234Jamba 1.5 Large851±92.9K4.0%1.7%48 tps0.9s256K$1.50$6.00
86234Gemma 3 27B IT853±102.3K3.9%2.0%60 tps0.8s128K$0.17$0.29
87210Mixtral 8x7B Instruct854±161.4K4.4%0.2%79 tps0.7s33K$0.23$0.31
88210Ministral 3B 2512854±575158.0%2.8%339 tps0.6s131K$0.10$0.10
89210Mixtral 8x7B855±181.3K5.1%2.2%142 tps0.6s33K$0.23$0.23
90210Gemma 3 27B856±271.1K6.9%1.8%35 tps1.1s66K$0.06$0.10
91312Command Light856±161.1K4.9%<0.1%23 tpsN/A4K$0.10$0.20
92312DeepSeek-R1 0528 Qwen3 8B856±84.9K6.5%<0.1%45 tps2.4s128K$0.05$0.09
93312Yi Large858±121.5K<0.1%<0.1%34 tpsN/A33K$1.50$1.50
94210Qwen 2.5 14B Instruct861±162.4K5.7%2.4%40 tps1.6s1M$0.40$1.61
95293Hermes 2 Mixtral 8x7B DPO863±171.2K1.3%<0.1%1 tpsN/A33K$0.60$0.60
96210Moonshot V1 8k863±139155.2%1.0%55 tps1.5s8K$0.20$2.00
97210Mistral Small 24B Instruct864±161.5K4.1%1.5%84 tps0.4s33K$0.80$0.80
98210Hermes 2 Pro Llama 3 8B864±211.8K2.5%<0.1%76 tps1.0s131K$0.08$0.09
99210Gemma 3 12B867±112.5K4.9%4.2%73 tps0.8s131K$0.05$0.12
100210GLM 4 32B868±122.9K4.9%2.6%40 tps1.6s33K$0.14$0.14
101210Krutrim Spectre V2868±161.3K3.6%<0.1%33 tps3.1s4K$0.19$0.19
102293Claude Sonnet 3869±179001.6%<0.1%35 tps1.0s200K$3.00$15.00
103293AFM 4.5B869±74.4K8.9%<0.1%81 tps0.3s66K$0.05$0.20
104210Qwen 2.5 7B Turbo870±256156.1%0.5%125 tps0.4s131K$0.30$0.30
105293Devstral Small 2505871±151.7K6.2%<0.1%141 tps1.3s33K$0.03$0.09
106210Mixtral 8x22B871±221.3K5.0%1.2%140 tps0.6s64K$2.00$6.00
107210GLM 4.7 Flash871±286104.7%5.8%61 tps2.8s128K$0.07$0.39
108210Solar Mini 250422874±171.3K5.9%1.8%90 tps1.7s33K$0.15$0.15
109210Mistral Nemo875±159152.7%<0.1%112 tps0.4s131K$0.07$0.13
110210Mistral Medium 3875±234856.7%2.4%47 tps0.8s33K$0.40$2.00
111210DeepSeek R1T2 Chimera876±102.1K5.9%3.0%28 tps1.8s164K$0.13$0.45
112210Inception Mercury878±56.9K3.7%0.4%257 tps1.1s32K$0.25$1.00
113210Moonshot V1 128k879±191.1K4.6%1.4%54 tps1.5s131K$2.00$5.00
114210Mistral Small 3 24B Instruct880±101.7K3.6%2.6%77 tps0.6s33K$0.07$0.14
115280Magistral Small 2507880±1973013.1%<0.1%148 tps0.4s41K$0.50$1.50
116210Qwen3 4B880±85.1K9.6%1.9%94 tps1.5s128K$0.01$0.01
117280Refuel LLM 2 Small881±74.2K3.9%<0.1%116 tps0.5s8K$0.20$0.20
118210Gemma 3n E4B882±76K4.5%2.0%30 tps0.5s8K$0.01$0.02
119280AFM 4.5B Preview882±162.5K3.1%<0.1%32 tps0.0s66K$0$0
120210Magistral Medium 2509883±162.6K9.5%4.0%58 tps0.9s131K$2.00$5.00
121201Llama 3.2 11B Instruct885±152.1K4.1%1.5%152 tps0.5s8K$0.16$0.16
122280Gemini 1.5 Pro887±82.3K2.3%<0.1%15 tps0.0s2M$0.78$3.13
123280Arcee AI Blitz889±83K2.1%<0.1%6 tpsN/A33K$0.45$0.75
124280Venice Uncensored891±296156.1%<0.1%59 tps3.9s33K$0$0
125201GLM 4.6V Flash892±102.5K7.6%3.7%64 tps2.1s128K$0.04$0.40
126264Exaone 3.5 32B Instruct893±216503.0%<0.1%17 tpsN/A33K$0$0
127201Amazon Nova Pro 1.0894±105.7K4.0%0.9%96 tps0.7s300K$0.80$1.70
128264Arcee AI Virtuoso-Medium896±122K2.6%<0.1%3 tpsN/A131K$0.50$0.80
129264Llama 3.1 405B Instruct Turbo896±112K3.9%<0.1%26 tps0.8s131K$3.50$3.50
130201Moonshot V1 Auto899±229304.1%1.2%54 tps1.5s8K$2.00$5.00
131201GPT-4o mini900±142.8K5.3%2.1%71 tps1.7s128K$0.15$0.60
132264Grok 2901±72.3K2.7%<0.1%55 tps1.1s131K$2.00$10.00
133264DeepSeek R1T Chimera901±92.9K6.7%<0.1%46 tps1.1s164K$0.09$0.36
134201Mistral Small 3.2 24B Instruct903±208508.6%1.9%113 tps1.1s131K$0.02$0.08
135201Llama 3 8B904±103.2K3.5%6.0%85 tps0.7s8K$0.12$0.16
136264Fauna Fox908±103.4K8.2%<0.1%194 tps0.3s128K$0.04$0.15
137201GPT-3.5 Turbo908±151.2K2.5%1.3%74 tps0.9s16K$0.75$1.75
138201Magistral Small 2506908±114.3K3.1%1.6%156 tps0.5s40K$0.37$1.10
139264OLMo 3 32B Think910±254706.0%<0.1%84 tps0.6s66K$0.15$0.50
140264YouTube910±132K5.5%<0.1%34 tps2.7s32K$0.99$0.99
141264Arcee AI Spotlight910±84.6K4.3%<0.1%121 tps0.4s131K$0.18$0.18
142189Inception Mercury Coder Small Beta912±206103.2%1.7%270 tps1.4s32K$0.25$1.00
143245Solar Pro 3 (Reasoning)913±185954.8%3.2%118 tps1.2s131K$0.15$0.60
144189Seed 1.6 Flash 250715913±161.2K5.7%2.5%108 tps1.6s256K$0.07$0.30
145189Rnj-1 Instruct915±219706.7%0.6%103 tps0.3s33K$0.15$0.15
146189Devstral Small916±171.4K5.0%2.4%180 tps0.6s131K$0.10$0.30
147189Jamba 1.7 Large917±169758.0%1.3%58 tps1.0s256K$1.33$5.33
148189Open Mistral Nemo918±211.7K4.5%1.5%171 tps0.5s131K$0.15$0.15
149189Llama 3.3 Swallow 70B Instruct919±83.5K5.5%1.4%153 tps1.3s131K$0.13$0.39
150245Solar Pro 2 250710 (Reasoning)919±102.6K3.9%<0.1%9 tpsN/A66K$0.50$0.50
151245GPT-5 Nano Minimal920±131.4K10.8%<0.1%88 tps0.8s400K$0.05$0.40
152245GLM Z1 32B921±101.9K10.1%<0.1%18 tps9.3s33K$0.09$0.11
153245Grok 3 Mini Beta922±141.8K1.9%<0.1%75 tps0.5s131K$0.45$2.25
154189Jamba 1.6 Large924±93.3K3.8%2.0%59 tps1.2s256K$1.33$5.33
155189Mistral Small 3.1925±142.4K4.0%7.4%13 tps2.6s32K$0.17$0.28
156189Grok 3 Mini927±69.9K6.4%1.2%43 tps0.5s131K$0.30$0.50
157245NVIDIA Llama 3.1 Nemotron 70B928±75.3K2.0%<0.1%9 tps0.1s128K$0.33$0.39
158189Codestral931±198556.0%5.2%151 tps0.9s262K$0.15$0.45
159245Llama 3.1 70B Instruct Turbo933±114.1K3.8%<0.1%110 tps0.8s128K$0.88$0.88
160189Llama 3.3 70B933±103K6.6%0.3%500 tps0.5s8K$0.48$0.66
161179DeepSeek Prover v2935±101.3K3.0%5.2%14 tps1.3s164K$0.40$1.56
162230R1 1776935±93.3K4.2%<0.1%61 tps1.0s128K$2.00$8.00
163230Dobby Unhinged Llama 3.3 70B935±198602.8%<0.1%41 tps0.4s128K$0.90$0.90
164230Jamba 1.7 Mini936±241K8.4%<0.1%84 tps0.9s256K$0.20$0.40
165179DeepSeek-R1939±66.4K4.3%0.8%133 tps0.6s64K$0.91$3.07
166179NVIDIA Llama 3.3 Nemotron Super 49B v1.5941±191.4K6.9%2.0%50 tps0.6s131K$0.09$0.33
167179ERNIE 4.5 VL 424B A47B942±187256.5%4.9%36 tps3.5s123K$0.42$1.25
168230NVIDIA Llama 3.3 Nemotron Super 49B v1942±93.6K2.3%<0.1%13 tpsN/A131K$0.07$0.20
169179ERNIE 4.5 21B A3B943±285406.9%2.3%78 tps1.5s120K$0.05$0.19
170179Grok 3 Mini Fast943±79K7.0%1.6%44 tps0.5s131K$0.60$4.00
171179Ministral 14B 3.0948±168058.5%2.0%119 tps0.5s128K$0.20$0.20
172179Qwen3 8B948±94.2K8.2%2.4%61 tps1.4s41K$0.02$0.07
173179Switchpoint Router949±102.7K3.6%1.7%71 tps4.9s131K$0.85$3.40
174230Magistral Medium952±161.3K10.8%<0.1%95 tps0.5s41K$2.00$5.00
175179Qwen3 30B A3B Thinking 2507953±103.5K4.7%0.5%124 tps1.2s131K$0.16$1.70
176167Llama 3.1 8B Turbo958±142.2K2.0%2.1%650 tps0.5s128K$0.13$0.14
177167Qwen 2.5 72B960±151.4K4.5%1.2%96 tps1.2s131K$0.14$0.26
178167Qwen3 14B962±85.3K8.4%1.7%109 tps0.8s41K$0.04$0.15
179167Devstral Medium962±113.5K5.2%1.5%77 tps0.6s131K$0.40$2.00
180167Llama 4 Scout965±517.5K5.3%0.6%88 tps5.1s131K$0.18$0.46
181167Qwen3 VL 30B A3B Thinking967±111.9K8.9%4.5%84 tps2.9s127K$0.20$1.47
182167Pixtral Large969±143.5K3.9%2.5%57 tps1.3s128K$1.50$4.50
183167Mistral Small 3.2 24B970±134.6K4.9%2.8%141 tps0.7s33K$0.02$0.08
184167Llama 4 Maverick971±521K5.0%1.2%88 tps2.4s1M$0.23$0.83
185219Arcee Coder Large971±73.6K2.6%<0.1%54 tps1.3s33K$0.50$0.80
186167Qwen 2.5 32B Instruct972±74.1K6.5%2.5%48 tps1.0s131K$0.21$0.25
187211Arcee AI Coder-Large972±159854.4%<0.1%60 tps1.6s33K$0.50$0.80
188167Nemotron 3 Nano974±465806.5%1.3%216 tps0.8s256K$0.05$4.94
189211Grok 4 (Low Reasoning)975±215202.8%<0.1%18 tps9.5s256K$0$0
190167DeepSeek V3.1 Thinking976±95.2K9.5%7.1%18 tps1.8s131K$0.23$0.75
191159DeepSeek-R1 0528979±65.5K3.5%1.3%93 tps0.5s64K$1.60$3.67
192159Mistral Small 3.1 24B Instruct980±112.9K4.3%7.5%15 tps2.4s131K$0.06$0.18
193159Seed 2.0 Mini (Medium)981±355705.8%11.9%33 tps1.7s256K$0.15$0.60
194159Kimi K2 0711981±67K4.5%1.6%29 tps1.3s131K$0.72$2.60
195195GLM 4.6 FP8982±171.2K11.7%<0.1%56 tps1.8s200K$0.40$1.75
196195Cypher Alpha985±207358.7%<0.1%4 tpsN/A1M$0$0
197159GLM 4.6V986±83K5.5%6.4%21 tps1.8s128K$0.38$0.90
198159Grok Code Fast 1987±92.5K6.0%5.9%294 tps0.5s256K$0.20$1.50
199159OpenAI o3-mini-low988±612.2K6.4%0.7%139 tps1.5s200K$1.10$4.40
200159GPT-5 Nano989±64.6K8.0%3.2%113 tps20.9s400K$0.05$0.40
201195Claude Haiku 3992±112.8K3.0%0.4%62 tps0.5s200K$0.25$1.25
202148Qwen3 30B A3B994±86.3K6.9%5.1%163 tps1.0s41K$0.06$0.21
203195Arcee AI Virtuoso-Large994±83K5.7%<0.1%64 tps0.5s131K$0.75$1.20
204148Seed 1.6 250615995±211.6K6.0%3.1%46 tps2.2s256K$0.25$2.00
205148OpenAI o4-mini-high995±713.6K6.2%1.9%117 tps15.9s200K$1.10$4.40
206148OpenAI o3-mini999±415K5.5%0.8%143 tps3.3s200K$1.10$4.40
207148OpenAI o3-mini-high999±58.3K4.1%2.4%231 tps10.5s200K$1.10$4.40
208148Qwen3 235B A22B Thinking 25071000±102.8K4.4%2.5%53 tps1.6s131K$0.59$5.70
209148Qwen 2.5 VL 32B Instruct1001±218654.9%6.3%43 tps3.2s128K$0.35$0.62
210195GPT-5 Mini High1002±93K7.7%<0.1%33 tps3.9s400K$0.25$2.00
211148DeepSeek-R1 Turbo1003±92.5K5.6%2.6%29 tps1.8s64K$2.85$4.75
212189K2 Think1005±161.4K5.6%<0.1%418 tps2.8sN/A$0$0
213148Qwen3 Coder Plus1007±226104.7%5.1%56 tps2.3s128K$1.80$9.80
214148Qwen3 VL 235B A22B Thinking1009±64.6K8.3%4.3%47 tps3.0s127K$0.47$3.31
215148Nemotron 3 Nano (Thinking)1012±132K6.7%2.0%200 tps0.5s256K$0$0
216189GLM 4.5 Air1016±67.1K6.9%<0.1%22 tps1.4s131K$0.10$0.38
217144Gemini 2.0 Flash1018±78.2K3.8%<0.1%76 tps0.5s1M$0.14$0.56
218144OpenAI o31020±75.9K4.0%0.9%85 tps6.8s128K$7.33$29.33
219144DeepSeek V3.1 Nex N11021±195655.0%3.4%24 tps7.2s131K$0.14$0.50
220144Command A1024±422.4K4.8%2.2%42 tps0.8s256K$2.00$7.33
221135Amazon Nova 2 Lite1026±103.6K6.0%1.0%137 tps0.6s300K$0.35$2.95
222174Claude Haiku 3.51028±66.4K4.9%0.8%40 tps2.8s200K$0.80$4.00
223135Gemini 2.0 Flash Lite1029±514.7K9.5%<0.1%42 tps0.5s1M$0.08$0.30
224135DeepSeek V3.2 Speciale1030±102.3K6.1%6.0%43 tps1.4s131K$0.84$1.52
225135DeepSeek V31032±517.6K3.7%0.9%69 tps1.1s64K$0.59$1.49
226135Qwen3 VL 30B A3B Instruct1034±151K6.7%1.8%80 tps2.6s129K$0.18$0.67
227135Gemini 3.1 Flash Lite Preview1034±219804.4%1.0%8 tps1.2s1M$0.25$1.50
228135Gemini 2.5 Flash Lite Thinking Preview 09251035±75.8K6.8%1.5%152 tps3.0s1M$0.10$0.40
229135Qwen3 Next 80B A3B Thinking1035±56.2K7.4%0.6%175 tps1.3s256K$0.21$2.26
230174Qwen 2.5 72B Turbo1035±226705.0%<0.1%84 tps0.8s131K$0.60$0.60
231135QwQ 32B1035±411.6K6.4%5.4%41 tps2.1s16K$0.43$0.56
232164Llama 3 70B Turbo1037±64.3K1.0%<0.1%31 tps0.0s8K$0.73$0.83
233128Gemini 3.1 Flash Lite Preview Thinking1039±161.4K4.2%1.7%75 tps4.7s1M$0.25$1.50
234164EXAONE Deep 32B1040±148801.7%<0.1%24 tpsN/A33K$0$0
235128OpenAI o4-mini1042±58.5K6.4%1.4%97 tps7.0s128K$1.10$4.40
236128Kimi K2 Thinking1042±103.3K5.1%4.2%61 tps5.9s262K$0.24$1.03
237128GLM 4.5 AirX1042±151.1K6.9%3.3%75 tps1.2s131K$1.10$4.50
238164Grok 4 0709 EU1043±111.3K5.7%<0.1%33 tps8.2s128K$3.00$15.00
239128Qwen3 32B1044±198506.6%3.9%30 tps3.1s41K$0.12$0.42
240128Cogito v2.1 671B1044±191.2K4.6%0.8%85 tps0.5s128K$1.25$1.25
241164Arcee AI Maestro Reasoning1046±73.8K4.6%<0.1%85 tps0.3s131K$0.90$3.30
242128ERNIE 4.5 300B A47B1049±413.5K3.9%4.7%23 tps2.3s123K$0.28$1.10
243151GLM 4.5 X1051±166455.8%<0.1%48 tps2.8s131K$2.20$8.90
244119Qwen3 32B Fast1052±611.4K5.2%11.6%30 tps3.1s41K$0.10$0.25
245119GPT-5.1 Codex Mini (High)1054±152.2K3.9%5.9%70 tps4.6s400K$0.25$2.00
246119GPT-5.1 Codex Mini (Medium)1057±151.9K4.9%4.6%69 tps4.1s400K$0.25$2.00
247151OpenAI Codex Mini1057±59.8K3.3%<0.1%46 tps2.1s200K$1.50$6.00
248119LongCat Flash Chat1058±122.7K5.9%0.8%85 tps0.9s131K$0.14$0.68
249119Seed 2.0 Lite (Medium)1058±205253.7%6.6%33 tps1.6s256K$0.25$2.00
250151Llama 3 8B Turbo1059±246001.6%<0.1%97 tps0.1s8K$0.12$0.13
251151GLM 4.5 FP81060±186108.3%<0.1%59 tps1.2s131K$0.41$1.65
252119Gemini 2.5 Flash Lite Thinking1061±49.8K6.2%1.0%118 tps4.4s1M$0.03$0.13
253119OpenAI o1-pro1061±206807.5%5.2%33 tps72.8s200K$150.00$600.00
254119DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
255119OpenAI o11062±69.9K3.3%4.2%92 tps5.5s200K$15.00$60.00
256112Grok 4.20 Beta Non-reasoning1063±365004.8%1.1%151 tps0.6s2M$2.00$6.00
257144Qwen Turbo1064±510K6.0%<0.1%53 tps1.1s1M$0.05$0.20
258112gpt-oss-20b1066±67.7K7.1%0.5%216 tps0.5s131K$0.06$0.26
259112Kimi K2 0905 Turbo1070±67.5K9.1%0.7%373 tps0.5s262K$1.70$6.50
260112GPT-5 (Low)1070±146903.5%1.8%75 tps8.2s400K$1.25$10.00
261112Kimi K2 Fast1073±535K6.4%0.8%365 tps0.5s131K$1.00$3.00
262112Kimi K2 09051074±78.7K4.3%4.0%30 tps1.4s262K$0.63$2.39
263112GLM 4.51075±56K7.0%3.7%46 tps1.4s131K$0.43$1.63
264105Qwen3 Max Thinking1080±181.5K2.0%13.5%32 tps2.3s256K$1.20$6.00
265105Mistral Medium1080±49.6K5.6%1.8%48 tps0.6s33K$1.48$4.55
266105Seed 1.8 2512281081±103.2K3.1%3.7%41 tps2.1s256K$0.25$2.00
267132Solar Pro 2 2507101081±510.6K6.9%<0.1%9 tpsN/A66K$0.50$0.50
268105DeepSeek V3 (Turbo)1082±201.5K5.1%1.5%32 tps1.5s64K$0.40$1.30
269105Qwen3 Omni 30B A3B Instruct1085±137754.3%3.9%65 tps1.2s66K$0.35$0.97
270105GPT-4.1 nano1085±517K5.0%0.6%175 tps0.5s1M$0.10$0.40
271105GPT-4.1 mini1087±519.7K4.2%1.1%67 tps0.9s1M$0.34$1.60
272132Qwen Plus 0728 (Thinking)1087±91.2K8.9%<0.1%56 tps1.1s1M$0.40$4.00
273132Claude Sonnet 3.51088±102.9K4.9%1.0%40 tps2.7s200K$3.00$15.00
27498DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
27598DeepSeek V3.11089±122.3K4.7%0.8%197 tps0.4s164K$0.55$1.60
27698OpenAI o3-pro1090±85.4K4.3%5.2%22 tps70.8s200K$20.00$80.00
277123Sherlock Dash Alpha1090±198356.7%<0.1%68 tps0.7s2M$0$0
278123Nova Experimental Chat 10-091091±73.2K10.7%<0.1%59 tps6.1s98K$0$0
27998Qwen3 235B A22B1093±64.5K8.0%5.3%71 tps0.9s41K$0.23$0.63
28098DeepSeek V3 0324 Turbo1093±515.5K5.7%6.3%12 tps2.4s164K$0.73$1.79
28198Grok 31098±419.1K5.5%1.5%53 tps0.6s1M$3.67$18.33
28298Gemini 2.5 Flash1098±435.9K3.2%1.3%2 tps3.7s1M$0.30$2.50
28390Qwen3 Coder 480B A35B Instruct1099±83.1K4.5%3.3%61 tps2.0s262K$0.71$1.34
28490DeepSeek V3 03241100±415.1K4.3%5.8%12 tps2.7s164K$0.38$0.93
28590Step 3.5 Flash1102±248103.6%2.2%109 tps0.6s256K$0.05$0.15
28690GPT-4o1102±58.5K3.7%1.0%49 tps2.4s128K$3.71$12.57
28790Grok 3 Fast1102±142.5K4.7%1.7%52 tps2.4s131K$5.00$25.00
28890Gemini 2.5 Flash Lite1103±521.3K6.2%1.3%210 tps0.7s1M$0.10$0.40
289114GPT-5 Mini Low1104±82.8K7.2%<0.1%69 tps3.2s400K$0.25$2.00
29090Qwen Max1107±418.3K4.2%1.5%49 tps1.5s33K$1.60$6.40
29190DeepSeek V3.2 Exp Chat1107±45.5K6.1%2.6%29 tps1.5s131K$0.27$0.39
29285Qwen3 Omni 30B A3B Thinking1110±102.3K6.0%3.7%67 tps1.2s66K$0.97$1.79
29385DeepSeek V3.1 Chat1110±74.9K6.6%2.8%21 tps1.6s131K$0.38$1.00
294108Gemini 2.5 Pro Preview 03251111±111.5K3.2%<0.1%3 tps16.6s1M$1.25$10.00
29585GPT-5.2 Codex (Low)1113±191.2K3.2%4.5%41 tps5.0s400K$1.75$14.00
29685GPT-5 Mini Minimal1114±123.2K8.5%1.2%63 tps1.4s400K$0.25$2.00
29785Gemini 2.5 Flash Thinking1118±413.7K3.6%2.2%88 tps6.4s1M$0.30$2.50
29897Gemini 2.5 Pro Preview 06051121±101.7K2.3%<0.1%0 tps3.7s1M$1.25$10.00
29977Gemini 2.5 Flash Lite Preview 09251122±78.5K6.6%1.2%209 tps0.7s1M$0.25$0.35
30077GPT-4.11123±532.8K5.2%3.7%112 tps1.3s1M$2.00$8.00
30197Ministral 8B 25121125±155107.3%<0.1%174 tps0.5s128K$0.15$0.15
30277Grok 41125±339.6K4.4%3.9%29 tps11.1s256K$3.00$15.00
30377Qwen3 Max Thinking Preview1127±106.3K5.7%3.1%40 tps2.1s256K$1.20$6.00
30477Grok 4.20 Multi Agent Beta1129±199453.6%1.2%56 tps8.8s2M$2.00$6.00
30577DeepSeek V3.1 Turbo1130±74.8K5.3%0.9%173 tps1.3s164K$2.00$3.75
30677GPT-5 Mini1131±58.6K5.4%2.6%66 tps14.2s400K$0.25$2.00
30777Mistral Large 31131±85.4K5.8%2.1%51 tps1.0s256K$0.50$1.50
30897Grok 3 Beta1134±92K0.8%<0.1%58 tps0.8s131K$3.00$15.00
30993Gemini 2.5 Flash Preview Thinking1136±101.4K1.8%<0.1%26 tps1.8s1M$0.15$1.76
31074Gemini 2.5 Flash Preview 09251140±67.6K6.0%1.2%5 tps0.9s1M$0.13$0.97
31174Qwen3.5 397B A17B1142±142.5K2.9%4.3%57 tps1.4s256K$0.52$3.00
31274Qwen Plus (Aug'24)1146±517.2K4.7%1.4%53 tps1.3s30K$0.40$1.20
31369DeepSeek V3.1 Terminus Chat1158±56.5K6.9%3.4%27 tps1.5s131K$0.86$1.80
31486GPT-5 (Minimal)1158±58.3K7.4%<0.1%67 tps1.4s400K$1.25$10.00
31586Gemini 2.5 Flash Preview1161±83K1.1%<0.1%138 tps6.9s1M$0.15$0.60
31669GLM 4.71161±716.8K3.7%5.8%40 tps1.5s200K$0.77$1.73
31769GPT-5 Codex (Low)1163±105K4.1%2.7%112 tps3.5s400K$1.25$10.00
31869Qwen3.5 35B A3B1164±258653.9%2.1%116 tps2.1s256K$0.63$1.13
31969gpt-oss-120b1165±519.2K5.0%0.7%213 tps0.5s131K$0.11$0.50
32060Grok 4.20 Beta Reasoning1167±221.2K4.1%1.1%77 tps4.5s2M$2.00$5.50
32175Gemini 2.5 Pro Low1170±49.6K8.1%<0.1%89 tps2.4s1M$1.25$10.00
32260GPT-5.1 Instant1171±88.3K4.1%1.3%50 tps1.9s400K$1.25$10.00
32360GPT-5.1 Codex (Medium)1171±143K3.2%4.6%71 tps3.7s400K$1.25$10.00
32460Claude Sonnet 3.5 v21171±65.5K3.4%<0.1%46 tps1.4s200K$3.00$15.00
32560Qwen3 235B A22B Instruct 25071172±412.6K6.4%6.8%13 tps1.9s262K$0.13$0.52
32675Gemini 2.5 Flash Thinking Preview 09251173±79.2K6.8%<0.1%111 tps4.7s1M$0.30$2.50
32760Gemini 2.5 Pro1176±337.9K4.8%2.3%45 tps2.6s1M$1.25$10.00
32860Grok 4 Fast Reasoning1177±314.5K5.0%2.1%102 tps3.1s2M$0.30$0.75
32960DeepSeek V3.2 Thinking1178±923.3K4.0%9.0%30 tps2.6s131K$0.28$0.42
33060Grok 4.1 Fast Reasoning1178±739.5K4.4%1.5%58 tps7.3s2M$0.20$0.50
33149GPT-5.3 Codex (Low)1178±285101.0%1.8%61 tps4.3s400K$1.75$14.00
33249GLM 4.61182±717.2K4.4%5.4%39 tps1.5s200K$0.42$1.66
33349Nova Experimental Chat 12-101182±92.9K3.8%2.4%84 tps12.9s98K$0$0
33449MiniMax M21183±519.7K4.2%2.2%39 tps2.3s205K$0.21$0.85
33549Grok 4 Fast Non-Reasoning1185±58.1K7.1%1.5%93 tps0.6s2M$0.27$0.67
33649GPT-51185±421.3K5.3%3.1%78 tps23.1s400K$1.25$9.67
33749MiniMax M2.5 FP81185±176103.2%3.6%33 tps1.7s205K$0.45$1.75
33862Qwen Plus 07281189±82.1K7.5%<0.1%55 tps0.9s1M$0.40$1.20
33949DeepSeek V3.21189±85.1K4.7%1.4%83 tps5.1s131K$0.43$1.09
34049MiniMax M2.11192±819.4K3.6%2.1%66 tps2.6s205K$0.30$1.20
34162OpenAI o1-mini1192±415K4.6%<0.1%118 tpsN/A128K$1.13$4.51
34249Kimi K2 Thinking Turbo1192±620.3K3.4%2.0%75 tps1.4s262K$1.15$8.00
34349Qwen3 30B A3B Instruct 25071194±512.7K5.7%1.2%55 tps1.3s131K$0.13$0.72
34443MiniMax M2.1 Lightning1197±238753.3%1.7%52 tps2.1s205K$0.30$2.40
34543GPT-5.1 Codex Max1200±126.4K3.9%3.0%118 tps4.1s400K$1.25$10.00
34658Claude Sonnet 3.71201±412.1K3.2%<0.1%39 tps1.6s200K$3.00$15.00
34743Qwen3 Max Instruct Preview1203±616.1K4.6%1.1%31 tps1.7s256K$1.43$6.61
34843Gemini 2.5 Pro High1204±321.1K5.7%1.5%48 tps2.3s1M$1.25$10.00
34943Gemini 3 Flash Preview1205±117.2K3.7%1.3%138 tps1.4s1M$0.50$3.00
35043Claude Sonnet 41205±343.2K3.7%1.8%49 tps1.3s200K$3.00$15.00
35153Mistral Medium 3.11206±516.4K5.1%<0.1%77 tps0.7s128K$0.40$2.00
35253Claude Sonnet 3.7 (Thinking)1210±313.6K3.1%<0.1%41 tps2.6s200K$3.00$15.00
35336Kimi K2.5 Instant1210±81.8K3.2%2.9%32 tps3.0s262K$0.50$3.00
35436Qwen3.5 27B1211±169104.7%3.7%55 tps2.6s256K$0.30$2.40
35536GPT-5.2 Codex (Medium)1211±122.4K3.0%5.7%37 tps6.3s400K$1.75$14.00
35636GPT-5 Codex (Medium)1214±68.8K3.9%4.1%122 tps5.2s400K$1.25$10.00
35736Qwen3.5 122B A17B1216±151.9K3.1%1.5%82 tps1.4s256K$0.40$3.20
35836Qwen3 VL 235B A22B Instruct1220±75.6K6.7%3.1%75 tps1.9s129K$0.37$1.81
35936GPT-5.2 (Extra High) 1221±98K3.5%13.2%17 tps20.5s400K$1.75$14.00
36044Nova Experimental Chat 10-201221±54.4K8.1%<0.1%30 tps0.5s98K$0$0
36144GPT-4.5 Preview1223±72.5K1.8%<0.1%36 tps3.0s200K$75.00$150.00
36237Polaris Alpha1226±147555.6%<0.1%48 tps1.1s256K$0$0
36331MiniMax M2.5 Lightning1228±141.7K3.2%1.5%51 tps2.0s205K$0.60$2.40
36437Nova Experimental Chat 11-101230±85.2K6.3%0.4%84 tps8.9s98K$0$0
36531Qwen3 Next 80B A3B Instruct1231±58.8K5.8%0.6%84 tps1.1s256K$0.20$1.42
36631GPT-5 Chat1231±435K4.5%1.3%95 tps0.9s400K$1.25$10.00
36731Grok 4.1 Fast Non-Reasoning1239±69.4K5.4%0.9%101 tps0.5s2M$0.20$0.50
36831GPT-5.1 Codex (High)1240±837K3.3%3.2%96 tps3.9s400K$1.25$10.00
36936Claude Opus 4.11254±47.1K4.6%3.0%17 tps3.7s200K$15.00$75.00
37027GPT-5.2 Codex (High)1257±123.1K2.8%8.8%41 tps12.9s400K$1.75$14.00
37127GPT-5 (High)1259±416.2K3.5%4.5%81 tps35.9s400K$1.25$10.00
37227GPT-5 Codex (High)1260±718.5K3.3%3.2%122 tps7.1s400K$1.25$10.00
37327Claude Sonnet 4 (Thinking)1261±325.9K2.9%1.5%52 tps1.5s200K$3.00$13.67
37419GPT-5.3 Instant1271±124.2K2.5%0.9%63 tps0.8s400K$1.75$14.00
37529Claude Opus 4.1 (Thinking)1272±57.7K5.2%<0.1%20 tps3.9s200K$15.00$75.00
37629Claude Opus 41274±412.4K2.7%<0.1%25 tps1.5s200K$15.00$75.00
37719GPT-5.3 Codex (Medium)1278±271.1K2.3%2.3%62 tps10.3s400K$1.75$14.00
37819MiniMax M2.51283±285103.8%1.4%70 tps1.9s205K$0.28$1.20
37919Claude Haiku 4.51283±316.4K4.5%1.1%100 tps0.9s200K$1.00$5.00
38019Gemini 3 Flash Preview Thinking1286±632.7K3.3%1.6%3 tps6.2s1M$0.50$3.00
38119GPT-5.1 (High)1290±619.1K3.5%3.2%76 tps6.9s400K$1.25$10.00
38219Gemini 3 Pro (Low)1291±611.9K4.2%2.4%51 tps3.5s1M$2.00$12.00
38321GPT-5.1 (Medium)1291±93.2K6.4%<0.1%86 tps3.8s400K$0.83$6.67
38419Kimi K2.51291±1116.5K3.4%6.5%33 tps1.7s262K$0.34$2.57
38517GPT-5.2 (High)1297±830.7K2.8%6.7%18 tps16.3s400K$1.75$14.00
38617Claude Sonnet 4.51307±320.9K5.0%1.4%41 tps1.3s200K$1.80$9.00
38715GPT-5.11319±712.9K3.4%2.3%71 tps1.4s400K$1.42$11.33
38815GLM 51324±1411.7K3.3%3.4%36 tps2.7s200K$0.72$2.55
38913Gemini 3 Pro1337±559.4K2.6%2.1%50 tps3.6s1M$2.00$12.00
39013GPT-5.21340±811.3K3.2%4.1%18 tps2.7s400K$1.75$14.00
39113Claude Opus 4 (Thinking)1352±52.6K2.6%<0.1%28 tps1.3s200K$15.00$75.00
39212Claude Haiku 4.5 (Extended Thinking)1353±414.3K3.8%1.4%115 tps0.7s200K$1.00$5.00
39310GPT-5.2 Instant1358±615.7K3.3%1.7%52 tps2.0s400K$1.75$14.00
39410Claude Sonnet 4.5 (Thinking)1362±458.2K3.3%1.9%44 tps1.1s200K$3.00$15.00
3959GPT-5.3 Codex (High)1381±93.2K1.2%2.0%61 tps17.8s400K$1.75$14.00
3967Claude Opus 4.51409±515.1K2.2%1.5%45 tps1.5s200K$5.00$25.00
3977Gemini 3.1 Pro1418±922K2.5%3.5%35 tps4.1s1M$2.00$12.00
3986Claude Opus 4.5 (Thinking)1446±460.8K1.9%1.8%49 tps1.4s200K$5.00$25.00
3996GPT-5.4 (High)1464±124.9K3.9%4.6%68 tps7.9s1M$2.50$15.00
4005Claude Sonnet 4.6 (Thinking)1506±816.2K3.5%4.7%57 tps1.1s200K$3.00$15.00
4014Claude Opus 4.6 (Thinking)1566±816.5K1.6%2.5%56 tps1.6s200K$5.00$25.00
4021GPT-5.41594±144.4K1.6%2.6%55 tps0.8s1M$2.50$15.00
4031Claude Sonnet 4.61594±1015.7K1.4%1.6%47 tps1.2s200K$3.00$15.00
4041Claude Opus 4.61596±621.6K1.1%2.1%48 tps1.7s200K$5.00$25.00