Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

629
Qwen 2.5 VL 3B Instruct
630
LFM2.5 1.2B Thinking
635
Phi 4 Mini Reasoning
667
UI-TARS 1.5 7B
688
MiniMax M1
697
Phi 4 Reasoning
739
Hunyuan A13B Instruct
755
Pixtral 12B
778
Phi 4 Mini Instruct
779
Moonshot V1 128k Vision
791
MiniMax M2-her
794
Goliath 120B
798
C4AI Aya Expanse 8B
807
DeepSeek-R1 Distill Qwen 32B
814
MythoMax L2 13B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1288Qwen 2.5 VL 3B Instruct629±75.1K6.8%3.0%44 tps2.5s128K$0.21$0.63
2291LFM2.5 1.2B Thinking630±227054.7%2.6%258 tps0.4s33K$0$0
3291Phi 4 Mini Reasoning635±47.9K7.9%9.7%30 tps0.9s128K$0.07$0.30
4289UI-TARS 1.5 7B667±181.4K8.7%4.0%75 tps0.9s128K$0.10$0.20
5284MiniMax M1688±47.8K4.0%<0.1%31 tps2.8s1M$0.55$2.20
6287Phi 4 Reasoning697±84K3.5%21.0%29 tps1.0s33K$0.06$0.25
7285Hunyuan A13B Instruct739±54.6K5.0%2.3%67 tps2.0s33K$0.01$0.01
8274Pixtral 12B755±124.6K5.7%2.2%101 tps1.2s131K$0.08$0.08
9285Phi 4 Mini Instruct778±54.1K3.2%7.4%40 tps1.1s128K$0.07$0.30
10274Moonshot V1 128k Vision779±129555.0%3.1%44 tps3.8s131K$2.00$5.00
11274MiniMax M2-her791±111.1K2.2%<0.1%108 tps0.7s205K$0.30$1.20
12281Goliath 120B794±53.1K2.5%2.7%21 tps2.2s6K$6.56$9.38
13274C4AI Aya Expanse 8B798±121.2K6.5%0.9%61 tps0.4s8K$0.50$1.50
14274DeepSeek-R1 Distill Qwen 32B807±64K3.3%6.2%22 tps1.8s131K$0.37$0.39
15281MythoMax L2 13B814±49.8K2.5%1.2%22 tps1.1s4K$0.18$0.18
16274LFM2 8B A1B831±72.3K6.7%<0.1%142 tps0.3s33K$0.01$0.02
17281Gemma 2 9B831±91.6K3.7%<0.1%100 tps0.4s8K$0.09$0.09
18271Hermes 3 405B Instruct838±36K1.8%2.3%20 tps1.1s131K$0.80$0.80
19265Qwen 2.5 VL 72B Instruct849±102.6K6.3%5.3%25 tps3.7s128K$1.01$2.79
20274DeepHermes 3 Mistral 24B Preview850±121.7K4.7%2.5%50 tps1.0s33K$0.06$0.25
21271Mistral Large851±54.7K2.5%1.5%54 tps0.7s33K$2.00$6.00
22260Hermes 4 405B Reasoning FP8862±46.8K8.0%3.6%32 tps0.8s131K$1.00$3.00
23271Inflection 3 Pi862±37.3K1.4%1.1%33 tps3.4s8K$2.50$10.00
24265LFM2 2.6B863±52.2K6.3%6.7%184 tps0.4s33K$0.01$0.02
25246DeepSeek-R1 Distill Llama 70B874±56.5K3.4%3.6%27 tps1.6s32K$0.73$0.95
26265Magistral Small 2509876±73.4K4.8%2.7%116 tps0.6s131K$0.50$1.50
27246Hermes 4 70B878±91.3K4.3%1.1%67 tps0.6s131K$0.12$0.39
28260Open Mistral 7B879±35.4K2.4%0.7%176 tps0.4s33K$0.25$0.25
29265Mixtral-8x7B Instruct v0.1880±55.5K2.4%1.3%54 tps0.4s33K$0.60$0.60
30260Mistral Small881±34.8K2.5%1.7%142 tps0.6s32K$0.43$1.30
31265Inflection 3 Productivity883±47K1.7%0.6%50 tps3.2s8K$2.50$10.00
32229Magistral Medium 2509887±56.2K6.2%4.0%58 tps0.9s131K$2.00$5.00
33256Phi 4890±38.4K1.8%5.1%28 tps1.3s128K$0.10$0.32
34240Moonshot V1 32k893±54K1.2%1.4%53 tps1.4s33K$1.00$3.00
35256Mixtral 8x7B Instruct894±66.2K2.0%0.2%79 tps0.7s33K$0.23$0.31
36240Hermes 4 405B FP8897±102.1K5.6%3.5%31 tps0.9s131K$0.52$1.73
37260Apriel 1.6 15B Thinker898±111.1K2.1%2.6%92 tps0.4s131K$0$0
38265Ministral 3B 2512900±111.7K3.5%2.8%339 tps0.6s131K$0.10$0.10
39246Mixtral 8x22B Instruct900±56K2.2%1.8%142 tps0.7s66K$0.45$0.45
40256Solar Mini 250422901±53.7K4.4%1.8%90 tps1.7s33K$0.15$0.15
41214OpenAI o3-mini-high901±315.8K3.0%2.4%231 tps10.5s200K$1.10$4.40
42235Command R+902±56.6K2.2%2.8%36 tps0.7s128K$2.08$9.45
43253GPT-4 Turbo903±137854.3%4.7%21 tps1.9s128K$10.00$30.00
44246Mixtral 8x22B903±55.2K2.1%1.2%140 tps0.6s64K$2.00$6.00
45235GLM 4 32B904±211.7K2.1%2.6%40 tps1.6s33K$0.14$0.14
46253Gemma 2 27B906±36.9K1.8%1.4%44 tps1.4s8K$0.80$0.80
47256Gemma 3 1B909±47.1K3.0%0.6%176 tps1.0s33K$0.06$0.10
48235Gemma 3 4B909±212.6K1.9%1.3%138 tps0.7s131K$0.02$0.04
49246WizardLM-2 8x22B911±29.8K1.1%11.6%11 tps2.5s66K$0.77$0.77
50240Moonshot V1 8k911±64K1.8%1.0%55 tps1.5s8K$0.20$2.00
51229Krutrim Spectre V2912±47.4K1.1%<0.1%33 tps3.1s4K$0.19$0.19
52229Llama 3.1 8B915±81.3K2.9%1.9%61 tps1.0s8K$0.07$0.09
53214Moonshot V1 128k917±54.7K1.7%1.4%54 tps1.5s131K$2.00$5.00
54225Command R920±39.8K2.0%5.8%54 tps0.6s128K$0.30$0.99
55229Ministral 8B922±37.7K2.5%1.4%177 tps0.4s128K$0.14$0.14
56235Mixtral 8x7B923±45.3K2.2%2.2%142 tps0.6s33K$0.23$0.23
57225GPT-3.5 Turbo 16k923±310.7K1.6%<0.1%22 tps0.6s16K$3.00$4.00
58222Sky T1 32B Preview923±311.2K1.6%7.8%73 tps0.6s16K$0.12$0.18
59214Llama 3.3 70B Instruct Turbo925±54.2K3.3%2.0%78 tps1.0s131K$0.88$0.88
60240GPT-3.5 Turbo Instruct926±38K1.4%<0.1%46 tps1.2s4K$1.50$2.00
61240Llama 3.3 70B Instruct927±119002.7%5.3%28 tps1.3s128K$0.38$0.55
62240Mistral Nemo928±34.2K1.2%<0.1%112 tps0.4s131K$0.07$0.13
63229Moonshot V1 Auto928±64K1.6%1.2%54 tps1.5s8K$2.00$5.00
64246Ministral 3B929±38.6K2.2%0.8%248 tps0.4s131K$0.08$0.08
65229ERNIE 4.5 21B A3B Thinking930±62.7K3.6%1.8%87 tps1.5s120K$0.07$0.28
66214C4AI Aya Expanse 32B930±217.9K1.6%1.5%43 tps0.5s128K$0.50$1.50
67201GPT-4o mini932±49K3.4%2.1%71 tps1.7s128K$0.15$0.60
68186GLM 4.6V Flash933±47.9K3.7%3.7%64 tps2.1s128K$0.04$0.40
69214Qwen 2.5 7B934±47.5K2.3%3.7%40 tps1.9s131K$0.08$0.27
70201Gemma 3 27B IT938±39.7K1.7%2.0%60 tps0.8s128K$0.17$0.29
71209Llama 3.3 Swallow 70B Instruct938±310.7K3.2%1.4%153 tps1.3s131K$0.13$0.39
72225Command R 7B940±314K1.9%1.1%76 tps0.4s128K$0.04$0.15
73209Qwen 2.5 14B Instruct944±39K2.3%2.4%40 tps1.6s1M$0.40$1.61
74222Jamba 1.5 Large944±313.6K1.6%1.7%48 tps0.9s256K$1.50$6.00
75235Hermes 2 Pro Llama 3 8B945±28.8K1.0%<0.1%76 tps1.0s131K$0.08$0.09
76225Open Mistral Nemo946±37.3K2.1%1.5%171 tps0.5s131K$0.15$0.15
77214Krutrim 2947±211.8K0.6%12.5%33 tps2.1s128K$1.00$1.00
78186Grok 3 Mini948±228.5K3.9%1.2%43 tps0.5s131K$0.30$0.50
79209GPT-3.5 Turbo950±26K1.0%1.3%74 tps0.9s16K$0.75$1.75
80209Qwen3.5 9B FP8952±194953.9%5.8%64 tps0.7s256K$0.10$0.15
81201Devstral Small954±65.4K2.4%2.4%180 tps0.6s131K$0.10$0.30
82222Rnj-1 Instruct954±63K3.7%0.6%103 tps0.3s33K$0.15$0.15
83179GLM 4.7 Flash956±84.8K1.8%5.8%61 tps2.8s128K$0.07$0.39
84214Gemma 3 12B956±39.8K1.9%4.2%73 tps0.8s131K$0.05$0.12
85179Switchpoint Router957±48.5K2.0%1.7%71 tps4.9s131K$0.85$3.40
86170Kimi K2 0711957±223.3K2.3%1.6%29 tps1.3s131K$0.72$2.60
87201Mistral Small 24B Instruct958±46.8K2.1%1.5%84 tps0.4s33K$0.80$0.80
88214Qwen 2.5 VL 32B Instruct958±121.6K5.4%6.3%43 tps3.2s128K$0.35$0.62
89186Grok 3 Mini Fast958±226.4K4.4%1.6%44 tps0.5s131K$0.60$4.00
90194GLM 4.5 Flash960±161.4K4.8%12.2%15 tps2.2s131K$0$0
91209Seed 1.6 Flash 250715960±53.6K3.1%2.5%108 tps1.6s256K$0.07$0.30
92201Llama 3 8B960±213.1K1.8%6.0%85 tps0.7s8K$0.12$0.16
93161DeepSeek Prover v2961±63.3K1.8%5.2%14 tps1.3s164K$0.40$1.56
94201ERNIE 4.5 VL 424B A47B961±101.5K5.7%4.9%36 tps3.5s123K$0.42$1.25
95177OpenAI o3-mini962±233.6K4.2%0.8%143 tps3.3s200K$1.10$4.40
96194Magistral Small 2506966±317.5K1.5%1.6%156 tps0.5s40K$0.37$1.10
97175OpenAI o3-mini-low966±230.5K4.6%0.7%139 tps1.5s200K$1.10$4.40
98165DeepSeek R1T2 Chimera967±45.9K3.3%3.0%28 tps1.8s164K$0.13$0.45
99194Llama 3.2 11B Instruct967±29.6K1.9%1.5%152 tps0.5s8K$0.16$0.16
100175MiMo V2 Flash971±139004.3%7.2%24 tps1.9s262K$0.07$0.23
101179Qwen 2.5 72B972±45.6K2.1%1.2%96 tps1.2s131K$0.14$0.26
102194Mistral Small 3 24B Instruct972±47.7K1.5%2.6%77 tps0.6s33K$0.07$0.14
103157GPT-5 Nano974±310.1K6.0%3.2%113 tps20.9s400K$0.05$0.40
104186Gemma 3n E4B976±225.5K1.8%2.0%30 tps0.5s8K$0.01$0.02
105179Llama 3.1 70B Instruct976±149252.6%6.3%30 tps0.8s128K$0.17$0.22
106194Llama 3.3 70B976±310.8K4.1%0.3%500 tps0.5s8K$0.48$0.66
107186Jamba 1.6 Large977±215.8K1.3%2.0%59 tps1.2s256K$1.33$5.33
108179Inception Mercury979±228K1.8%0.4%257 tps1.1s32K$0.25$1.00
109179Amazon Nova Pro 1.0982±224.5K1.6%0.9%96 tps0.7s300K$0.80$1.70
110177Mistral Small 3.1 24B Instruct982±211.2K1.8%7.5%15 tps2.4s131K$0.06$0.18
111153OpenAI o1982±418.6K2.5%4.2%92 tps5.5s200K$15.00$60.00
112186Gemma 3 27B983±63.5K3.7%1.8%35 tps1.1s66K$0.06$0.10
113179Baichuan-M2-32B983±71.9K5.9%<0.1%32 tps3.3s131K$0.07$0.07
114148OpenAI o3987±312K2.6%0.9%85 tps6.8s128K$7.33$29.33
115133Kimi K2 0905988±316.2K3.9%4.0%30 tps1.4s262K$0.63$2.39
116148OpenAI o4-mini-high988±233.5K4.5%1.9%117 tps15.9s200K$1.10$4.40
117201Qwen 2.5 7B Turbo992±72.6K2.8%0.5%125 tps0.4s131K$0.30$0.30
118194Llama 3 70B993±91.9K1.3%4.5%21 tps1.7s8K$1.08$1.38
119165Pixtral Large994±49.9K2.6%2.5%57 tps1.3s128K$1.50$4.50
120170Mistral Small 3.2 24B994±315.2K2.5%2.8%141 tps0.7s33K$0.02$0.08
121186Mistral Small 3.2 24B Instruct995±92.1K5.0%1.9%113 tps1.1s131K$0.02$0.08
122161Qwen3 8B996±49.9K5.5%2.4%61 tps1.4s41K$0.02$0.07
123160Llama 4 Scout997±266.9K2.4%0.6%88 tps5.1s131K$0.18$0.46
124170Devstral Medium997±311.7K2.9%1.5%77 tps0.6s131K$0.40$2.00
125165Qwen3 4B998±312.9K6.5%1.9%94 tps1.5s128K$0.01$0.01
126186Jamba 1.7 Large998±52.8K4.9%1.3%58 tps1.0s256K$1.33$5.33
127157Qwen3 Next 80B A3B Thinking998±316.7K5.2%0.6%175 tps1.3s256K$0.21$2.26
128194INTELLECT-3999±108952.7%1.5%114 tps0.6s131K$0.20$1.10
129161Llama 4 Maverick999±173.4K2.5%1.2%88 tps2.4s1M$0.23$0.83
130126Qwen3 VL 235B A22B Thinking999±39.9K6.2%4.3%47 tps3.0s127K$0.47$3.31
131148DeepSeek-R11001±313.5K2.4%0.8%133 tps0.6s64K$0.91$3.07
132139OpenAI o4-mini1003±221.7K4.3%1.4%97 tps7.0s128K$1.10$4.40
133139Seed 2.0 Mini (Medium)1004±92.2K2.7%11.9%33 tps1.7s256K$0.15$0.60
134161Mistral Small 3.11006±39.7K2.0%7.4%13 tps2.6s32K$0.17$0.28
135143Gemini 2.0 Flash Lite1008±263.2K3.2%<0.1%42 tps0.5s1M$0.08$0.30
13671Gemini 3.1 Flash Lite Preview1008±102.5K2.5%1.0%8 tps1.2s1M$0.25$1.50
137129Qwen3 Max Thinking1010±58.9K1.0%13.5%32 tps2.3s256K$1.20$6.00
138153Qwen 2.5 32B Instruct1011±316.8K3.1%2.5%48 tps1.0s131K$0.21$0.25
139124Kimi K2 0905 Turbo1012±223.1K5.9%0.7%373 tps0.5s262K$1.70$6.50
140246Amazon Nova Micro 1.01015±161.3K1.6%4.1%193 tps0.6s128K$0.04$0.07
141157Cogito v2.1 671B1017±54.6K1.9%0.8%85 tps0.5s128K$1.25$1.25
142165Qwen3 VL 30B A3B Thinking1020±43.5K6.5%4.5%84 tps2.9s127K$0.20$1.47
143133DeepSeek-R1 05281020±312.3K2.1%1.3%93 tps0.5s64K$1.60$3.67
144148OpenAI o1-pro1021±99855.3%5.2%33 tps72.8s200K$150.00$600.00
145170Devstral Small 25071021±71.4K3.9%2.2%186 tps0.5s131K$0.10$0.30
146129DeepSeek V3.1 Thinking1022±312.7K6.2%7.1%18 tps1.8s131K$0.23$0.75
147139Qwen3 VL 30B A3B Instruct1022±72.1K4.8%1.8%80 tps2.6s129K$0.18$0.67
148165ERNIE 4.5 21B A3B1023±61.7K3.4%2.3%78 tps1.5s120K$0.05$0.19
149143Gemini 2.0 Flash1024±229.2K2.1%<0.1%76 tps0.5s1M$0.14$0.56
150139GLM 4.6V1024±210K2.3%6.4%21 tps1.8s128K$0.38$0.90
151133DeepSeek V3.2 Speciale1026±37.5K2.8%6.0%43 tps1.4s131K$0.84$1.52
152170Llama 3.1 8B Turbo1027±38.2K1.4%2.1%650 tps0.5s128K$0.13$0.14
153148Qwen3 30B A3B Thinking 25071027±37.7K2.3%0.5%124 tps1.2s131K$0.16$1.70
154113Gemini 2.5 Flash Lite Thinking1033±319K4.9%1.0%118 tps4.4s1M$0.03$0.13
155143Mistral Medium 31034±101.5K2.9%2.4%47 tps0.8s33K$0.40$2.00
156126DeepSeek V31035±257.9K1.7%0.9%69 tps1.1s64K$0.59$1.49
157113Kimi K2 Fast1037±2107.2K4.5%0.8%365 tps0.5s131K$1.00$3.00
158143Seed 1.6 2506151037±55K2.0%3.1%46 tps2.2s256K$0.25$2.00
159133Qwen3 14B1037±212.5K5.7%1.7%109 tps0.8s41K$0.04$0.15
160129Seed 2.0 Mini (Low)1038±137803.1%10.7%33 tps1.8s256K$0.20$0.80
161129Command A1039±282.1K2.2%2.2%42 tps0.8s256K$2.00$7.33
162133GPT-4.1 nano1040±161.4K2.5%0.6%175 tps0.5s1M$0.10$0.40
163124Qwen3 235B A22B Thinking 25071044±36.5K2.4%2.5%53 tps1.6s131K$0.59$5.70
164113GLM 4.51044±215.4K5.0%3.7%46 tps1.4s131K$0.43$1.63
165113Mistral Medium1048±237.8K2.5%1.8%48 tps0.6s33K$1.48$4.55
16681OpenAI o3-pro1049±46.7K3.4%5.2%22 tps70.8s200K$20.00$80.00
16795Gemini 2.5 Flash Lite Thinking Preview 09251051±414.2K4.3%1.5%152 tps3.0s1M$0.10$0.40
168126Qwen3 30B A3B1051±215.1K4.4%5.1%163 tps1.0s41K$0.06$0.21
169101GPT-5 (Low)1051±52.1K1.4%1.8%75 tps8.2s400K$1.25$10.00
170119GLM 4.7 FP81053±62.7K1.1%6.9%40 tps1.3s200K$0.30$1.20
171106Claude Sonnet 3.5 v21055±221.4K1.9%<0.1%46 tps1.4s200K$3.00$15.00
172118GPT-4.1 mini1055±267.2K2.2%1.1%67 tps0.9s1M$0.34$1.60
173113GLM 4.5 AirX1057±44.1K3.0%3.3%75 tps1.2s131K$1.10$4.50
17495Gemini 2.5 Flash1058±1118.2K1.8%1.3%2 tps3.7s1M$0.30$2.50
175119ERNIE 4.5 300B A47B1058±251.6K1.9%4.7%23 tps2.3s123K$0.28$1.10
176121Qwen3 32B Fast1059±225K3.8%11.6%30 tps3.1s41K$0.10$0.25
177143Solar Pro 2 2512151059±99852.5%1.8%107 tps1.5s66K$0.15$0.60
178101Gemini 2.5 Flash Lite1060±250.1K4.8%1.3%210 tps0.7s1M$0.10$0.40
179111Grok 3 Fast1060±312.4K1.1%1.7%52 tps2.4s131K$5.00$25.00
180106Grok 31063±265.8K2.6%1.5%53 tps0.6s1M$3.67$18.33
181121NVIDIA Llama 3.3 Nemotron Super 49B v1.51064±54.5K3.7%2.0%50 tps0.6s131K$0.09$0.33
182101Qwen3.5 35B A3B1064±82.1K2.3%2.1%116 tps2.1s256K$0.63$1.13
18386Claude Sonnet 41065±2113.6K2.4%1.8%49 tps1.3s200K$3.00$15.00
184106DeepSeek V3 03241065±146.7K2.5%5.8%12 tps2.7s164K$0.38$0.93
185121QwQ 32B1068±228.3K4.4%5.4%41 tps2.1s16K$0.43$0.56
18695DeepSeek V3.2 Exp Thinking1068±39.9K2.7%7.2%26 tps3.0s131K$0.28$0.42
18795DeepSeek-R1 Turbo1069±37.3K2.9%2.6%29 tps1.8s64K$2.85$4.75
188106DeepSeek V3.1 Terminus Thinking1071±38.5K4.9%5.9%27 tps1.8s131K$0.56$1.68
18995Kimi K2 Thinking1074±38.3K2.9%4.2%61 tps5.9s262K$0.24$1.03
190153Apriel 1.5 15B Thinker1076±52.1K1.6%2.4%146 tps0.4s131K$0$0
19193DeepSeek V3 0324 Turbo1076±256.4K2.9%6.3%12 tps2.4s164K$0.73$1.79
19271Gemini 2.5 Flash Thinking1076±222.8K2.3%2.2%88 tps6.4s1M$0.30$2.50
19393Qwen Max1078±165.6K2.1%1.5%49 tps1.5s33K$1.60$6.40
194153Ministral 14B 3.01079±53K3.4%2.0%119 tps0.5s128K$0.20$0.20
19584GPT-5 Mini Minimal1081±36.8K6.5%1.2%63 tps1.4s400K$0.25$2.00
19671GPT-5 Mini1082±217.5K4.3%2.6%66 tps14.2s400K$0.25$2.00
197111LongCat Flash Chat1082±46.5K3.2%0.8%85 tps0.9s131K$0.14$0.68
19886Seed 2.0 Lite (Medium)1082±62.1K1.9%6.6%33 tps1.6s256K$0.25$2.00
19971Gemini 2.5 Flash Lite Preview 09251083±220.9K4.8%1.2%209 tps0.7s1M$0.25$0.35
20056Gemini 3.1 Flash Lite Preview Thinking1084±63.6K3.1%1.7%75 tps4.7s1M$0.25$1.50
201133Nemotron 3 Nano1087±51.9K2.5%1.3%216 tps0.8s256K$0.05$4.94
20281GPT-4o1088±230.3K2.1%1.0%49 tps2.4s128K$3.71$12.57
20379Qwen3 Max Thinking Preview1089±217.8K3.3%3.1%40 tps2.1s256K$1.20$6.00
20486Qwen3 235B A22B1090±311.9K5.1%5.3%71 tps0.9s41K$0.23$0.63
20571Qwen3.5 397B A17B1092±57.2K1.8%4.3%57 tps1.4s256K$0.52$3.00
206101gpt-oss-20b1093±220.3K4.6%0.5%216 tps0.5s131K$0.06$0.26
20781Qwen3.5 27B1097±62.3K2.4%3.7%55 tps2.6s256K$0.30$2.40
20862GPT-5.1 Instant1098±221.7K2.4%1.3%50 tps1.9s400K$1.25$10.00
20968Qwen Plus (Aug'24)1098±260.9K2.4%1.4%53 tps1.3s30K$0.40$1.20
21086Amazon Nova 2 Lite1099±312.6K3.1%1.0%137 tps0.6s300K$0.35$2.95
211101DeepSeek V3 (Turbo)1100±34.8K2.5%1.5%32 tps1.5s64K$0.40$1.30
21268Grok 41100±1120.3K2.1%3.9%29 tps11.1s256K$3.00$15.00
21368GLM 4.71101±335.7K2.1%5.8%40 tps1.5s200K$0.77$1.73
21460Gemini 2.5 Flash Preview 09251102±219.5K4.3%1.2%5 tps0.9s1M$0.13$0.97
21586DeepSeek V3.1 Chat1102±313.4K4.1%2.8%21 tps1.6s131K$0.38$1.00
21671Seed 1.8 2512281104±319K1.5%3.7%41 tps2.1s256K$0.25$2.00
21784MiniMax M2.51105±82.1K1.6%1.4%70 tps1.9s205K$0.28$1.20
21865GLM 4.61108±325.8K4.3%5.4%39 tps1.5s200K$0.42$1.66
21986DeepSeek V3.1 Nex N11112±62.1K1.7%3.4%24 tps7.2s131K$0.14$0.50
22095Qwen3 32B1117±63.3K2.8%3.9%30 tps3.1s41K$0.12$0.42
22152GPT-51119±244.3K3.9%3.1%78 tps23.1s400K$1.25$9.67
22279MiniMax M2.5 Lightning1121±45.6K1.3%1.5%51 tps2.0s205K$0.60$2.40
22365Mistral Large 31122±314.3K3.3%2.1%51 tps1.0s256K$0.50$1.50
22471DeepSeek V3.11124±36.8K2.0%0.8%197 tps0.4s164K$0.55$1.60
22565DeepSeek V3.2 Exp Chat1124±314.3K4.0%2.6%29 tps1.5s131K$0.27$0.39
22662MiniMax M21125±233.6K3.5%2.2%39 tps2.3s205K$0.21$0.85
22786Nemotron 3 Nano (Thinking)1127±37.5K2.4%2.0%200 tps0.5s256K$0$0
22856DeepSeek V3.2 Thinking1127±337.6K2.6%9.0%30 tps2.6s131K$0.28$0.42
22952Grok 4 Fast Non-Reasoning1128±321.3K4.7%1.5%93 tps0.6s2M$0.27$0.67
23044Grok 4.1 Fast Reasoning1128±257K3.1%1.5%58 tps7.3s2M$0.20$0.50
23152Claude Haiku 4.51128±231.4K3.7%1.1%100 tps0.9s200K$1.00$5.00
23248Grok 4 Fast Reasoning1128±225.9K3.9%2.1%102 tps3.1s2M$0.30$0.75
23360MiniMax M2.11129±241.8K2.0%2.1%66 tps2.6s205K$0.30$1.20
23442GPT-5.2 (Extra High) 1133±320.9K1.9%13.2%17 tps20.5s400K$1.75$14.00
23544Gemini 2.5 Pro1136±168.8K3.9%2.3%45 tps2.6s1M$1.25$10.00
23644Kimi K2 Thinking Turbo1137±329.8K2.5%2.0%75 tps1.4s262K$1.15$8.00
23748Claude Sonnet 4 (Thinking)1138±230.7K2.6%1.5%52 tps1.5s200K$3.00$13.67
23837Claude Sonnet 4.51139±237.7K4.3%1.4%41 tps1.3s200K$1.80$9.00
23956DeepSeek V3.1 Turbo1140±214.5K2.3%0.9%173 tps1.3s164K$2.00$3.75
24071MiniMax M2.5 FP81141±42.9K1.7%3.6%33 tps1.7s205K$0.45$1.75
24148gpt-oss-120b1144±240.7K3.7%0.7%213 tps0.5s131K$0.11$0.50
24240DeepSeek V3.21144±320.7K1.9%1.4%83 tps5.1s131K$0.43$1.09
24340Qwen3 235B A22B Instruct 25071147±232.2K4.7%6.8%13 tps1.9s262K$0.13$0.52
24442Qwen3 Max Instruct Preview1148±236.6K3.5%1.1%31 tps1.7s256K$1.43$6.61
24544DeepSeek V3.1 Terminus Chat1150±317.8K4.2%3.4%27 tps1.5s131K$0.86$1.80
24633Kimi K2.51151±332.5K1.8%6.5%33 tps1.7s262K$0.34$2.57
24752Qwen3.5 122B A17B1151±44.7K1.6%1.5%82 tps1.4s256K$0.40$3.20
24833Qwen3 30B A3B Instruct 25071158±231.6K4.1%1.2%55 tps1.3s131K$0.13$0.72
24917Gemini 3 Flash Preview1159±317.8K2.1%1.3%138 tps1.4s1M$0.50$3.00
25032Gemini 2.5 Pro High1159±242.7K4.5%1.5%48 tps2.3s1M$1.25$10.00
25133Qwen3 Next 80B A3B Instruct1161±224.9K3.8%0.6%84 tps1.1s256K$0.20$1.42
25248Step 3.5 Flash1164±54K1.5%2.2%109 tps0.6s256K$0.05$0.15
25356MiniMax M2.1 Lightning1165±54.9K1.2%1.7%52 tps2.1s205K$0.30$2.40
25429MiniMax M2.71167±81.1K1.8%3.0%34 tps2.5s205K$0.30$1.20
25562Qwen3 Omni 30B A3B Instruct1168±53K2.3%3.9%65 tps1.2s66K$0.35$0.97
25637Kimi K2.5 Instant1171±46.2K1.8%2.9%32 tps3.0s262K$0.50$3.00
25726Claude Haiku 4.5 (Extended Thinking)1173±224.3K3.1%1.4%115 tps0.7s200K$1.00$5.00
25817Claude Opus 4.51173±222.5K2.2%1.5%45 tps1.5s200K$5.00$25.00
25917Grok 4.20 Beta Reasoning1175±73.3K1.8%1.1%77 tps4.5s2M$2.00$5.50
26016GPT-5.21176±222.6K1.8%4.1%18 tps2.7s400K$1.75$14.00
26126GPT-5 (High)1177±222.1K3.1%4.5%81 tps35.9s400K$1.25$10.00
262106GPT-5.4 nano1177±106502.3%0.7%149 tps0.5s400K$0.20$1.25
26326Grok 4.1 Fast Non-Reasoning1177±225.7K3.0%0.9%101 tps0.5s2M$0.20$0.50
26417GPT-5.2 (High)1180±254.6K1.9%6.7%18 tps16.3s400K$1.75$14.00
26514Gemini 3 Pro (Low)1180±328.9K2.2%2.4%51 tps3.5s1M$2.00$12.00
26622GLM 51182±417.3K2.1%3.4%36 tps2.7s200K$0.72$2.55
26733Grok 4.20 Multi Agent Beta1183±92.6K2.0%1.2%56 tps8.8s2M$2.00$6.00
26837Qwen3 Omni 30B A3B Thinking1186±37.5K2.1%3.7%67 tps1.2s66K$0.97$1.79
26929Nova Experimental Chat 12-101188±39.8K1.9%2.4%84 tps12.9s98K$0$0
27029Qwen3 VL 235B A22B Instruct1188±313.5K5.2%3.1%75 tps1.9s129K$0.37$1.81
27114Gemini 3 Flash Preview Thinking1190±247K2.3%1.6%3 tps6.2s1M$0.50$3.00
27222Grok 4.20 Beta Non-reasoning1192±111.3K3.1%1.1%151 tps0.6s2M$2.00$6.00
27322GPT-5 Chat1196±175.1K3.4%1.3%95 tps0.9s400K$1.25$10.00
27413GPT-5.3 Instant1199±69.3K1.7%0.9%63 tps0.8s400K$1.75$14.00
27517GPT-5.4 mini1203±108852.7%0.8%148 tps0.5s400K$0.75$4.50
27610Gemini 3 Pro1207±178K2.2%2.1%50 tps3.6s1M$2.00$12.00
27722MiniMax M2.7-highspeed1207±101.1K2.1%2.3%50 tps2.1s205K$0.60$2.40
27810Claude Sonnet 4.5 (Thinking)1228±166.2K3.4%1.9%44 tps1.1s200K$3.00$15.00
27910GPT-5.2 Instant1232±239.3K1.6%1.7%52 tps2.0s400K$1.75$14.00
2806Gemini 3.1 Pro1245±326K2.0%3.5%35 tps4.1s1M$2.00$12.00
2818GPT-5.1 (High)1250±337.8K2.4%3.2%76 tps6.9s400K$1.25$10.00
2827Claude Opus 4.5 (Thinking)1260±261K1.8%1.8%49 tps1.4s200K$5.00$25.00
2838GPT-5.11264±227.6K2.3%2.3%71 tps1.4s400K$1.42$11.33
2845Claude Sonnet 4.6 (Thinking)1305±317.7K2.7%4.7%57 tps1.1s200K$3.00$15.00
2854Claude Sonnet 4.61314±416.6K1.2%1.6%47 tps1.2s200K$3.00$15.00
2862GPT-5.41343±55.8K1.3%2.6%55 tps0.8s1M$2.50$15.00
2872Claude Opus 4.61381±321.8K1.1%2.1%48 tps1.7s200K$5.00$25.00
2881Claude Opus 4.6 (Thinking)1398±416.9K1.4%2.5%56 tps1.6s200K$5.00$25.00
Show Less