Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1596
Claude Opus 4.6
1594
Claude Sonnet 4.6
1594
GPT-5.4
1566
Claude Opus 4.6 (Thinking)
1506
Claude Sonnet 4.6 (Thinking)
1464
GPT-5.4 (High)
1446
Claude Opus 4.5 (Thinking)
1418
Gemini 3.1 Pro
1409
Claude Opus 4.5
1381
GPT-5.3 Codex (High)
1362
Claude Sonnet 4.5 (Thinking)
1358
GPT-5.2 Instant
1353
Claude Haiku 4.5 (Extended Thinking)
1352
Claude Opus 4 (Thinking)
1340
GPT-5.2

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.61596±621.6K1.1%2.1%48 tps1.7s200K$5.00$25.00
21Claude Sonnet 4.61594±1015.7K1.4%1.6%47 tps1.2s200K$3.00$15.00
31GPT-5.41594±144.4K1.6%2.6%55 tps0.8s1M$2.50$15.00
44Claude Opus 4.6 (Thinking)1566±816.5K1.6%2.5%56 tps1.6s200K$5.00$25.00
55Claude Sonnet 4.6 (Thinking)1506±816.2K3.5%4.7%57 tps1.1s200K$3.00$15.00
66GPT-5.4 (High)1464±124.9K3.9%4.6%68 tps7.9s1M$2.50$15.00
76Claude Opus 4.5 (Thinking)1446±460.8K1.9%1.8%49 tps1.4s200K$5.00$25.00
87Gemini 3.1 Pro1418±922K2.5%3.5%35 tps4.1s1M$2.00$12.00
97Claude Opus 4.51409±515.1K2.2%1.5%45 tps1.5s200K$5.00$25.00
109GPT-5.3 Codex (High)1381±93.2K1.2%2.0%61 tps17.8s400K$1.75$14.00
1110Claude Sonnet 4.5 (Thinking)1362±458.2K3.3%1.9%44 tps1.1s200K$3.00$15.00
1210GPT-5.2 Instant1358±615.7K3.3%1.7%52 tps2.0s400K$1.75$14.00
1312Claude Haiku 4.5 (Extended Thinking)1353±414.3K3.8%1.4%115 tps0.7s200K$1.00$5.00
1413Claude Opus 4 (Thinking)1352±52.6K2.6%<0.1%28 tps1.3s200K$15.00$75.00
1513GPT-5.21340±811.3K3.2%4.1%18 tps2.7s400K$1.75$14.00
1613Gemini 3 Pro1337±559.4K2.6%2.1%50 tps3.6s1M$2.00$12.00
1715GLM 51324±1411.7K3.3%3.4%36 tps2.7s200K$0.72$2.55
1815GPT-5.11319±712.9K3.4%2.3%71 tps1.4s400K$1.42$11.33
1917Claude Sonnet 4.51307±320.9K5.0%1.4%41 tps1.3s200K$1.80$9.00
2017GPT-5.2 (High)1297±830.7K2.8%6.7%18 tps16.3s400K$1.75$14.00
2119Kimi K2.51291±1116.5K3.4%6.5%33 tps1.7s262K$0.34$2.57
2221GPT-5.1 (Medium)1291±93.2K6.4%<0.1%86 tps3.8s400K$0.83$6.67
2319Gemini 3 Pro (Low)1291±611.9K4.2%2.4%51 tps3.5s1M$2.00$12.00
2419GPT-5.1 (High)1290±619.1K3.5%3.2%76 tps6.9s400K$1.25$10.00
2519Gemini 3 Flash Preview Thinking1286±632.7K3.3%1.6%3 tps6.2s1M$0.50$3.00
2619Claude Haiku 4.51283±316.4K4.5%1.1%100 tps0.9s200K$1.00$5.00
2719MiniMax M2.51283±285103.8%1.4%70 tps1.9s205K$0.28$1.20
2819GPT-5.3 Codex (Medium)1278±271.1K2.3%2.3%62 tps10.3s400K$1.75$14.00
2929Claude Opus 41274±412.4K2.7%<0.1%25 tps1.5s200K$15.00$75.00
3029Claude Opus 4.1 (Thinking)1272±57.7K5.2%<0.1%20 tps3.9s200K$15.00$75.00
3119GPT-5.3 Instant1271±124.2K2.5%0.9%63 tps0.8s400K$1.75$14.00
3227Claude Sonnet 4 (Thinking)1261±325.9K2.9%1.5%52 tps1.5s200K$3.00$13.67
3327GPT-5 Codex (High)1260±718.5K3.3%3.2%122 tps7.1s400K$1.25$10.00
3427GPT-5 (High)1259±416.2K3.5%4.5%81 tps35.9s400K$1.25$10.00
3527GPT-5.2 Codex (High)1257±123.1K2.8%8.8%41 tps12.9s400K$1.75$14.00
3636Claude Opus 4.11254±47.1K4.6%3.0%17 tps3.7s200K$15.00$75.00
3731GPT-5.1 Codex (High)1240±837K3.3%3.2%96 tps3.9s400K$1.25$10.00
3831Grok 4.1 Fast Non-Reasoning1239±69.4K5.4%0.9%101 tps0.5s2M$0.20$0.50
3931GPT-5 Chat1231±435K4.5%1.3%95 tps0.9s400K$1.25$10.00
4031Qwen3 Next 80B A3B Instruct1231±58.8K5.8%0.6%84 tps1.1s256K$0.20$1.42
4137Nova Experimental Chat 11-101230±85.2K6.3%0.4%84 tps8.9s98K$0$0
4231MiniMax M2.5 Lightning1228±141.7K3.2%1.5%51 tps2.0s205K$0.60$2.40
4337Polaris Alpha1226±147555.6%<0.1%48 tps1.1s256K$0$0
4444GPT-4.5 Preview1223±72.5K1.8%<0.1%36 tps3.0s200K$75.00$150.00
4544Nova Experimental Chat 10-201221±54.4K8.1%<0.1%30 tps0.5s98K$0$0
4636GPT-5.2 (Extra High) 1221±98K3.5%13.2%17 tps20.5s400K$1.75$14.00
4736Qwen3 VL 235B A22B Instruct1220±75.6K6.7%3.1%75 tps1.9s129K$0.37$1.81
4836Qwen3.5 122B A17B1216±151.9K3.1%1.5%82 tps1.4s256K$0.40$3.20
4936GPT-5 Codex (Medium)1214±68.8K3.9%4.1%122 tps5.2s400K$1.25$10.00
5036GPT-5.2 Codex (Medium)1211±122.4K3.0%5.7%37 tps6.3s400K$1.75$14.00
5136Qwen3.5 27B1211±169104.7%3.7%55 tps2.6s256K$0.30$2.40
5236Kimi K2.5 Instant1210±81.8K3.2%2.9%32 tps3.0s262K$0.50$3.00
5353Claude Sonnet 3.7 (Thinking)1210±313.6K3.1%<0.1%41 tps2.6s200K$3.00$15.00
5453Mistral Medium 3.11206±516.4K5.1%<0.1%77 tps0.7s128K$0.40$2.00
5543Claude Sonnet 41205±343.2K3.7%1.8%49 tps1.3s200K$3.00$15.00
5643Gemini 3 Flash Preview1205±117.2K3.7%1.3%138 tps1.4s1M$0.50$3.00
5743Gemini 2.5 Pro High1204±321.1K5.7%1.5%48 tps2.3s1M$1.25$10.00
5843Qwen3 Max Instruct Preview1203±616.1K4.6%1.1%31 tps1.7s256K$1.43$6.61
5958Claude Sonnet 3.71201±412.1K3.2%<0.1%39 tps1.6s200K$3.00$15.00
6043GPT-5.1 Codex Max1200±126.4K3.9%3.0%118 tps4.1s400K$1.25$10.00
6143MiniMax M2.1 Lightning1197±238753.3%1.7%52 tps2.1s205K$0.30$2.40
6249Qwen3 30B A3B Instruct 25071194±512.7K5.7%1.2%55 tps1.3s131K$0.13$0.72
6349Kimi K2 Thinking Turbo1192±620.3K3.4%2.0%75 tps1.4s262K$1.15$8.00
6462OpenAI o1-mini1192±415K4.6%<0.1%118 tpsN/A128K$1.13$4.51
6549MiniMax M2.11192±819.4K3.6%2.1%66 tps2.6s205K$0.30$1.20
6649DeepSeek V3.21189±85.1K4.7%1.4%83 tps5.1s131K$0.43$1.09
6762Qwen Plus 07281189±82.1K7.5%<0.1%55 tps0.9s1M$0.40$1.20
6849MiniMax M2.5 FP81185±176103.2%3.6%33 tps1.7s205K$0.45$1.75
6949GPT-51185±421.3K5.3%3.1%78 tps23.1s400K$1.25$9.67
7049Grok 4 Fast Non-Reasoning1185±58.1K7.1%1.5%93 tps0.6s2M$0.27$0.67
7149MiniMax M21183±519.7K4.2%2.2%39 tps2.3s205K$0.21$0.85
7249Nova Experimental Chat 12-101182±92.9K3.8%2.4%84 tps12.9s98K$0$0
7349GLM 4.61182±717.2K4.4%5.4%39 tps1.5s200K$0.42$1.66
7449GPT-5.3 Codex (Low)1178±285101.0%1.8%61 tps4.3s400K$1.75$14.00
7560Grok 4.1 Fast Reasoning1178±739.5K4.4%1.5%58 tps7.3s2M$0.20$0.50
7660DeepSeek V3.2 Thinking1178±923.3K4.0%9.0%30 tps2.6s131K$0.28$0.42
7760Grok 4 Fast Reasoning1177±314.5K5.0%2.1%102 tps3.1s2M$0.30$0.75
7860Gemini 2.5 Pro1176±337.9K4.8%2.3%45 tps2.6s1M$1.25$10.00
7975Gemini 2.5 Flash Thinking Preview 09251173±79.2K6.8%<0.1%111 tps4.7s1M$0.30$2.50
8060Qwen3 235B A22B Instruct 25071172±412.6K6.4%6.8%13 tps1.9s262K$0.13$0.52
8160Claude Sonnet 3.5 v21171±65.5K3.4%<0.1%46 tps1.4s200K$3.00$15.00
8260GPT-5.1 Codex (Medium)1171±143K3.2%4.6%71 tps3.7s400K$1.25$10.00
8360GPT-5.1 Instant1171±88.3K4.1%1.3%50 tps1.9s400K$1.25$10.00
8475Gemini 2.5 Pro Low1170±49.6K8.1%<0.1%89 tps2.4s1M$1.25$10.00
8560Grok 4.20 Beta Reasoning1167±221.2K4.1%1.1%77 tps4.5s2M$2.00$5.50
8669gpt-oss-120b1165±519.2K5.0%0.7%213 tps0.5s131K$0.11$0.50
8769Qwen3.5 35B A3B1164±258653.9%2.1%116 tps2.1s256K$0.63$1.13
8869GPT-5 Codex (Low)1163±105K4.1%2.7%112 tps3.5s400K$1.25$10.00
8969GLM 4.71161±716.8K3.7%5.8%40 tps1.5s200K$0.77$1.73
9086Gemini 2.5 Flash Preview1161±83K1.1%<0.1%138 tps6.9s1M$0.15$0.60
9186GPT-5 (Minimal)1158±58.3K7.4%<0.1%67 tps1.4s400K$1.25$10.00
9269DeepSeek V3.1 Terminus Chat1158±56.5K6.9%3.4%27 tps1.5s131K$0.86$1.80
9374Qwen Plus (Aug'24)1146±517.2K4.7%1.4%53 tps1.3s30K$0.40$1.20
9474Qwen3.5 397B A17B1142±142.5K2.9%4.3%57 tps1.4s256K$0.52$3.00
9574Gemini 2.5 Flash Preview 09251140±67.6K6.0%1.2%5 tps0.9s1M$0.13$0.97
9693Gemini 2.5 Flash Preview Thinking1136±101.4K1.8%<0.1%26 tps1.8s1M$0.15$1.76
9797Grok 3 Beta1134±92K0.8%<0.1%58 tps0.8s131K$3.00$15.00
9877Mistral Large 31131±85.4K5.8%2.1%51 tps1.0s256K$0.50$1.50
9977GPT-5 Mini1131±58.6K5.4%2.6%66 tps14.2s400K$0.25$2.00
10077DeepSeek V3.1 Turbo1130±74.8K5.3%0.9%173 tps1.3s164K$2.00$3.75
10177Grok 4.20 Multi Agent Beta1129±199453.6%1.2%56 tps8.8s2M$2.00$6.00
10277Qwen3 Max Thinking Preview1127±106.3K5.7%3.1%40 tps2.1s256K$1.20$6.00
10377Grok 41125±339.6K4.4%3.9%29 tps11.1s256K$3.00$15.00
10497Ministral 8B 25121125±155107.3%<0.1%174 tps0.5s128K$0.15$0.15
10577GPT-4.11123±532.8K5.2%3.7%112 tps1.3s1M$2.00$8.00
10677Gemini 2.5 Flash Lite Preview 09251122±78.5K6.6%1.2%209 tps0.7s1M$0.25$0.35
10797Gemini 2.5 Pro Preview 06051121±101.7K2.3%<0.1%0 tps3.7s1M$1.25$10.00
10885Gemini 2.5 Flash Thinking1118±413.7K3.6%2.2%88 tps6.4s1M$0.30$2.50
10985GPT-5 Mini Minimal1114±123.2K8.5%1.2%63 tps1.4s400K$0.25$2.00
11085GPT-5.2 Codex (Low)1113±191.2K3.2%4.5%41 tps5.0s400K$1.75$14.00
111108Gemini 2.5 Pro Preview 03251111±111.5K3.2%<0.1%3 tps16.6s1M$1.25$10.00
11285DeepSeek V3.1 Chat1110±74.9K6.6%2.8%21 tps1.6s131K$0.38$1.00
11385Qwen3 Omni 30B A3B Thinking1110±102.3K6.0%3.7%67 tps1.2s66K$0.97$1.79
11490DeepSeek V3.2 Exp Chat1107±45.5K6.1%2.6%29 tps1.5s131K$0.27$0.39
11590Qwen Max1107±418.3K4.2%1.5%49 tps1.5s33K$1.60$6.40
116114GPT-5 Mini Low1104±82.8K7.2%<0.1%69 tps3.2s400K$0.25$2.00
11790Gemini 2.5 Flash Lite1103±521.3K6.2%1.3%210 tps0.7s1M$0.10$0.40
11890Grok 3 Fast1102±142.5K4.7%1.7%52 tps2.4s131K$5.00$25.00
11990GPT-4o1102±58.5K3.7%1.0%49 tps2.4s128K$3.71$12.57
12090Step 3.5 Flash1102±248103.6%2.2%109 tps0.6s256K$0.05$0.15
12190DeepSeek V3 03241100±415.1K4.3%5.8%12 tps2.7s164K$0.38$0.93
12290Qwen3 Coder 480B A35B Instruct1099±83.1K4.5%3.3%61 tps2.0s262K$0.71$1.34
12398Gemini 2.5 Flash1098±435.9K3.2%1.3%2 tps3.7s1M$0.30$2.50
12498Grok 31098±419.1K5.5%1.5%53 tps0.6s1M$3.67$18.33
12598DeepSeek V3 0324 Turbo1093±515.5K5.7%6.3%12 tps2.4s164K$0.73$1.79
12698Qwen3 235B A22B1093±64.5K8.0%5.3%71 tps0.9s41K$0.23$0.63
127123Nova Experimental Chat 10-091091±73.2K10.7%<0.1%59 tps6.1s98K$0$0
128123Sherlock Dash Alpha1090±198356.7%<0.1%68 tps0.7s2M$0$0
12998OpenAI o3-pro1090±85.4K4.3%5.2%22 tps70.8s200K$20.00$80.00
13098DeepSeek V3.11089±122.3K4.7%0.8%197 tps0.4s164K$0.55$1.60
13198DeepSeek V3.2 Exp Thinking1089±75.9K3.5%7.2%26 tps3.0s131K$0.28$0.42
132132Claude Sonnet 3.51088±102.9K4.9%1.0%40 tps2.7s200K$3.00$15.00
133132Qwen Plus 0728 (Thinking)1087±91.2K8.9%<0.1%56 tps1.1s1M$0.40$4.00
134105GPT-4.1 mini1087±519.7K4.2%1.1%67 tps0.9s1M$0.34$1.60
135105GPT-4.1 nano1085±517K5.0%0.6%175 tps0.5s1M$0.10$0.40
136105Qwen3 Omni 30B A3B Instruct1085±137754.3%3.9%65 tps1.2s66K$0.35$0.97
137105DeepSeek V3 (Turbo)1082±201.5K5.1%1.5%32 tps1.5s64K$0.40$1.30
138132Solar Pro 2 2507101081±510.6K6.9%<0.1%9 tpsN/A66K$0.50$0.50
139105Seed 1.8 2512281081±103.2K3.1%3.7%41 tps2.1s256K$0.25$2.00
140105Mistral Medium1080±49.6K5.6%1.8%48 tps0.6s33K$1.48$4.55
141105Qwen3 Max Thinking1080±181.5K2.0%13.5%32 tps2.3s256K$1.20$6.00
142112GLM 4.51075±56K7.0%3.7%46 tps1.4s131K$0.43$1.63
143112Kimi K2 09051074±78.7K4.3%4.0%30 tps1.4s262K$0.63$2.39
144112Kimi K2 Fast1073±535K6.4%0.8%365 tps0.5s131K$1.00$3.00
145112GPT-5 (Low)1070±146903.5%1.8%75 tps8.2s400K$1.25$10.00
146112Kimi K2 0905 Turbo1070±67.5K9.1%0.7%373 tps0.5s262K$1.70$6.50
147112gpt-oss-20b1066±67.7K7.1%0.5%216 tps0.5s131K$0.06$0.26
148144Qwen Turbo1064±510K6.0%<0.1%53 tps1.1s1M$0.05$0.20
149112Grok 4.20 Beta Non-reasoning1063±365004.8%1.1%151 tps0.6s2M$2.00$6.00
150119OpenAI o11062±69.9K3.3%4.2%92 tps5.5s200K$15.00$60.00
151119DeepSeek V3.1 Terminus Thinking1061±92.9K9.4%5.9%27 tps1.8s131K$0.56$1.68
152119OpenAI o1-pro1061±206807.5%5.2%33 tps72.8s200K$150.00$600.00
153119Gemini 2.5 Flash Lite Thinking1061±49.8K6.2%1.0%118 tps4.4s1M$0.03$0.13
154151GLM 4.5 FP81060±186108.3%<0.1%59 tps1.2s131K$0.41$1.65
155151Llama 3 8B Turbo1059±246001.6%<0.1%97 tps0.1s8K$0.12$0.13
156119Seed 2.0 Lite (Medium)1058±205253.7%6.6%33 tps1.6s256K$0.25$2.00
157119LongCat Flash Chat1058±122.7K5.9%0.8%85 tps0.9s131K$0.14$0.68
158151OpenAI Codex Mini1057±59.8K3.3%<0.1%46 tps2.1s200K$1.50$6.00
159119GPT-5.1 Codex Mini (Medium)1057±151.9K4.9%4.6%69 tps4.1s400K$0.25$2.00
160119GPT-5.1 Codex Mini (High)1054±152.2K3.9%5.9%70 tps4.6s400K$0.25$2.00
161119Qwen3 32B Fast1052±611.4K5.2%11.6%30 tps3.1s41K$0.10$0.25
162151GLM 4.5 X1051±166455.8%<0.1%48 tps2.8s131K$2.20$8.90
163128ERNIE 4.5 300B A47B1049±413.5K3.9%4.7%23 tps2.3s123K$0.28$1.10
164164Arcee AI Maestro Reasoning1046±73.8K4.6%<0.1%85 tps0.3s131K$0.90$3.30
165128Cogito v2.1 671B1044±191.2K4.6%0.8%85 tps0.5s128K$1.25$1.25
166128Qwen3 32B1044±198506.6%3.9%30 tps3.1s41K$0.12$0.42
167164Grok 4 0709 EU1043±111.3K5.7%<0.1%33 tps8.2s128K$3.00$15.00
168128GLM 4.5 AirX1042±151.1K6.9%3.3%75 tps1.2s131K$1.10$4.50
169128Kimi K2 Thinking1042±103.3K5.1%4.2%61 tps5.9s262K$0.24$1.03
170128OpenAI o4-mini1042±58.5K6.4%1.4%97 tps7.0s128K$1.10$4.40
171164EXAONE Deep 32B1040±148801.7%<0.1%24 tpsN/A33K$0$0
172128Gemini 3.1 Flash Lite Preview Thinking1039±161.4K4.2%1.7%75 tps4.7s1M$0.25$1.50
173164Llama 3 70B Turbo1037±64.3K1.0%<0.1%31 tps0.0s8K$0.73$0.83
174135QwQ 32B1035±411.6K6.4%5.4%41 tps2.1s16K$0.43$0.56
175174Qwen 2.5 72B Turbo1035±226705.0%<0.1%84 tps0.8s131K$0.60$0.60
176135Qwen3 Next 80B A3B Thinking1035±56.2K7.4%0.6%175 tps1.3s256K$0.21$2.26
177135Gemini 2.5 Flash Lite Thinking Preview 09251035±75.8K6.8%1.5%152 tps3.0s1M$0.10$0.40
178135Gemini 3.1 Flash Lite Preview1034±219804.4%1.0%8 tps1.2s1M$0.25$1.50
179135Qwen3 VL 30B A3B Instruct1034±151K6.7%1.8%80 tps2.6s129K$0.18$0.67
180135DeepSeek V31032±517.6K3.7%0.9%69 tps1.1s64K$0.59$1.49
181135DeepSeek V3.2 Speciale1030±102.3K6.1%6.0%43 tps1.4s131K$0.84$1.52
182135Gemini 2.0 Flash Lite1029±514.7K9.5%<0.1%42 tps0.5s1M$0.08$0.30
183174Claude Haiku 3.51028±66.4K4.9%0.8%40 tps2.8s200K$0.80$4.00
184135Amazon Nova 2 Lite1026±103.6K6.0%1.0%137 tps0.6s300K$0.35$2.95
185144Command A1024±422.4K4.8%2.2%42 tps0.8s256K$2.00$7.33
186144DeepSeek V3.1 Nex N11021±195655.0%3.4%24 tps7.2s131K$0.14$0.50
187144OpenAI o31020±75.9K4.0%0.9%85 tps6.8s128K$7.33$29.33
188144Gemini 2.0 Flash1018±78.2K3.8%<0.1%76 tps0.5s1M$0.14$0.56
189189GLM 4.5 Air1016±67.1K6.9%<0.1%22 tps1.4s131K$0.10$0.38
190148Nemotron 3 Nano (Thinking)1012±132K6.7%2.0%200 tps0.5s256K$0$0
191148Qwen3 VL 235B A22B Thinking1009±64.6K8.3%4.3%47 tps3.0s127K$0.47$3.31
192148Qwen3 Coder Plus1007±226104.7%5.1%56 tps2.3s128K$1.80$9.80
193189K2 Think1005±161.4K5.6%<0.1%418 tps2.8sN/A$0$0
194148DeepSeek-R1 Turbo1003±92.5K5.6%2.6%29 tps1.8s64K$2.85$4.75
195195GPT-5 Mini High1002±93K7.7%<0.1%33 tps3.9s400K$0.25$2.00
196148Qwen 2.5 VL 32B Instruct1001±218654.9%6.3%43 tps3.2s128K$0.35$0.62
197148Qwen3 235B A22B Thinking 25071000±102.8K4.4%2.5%53 tps1.6s131K$0.59$5.70
198148OpenAI o3-mini-high999±58.3K4.1%2.4%231 tps10.5s200K$1.10$4.40
199148OpenAI o3-mini999±415K5.5%0.8%143 tps3.3s200K$1.10$4.40
200148OpenAI o4-mini-high995±713.6K6.2%1.9%117 tps15.9s200K$1.10$4.40
201148Seed 1.6 250615995±211.6K6.0%3.1%46 tps2.2s256K$0.25$2.00
202195Arcee AI Virtuoso-Large994±83K5.7%<0.1%64 tps0.5s131K$0.75$1.20
203148Qwen3 30B A3B994±86.3K6.9%5.1%163 tps1.0s41K$0.06$0.21
204195Claude Haiku 3992±112.8K3.0%0.4%62 tps0.5s200K$0.25$1.25
205159GPT-5 Nano989±64.6K8.0%3.2%113 tps20.9s400K$0.05$0.40
206159OpenAI o3-mini-low988±612.2K6.4%0.7%139 tps1.5s200K$1.10$4.40
207159Grok Code Fast 1987±92.5K6.0%5.9%294 tps0.5s256K$0.20$1.50
208159GLM 4.6V986±83K5.5%6.4%21 tps1.8s128K$0.38$0.90
209195Cypher Alpha985±207358.7%<0.1%4 tpsN/A1M$0$0
210195GLM 4.6 FP8982±171.2K11.7%<0.1%56 tps1.8s200K$0.40$1.75
211159Kimi K2 0711981±67K4.5%1.6%29 tps1.3s131K$0.72$2.60
212159Seed 2.0 Mini (Medium)981±355705.8%11.9%33 tps1.7s256K$0.15$0.60
213159Mistral Small 3.1 24B Instruct980±112.9K4.3%7.5%15 tps2.4s131K$0.06$0.18
214159DeepSeek-R1 0528979±65.5K3.5%1.3%93 tps0.5s64K$1.60$3.67
215167DeepSeek V3.1 Thinking976±95.2K9.5%7.1%18 tps1.8s131K$0.23$0.75
216211Grok 4 (Low Reasoning)975±215202.8%<0.1%18 tps9.5s256K$0$0
217167Nemotron 3 Nano974±465806.5%1.3%216 tps0.8s256K$0.05$4.94
218211Arcee AI Coder-Large972±159854.4%<0.1%60 tps1.6s33K$0.50$0.80
219167Qwen 2.5 32B Instruct972±74.1K6.5%2.5%48 tps1.0s131K$0.21$0.25
220219Arcee Coder Large971±73.6K2.6%<0.1%54 tps1.3s33K$0.50$0.80
221167Llama 4 Maverick971±521K5.0%1.2%88 tps2.4s1M$0.23$0.83
222167Mistral Small 3.2 24B970±134.6K4.9%2.8%141 tps0.7s33K$0.02$0.08
223167Pixtral Large969±143.5K3.9%2.5%57 tps1.3s128K$1.50$4.50
224167Qwen3 VL 30B A3B Thinking967±111.9K8.9%4.5%84 tps2.9s127K$0.20$1.47
225167Llama 4 Scout965±517.5K5.3%0.6%88 tps5.1s131K$0.18$0.46
226167Devstral Medium962±113.5K5.2%1.5%77 tps0.6s131K$0.40$2.00
227167Qwen3 14B962±85.3K8.4%1.7%109 tps0.8s41K$0.04$0.15
228167Qwen 2.5 72B960±151.4K4.5%1.2%96 tps1.2s131K$0.14$0.26
229167Llama 3.1 8B Turbo958±142.2K2.0%2.1%650 tps0.5s128K$0.13$0.14
230179Qwen3 30B A3B Thinking 2507953±103.5K4.7%0.5%124 tps1.2s131K$0.16$1.70
231230Magistral Medium952±161.3K10.8%<0.1%95 tps0.5s41K$2.00$5.00
232179Switchpoint Router949±102.7K3.6%1.7%71 tps4.9s131K$0.85$3.40
233179Qwen3 8B948±94.2K8.2%2.4%61 tps1.4s41K$0.02$0.07
234179Ministral 14B 3.0948±168058.5%2.0%119 tps0.5s128K$0.20$0.20
235179Grok 3 Mini Fast943±79K7.0%1.6%44 tps0.5s131K$0.60$4.00
236179ERNIE 4.5 21B A3B943±285406.9%2.3%78 tps1.5s120K$0.05$0.19
237230NVIDIA Llama 3.3 Nemotron Super 49B v1942±93.6K2.3%<0.1%13 tpsN/A131K$0.07$0.20
238179ERNIE 4.5 VL 424B A47B942±187256.5%4.9%36 tps3.5s123K$0.42$1.25
239179NVIDIA Llama 3.3 Nemotron Super 49B v1.5941±191.4K6.9%2.0%50 tps0.6s131K$0.09$0.33
240179DeepSeek-R1939±66.4K4.3%0.8%133 tps0.6s64K$0.91$3.07
241230Jamba 1.7 Mini936±241K8.4%<0.1%84 tps0.9s256K$0.20$0.40
242230Dobby Unhinged Llama 3.3 70B935±198602.8%<0.1%41 tps0.4s128K$0.90$0.90
243230R1 1776935±93.3K4.2%<0.1%61 tps1.0s128K$2.00$8.00
244179DeepSeek Prover v2935±101.3K3.0%5.2%14 tps1.3s164K$0.40$1.56
245189Llama 3.3 70B933±103K6.6%0.3%500 tps0.5s8K$0.48$0.66
246245Llama 3.1 70B Instruct Turbo933±114.1K3.8%<0.1%110 tps0.8s128K$0.88$0.88
247189Codestral931±198556.0%5.2%151 tps0.9s262K$0.15$0.45
248245NVIDIA Llama 3.1 Nemotron 70B928±75.3K2.0%<0.1%9 tps0.1s128K$0.33$0.39
249189Grok 3 Mini927±69.9K6.4%1.2%43 tps0.5s131K$0.30$0.50
250189Mistral Small 3.1925±142.4K4.0%7.4%13 tps2.6s32K$0.17$0.28
251189Jamba 1.6 Large924±93.3K3.8%2.0%59 tps1.2s256K$1.33$5.33
252245Grok 3 Mini Beta922±141.8K1.9%<0.1%75 tps0.5s131K$0.45$2.25
253245GLM Z1 32B921±101.9K10.1%<0.1%18 tps9.3s33K$0.09$0.11
254245GPT-5 Nano Minimal920±131.4K10.8%<0.1%88 tps0.8s400K$0.05$0.40
255245Solar Pro 2 250710 (Reasoning)919±102.6K3.9%<0.1%9 tpsN/A66K$0.50$0.50
256189Llama 3.3 Swallow 70B Instruct919±83.5K5.5%1.4%153 tps1.3s131K$0.13$0.39
257189Open Mistral Nemo918±211.7K4.5%1.5%171 tps0.5s131K$0.15$0.15
258189Jamba 1.7 Large917±169758.0%1.3%58 tps1.0s256K$1.33$5.33
259189Devstral Small916±171.4K5.0%2.4%180 tps0.6s131K$0.10$0.30
260189Rnj-1 Instruct915±219706.7%0.6%103 tps0.3s33K$0.15$0.15
261189Seed 1.6 Flash 250715913±161.2K5.7%2.5%108 tps1.6s256K$0.07$0.30
262245Solar Pro 3 (Reasoning)913±185954.8%3.2%118 tps1.2s131K$0.15$0.60
263189Inception Mercury Coder Small Beta912±206103.2%1.7%270 tps1.4s32K$0.25$1.00
264264Arcee AI Spotlight910±84.6K4.3%<0.1%121 tps0.4s131K$0.18$0.18
265264YouTube910±132K5.5%<0.1%34 tps2.7s32K$0.99$0.99
266264OLMo 3 32B Think910±254706.0%<0.1%84 tps0.6s66K$0.15$0.50
267201Magistral Small 2506908±114.3K3.1%1.6%156 tps0.5s40K$0.37$1.10
268201GPT-3.5 Turbo908±151.2K2.5%1.3%74 tps0.9s16K$0.75$1.75
269264Fauna Fox908±103.4K8.2%<0.1%194 tps0.3s128K$0.04$0.15
270201Llama 3 8B904±103.2K3.5%6.0%85 tps0.7s8K$0.12$0.16
271201Mistral Small 3.2 24B Instruct903±208508.6%1.9%113 tps1.1s131K$0.02$0.08
272264DeepSeek R1T Chimera901±92.9K6.7%<0.1%46 tps1.1s164K$0.09$0.36
273264Grok 2901±72.3K2.7%<0.1%55 tps1.1s131K$2.00$10.00
274201GPT-4o mini900±142.8K5.3%2.1%71 tps1.7s128K$0.15$0.60
275201Moonshot V1 Auto899±229304.1%1.2%54 tps1.5s8K$2.00$5.00
276264Llama 3.1 405B Instruct Turbo896±112K3.9%<0.1%26 tps0.8s131K$3.50$3.50
277264Arcee AI Virtuoso-Medium896±122K2.6%<0.1%3 tpsN/A131K$0.50$0.80
278201Amazon Nova Pro 1.0894±105.7K4.0%0.9%96 tps0.7s300K$0.80$1.70
279264Exaone 3.5 32B Instruct893±216503.0%<0.1%17 tpsN/A33K$0$0
280201GLM 4.6V Flash892±102.5K7.6%3.7%64 tps2.1s128K$0.04$0.40
281280Venice Uncensored891±296156.1%<0.1%59 tps3.9s33K$0$0
282280Arcee AI Blitz889±83K2.1%<0.1%6 tpsN/A33K$0.45$0.75
283280Gemini 1.5 Pro887±82.3K2.3%<0.1%15 tps0.0s2M$0.78$3.13
284201Llama 3.2 11B Instruct885±152.1K4.1%1.5%152 tps0.5s8K$0.16$0.16
285210Magistral Medium 2509883±162.6K9.5%4.0%58 tps0.9s131K$2.00$5.00
286280AFM 4.5B Preview882±162.5K3.1%<0.1%32 tps0.0s66K$0$0
287210Gemma 3n E4B882±76K4.5%2.0%30 tps0.5s8K$0.01$0.02
288280Refuel LLM 2 Small881±74.2K3.9%<0.1%116 tps0.5s8K$0.20$0.20
289210Qwen3 4B880±85.1K9.6%1.9%94 tps1.5s128K$0.01$0.01
290280Magistral Small 2507880±1973013.1%<0.1%148 tps0.4s41K$0.50$1.50
291210Mistral Small 3 24B Instruct880±101.7K3.6%2.6%77 tps0.6s33K$0.07$0.14
292210Moonshot V1 128k879±191.1K4.6%1.4%54 tps1.5s131K$2.00$5.00
293210Inception Mercury878±56.9K3.7%0.4%257 tps1.1s32K$0.25$1.00
294210DeepSeek R1T2 Chimera876±102.1K5.9%3.0%28 tps1.8s164K$0.13$0.45
295210Mistral Medium 3875±234856.7%2.4%47 tps0.8s33K$0.40$2.00
296210Mistral Nemo875±159152.7%<0.1%112 tps0.4s131K$0.07$0.13
297210Solar Mini 250422874±171.3K5.9%1.8%90 tps1.7s33K$0.15$0.15
298210GLM 4.7 Flash871±286104.7%5.8%61 tps2.8s128K$0.07$0.39
299210Mixtral 8x22B871±221.3K5.0%1.2%140 tps0.6s64K$2.00$6.00
300293Devstral Small 2505871±151.7K6.2%<0.1%141 tps1.3s33K$0.03$0.09
301210Qwen 2.5 7B Turbo870±256156.1%0.5%125 tps0.4s131K$0.30$0.30
302293AFM 4.5B869±74.4K8.9%<0.1%81 tps0.3s66K$0.05$0.20
303293Claude Sonnet 3869±179001.6%<0.1%35 tps1.0s200K$3.00$15.00
304210Krutrim Spectre V2868±161.3K3.6%<0.1%33 tps3.1s4K$0.19$0.19
305210GLM 4 32B868±122.9K4.9%2.6%40 tps1.6s33K$0.14$0.14
306210Gemma 3 12B867±112.5K4.9%4.2%73 tps0.8s131K$0.05$0.12
307210Hermes 2 Pro Llama 3 8B864±211.8K2.5%<0.1%76 tps1.0s131K$0.08$0.09
308210Mistral Small 24B Instruct864±161.5K4.1%1.5%84 tps0.4s33K$0.80$0.80
309210Moonshot V1 8k863±139155.2%1.0%55 tps1.5s8K$0.20$2.00
310293Hermes 2 Mixtral 8x7B DPO863±171.2K1.3%<0.1%1 tpsN/A33K$0.60$0.60
311210Qwen 2.5 14B Instruct861±162.4K5.7%2.4%40 tps1.6s1M$0.40$1.61
312312Yi Large858±121.5K<0.1%<0.1%34 tpsN/A33K$1.50$1.50
313312DeepSeek-R1 0528 Qwen3 8B856±84.9K6.5%<0.1%45 tps2.4s128K$0.05$0.09
314312Command Light856±161.1K4.9%<0.1%23 tpsN/A4K$0.10$0.20
315210Gemma 3 27B856±271.1K6.9%1.8%35 tps1.1s66K$0.06$0.10
316210Mixtral 8x7B855±181.3K5.1%2.2%142 tps0.6s33K$0.23$0.23
317210Ministral 3B 2512854±575158.0%2.8%339 tps0.6s131K$0.10$0.10
318210Mixtral 8x7B Instruct854±161.4K4.4%0.2%79 tps0.7s33K$0.23$0.31
319234Gemma 3 27B IT853±102.3K3.9%2.0%60 tps0.8s128K$0.17$0.29
320234Jamba 1.5 Large851±92.9K4.0%1.7%48 tps0.9s256K$1.50$6.00
321234Llama 3.3 70B Instruct Turbo851±191.2K6.0%2.0%78 tps1.0s131K$0.88$0.88
322234Command R 7B849±153.3K4.8%1.1%76 tps0.4s128K$0.04$0.15
323312Wikipedia846±79.8K4.9%<0.1%47 tps2.1s32K$0$0
324324MAI-DS-R1842±73.5K11.7%<0.1%73 tps3.2s64K$0.10$0.40
325234GPT-3.5 Turbo 16k838±102.7K3.6%<0.1%22 tps0.6s16K$3.00$4.00
326324Cogito V2 671B838±171.6K5.9%<0.1%41 tps0.6s164K$1.25$1.25
327234ERNIE 4.5 21B A3B Thinking838±231.1K6.9%1.8%87 tps1.5s120K$0.07$0.28
328240DeepSeek-R1 Distill Llama 70B835±93.4K5.2%3.6%27 tps1.6s32K$0.73$0.95
329324Typhoon 2 70B Instruct835±151.4K4.0%<0.1%19 tps0.1s8K$0.88$0.88
330240GLM 4.5 Flash834±375208.8%12.2%15 tps2.2s131K$0$0
331324OLMo 2 0425 1B Instruct833±215602.6%<0.1%68 tps0.0s4K$0$0
332324NVIDIA Llama 3.1 Nemotron Ultra 253B v1832±162.2K4.1%<0.1%40 tps0.8s128K$0.30$0.90
333240Mixtral-8x7B Instruct v0.1832±231.3K4.6%1.3%54 tps0.4s33K$0.60$0.60
334240Qwen 2.5 7B831±172K5.1%3.7%40 tps1.9s131K$0.08$0.27
335240Sky T1 32B Preview829±142.4K4.5%7.8%73 tps0.6s16K$0.12$0.18
336240LFM2 2.6B826±2681010.0%6.7%184 tps0.4s33K$0.01$0.02
337240Krutrim 2825±102.3K2.3%12.5%33 tps2.1s128K$1.00$1.00
338240Ministral 8B825±172.2K5.5%1.4%177 tps0.4s128K$0.14$0.14
339240C4AI Aya Expanse 32B821±73.8K4.0%1.5%43 tps0.5s128K$0.50$1.50
340240Moonshot V1 32k820±179503.1%1.4%53 tps1.4s33K$1.00$3.00
341240LFM2 8B A1B818±1882511.3%<0.1%142 tps0.3s33K$0.01$0.02
342240Gemma 2 27B815±171.5K4.1%1.4%44 tps1.4s8K$0.80$0.80
343337GLM 4.1V 9B Thinking813±161.1K4.2%<0.1%69 tps1.3s66K$0.04$0.14
344252Ministral 3B806±162.3K5.1%0.8%248 tps0.4s131K$0.08$0.08
345337Qwen 2 72B Instruct805±191K3.3%<0.1%3 tpsN/A33K$0.90$0.90
346346Magistral Medium (Thinking)804±102.2K5.7%<0.1%67 tps0.8s41K$2.00$5.00
347252Magistral Small 2509802±181.8K7.5%2.7%116 tps0.6s131K$0.50$1.50
348252Gemma 3 1B802±112K6.1%0.6%176 tps1.0s33K$0.06$0.10
349252WizardLM-2 8x22B801±121.9K3.1%11.6%11 tps2.5s66K$0.77$0.77
350252Phi 4798±161.7K3.4%5.1%28 tps1.3s128K$0.10$0.32
351252Hermes 4 405B FP8797±218158.4%3.5%31 tps0.9s131K$0.52$1.73
352346Magistral Medium 2507795±2566514.2%<0.1%86 tps0.7s41K$2.00$5.00
353252Mercury Coder793±275103.8%<0.1%247 tps2.2s32K$0.25$1.00
354252GPT-3.5 Turbo Instruct787±92K2.7%<0.1%46 tps1.2s4K$1.50$2.00
355252Mistral Large785±161.1K5.8%1.5%54 tps0.7s33K$2.00$6.00
356252Hermes 4 70B781±294608.9%1.1%67 tps0.6s131K$0.12$0.39
357262Command R778±182.2K4.9%5.8%54 tps0.6s128K$0.30$0.99
358262Baichuan-M2-32B770±3074010.8%<0.1%32 tps3.3s131K$0.07$0.07
359262Mistral Small770±121.2K4.5%1.7%142 tps0.6s32K$0.43$1.30
360354OLMo 3 7B Think763±217707.8%4.2%77 tps0.4s66K$0.12$0.20
361262Open Mistral 7B762±181.3K4.7%0.7%176 tps0.4s33K$0.25$0.25
362262Hermes 4 405B Reasoning FP8759±112.7K12.8%3.6%32 tps0.8s131K$1.00$3.00
363361DeepSeek-R1 Distill Qwen 14B756±161.9K6.3%<0.1%44 tps1.7s64K$0.63$0.63
364262Goliath 120B754±247455.7%2.7%21 tps2.2s6K$6.56$9.38
365361Seed Coder 8B Instruct751±226052.4%<0.1%35 tpsN/A32K$0.99$0.99
366262Qwen 2.5 VL 72B Instruct746±202.1K6.0%5.3%25 tps3.7s128K$1.01$2.79
367269Gemma 3 4B742±103.3K4.7%1.3%138 tps0.7s131K$0.02$0.04
368269Mixtral 8x22B Instruct738±171.4K5.6%1.8%142 tps0.7s66K$0.45$0.45
369269Command R+738±151.6K5.6%2.8%36 tps0.7s128K$2.08$9.45
370269Inflection 3 Productivity737±241.5K5.0%0.6%50 tps3.2s8K$2.50$10.00
371361Command734±187654.4%<0.1%25 tpsN/A4K$0.83$1.33
372361Meridian734±399659.8%<0.1%92 tps1.2s131K$0$0
373361Zenith730±288509.6%<0.1%36 tps1.8s131K$0$0
374374Mistral Nemo 12B Celeste V1.9725±181.1K3.5%<0.1%6 tps10.2s8K$0.80$1.20
375269Pixtral 12B722±213K6.3%2.2%101 tps1.2s131K$0.08$0.08
376374Solar Pro 250422720±195306.2%<0.1%13 tps0.6s33K$0$0
377269Inflection 3 Pi719±181.5K4.1%1.1%33 tps3.4s8K$2.50$10.00
378374Mythalion 13B709±101.1K1.3%<0.1%63 tps0.5s4K$0.56$1.13
379269DeepHermes 3 Mistral 24B Preview706±307155.9%2.5%50 tps1.0s33K$0.06$0.25
380276Hermes 3 405B Instruct702±201.4K4.1%2.3%20 tps1.1s131K$0.80$0.80
381374Dolphin 3.0 R1 Mistral 24B701±168907.8%<0.1%13 tps0.1s33K$0.03$0.09
382374Phi 4 Multimodal Instruct697±162.1K6.8%<0.1%17 tps1.4s128K$0.03$0.05
383276DeepSeek-R1 Distill Qwen 32B696±202K5.5%6.2%22 tps1.8s131K$0.37$0.39
384383ArliAI QwQ 32B Arliai RpR V1686±406359.3%<0.1%34 tps1.8s33K$0.02$0.07
385276MiniMax M1686±133.8K5.3%<0.1%31 tps2.8s1M$0.55$2.20
386386MiMo 7B RL655±131.2K3.5%<0.1%31 tps0.4s32K$0.49$0.49
387386Dolphin 2.9.2 Mixtral 8x22B652±191.1K2.6%<0.1%20 tps1.5s16K$0.90$0.90
388386DeepSeek-R1 Distill Qwen 7B633±195655.0%<0.1%0 tpsN/A131K$0.05$0.10
389386Shisa V2 Llama 3.3 70B623±245859.3%<0.1%8 tps2.0s33K$0.03$0.09
390279UI-TARS 1.5 7B610±4053011.7%4.0%75 tps0.9s128K$0.10$0.20
391390ERNIE 4.5 0.3B602±4068511.0%<0.1%85 tps2.2s120K$0$0
392390Llema 7B601±218504.5%<0.1%1 tps15.0s4K$0.80$1.20
393279MythoMax L2 13B600±212.3K5.8%1.2%22 tps1.1s4K$0.18$0.18
394279Phi 4 Mini Instruct599±211K7.1%7.4%40 tps1.1s128K$0.07$0.30
395390DeepSeek-R1 Distill Llama 8B595±191.2K5.6%<0.1%17 tpsN/A32K$0.04$0.04
396279Hunyuan A13B Instruct588±221.6K9.2%2.3%67 tps2.0s33K$0.01$0.01
397279Phi 4 Reasoning573±172.1K5.5%21.0%29 tps1.0s33K$0.06$0.25
398284Qwen 2.5 VL 3B Instruct523±254.1K6.1%3.0%44 tps2.5s128K$0.21$0.63
399399DeepSeek-R1 Distill Qwen 1.5B481±197305.2%<0.1%20 tps0.0s131K$0.18$0.18
400284CodeLlama 7B Instruct Solidity463±544858.5%3.6%33 tps0.7s16K$0.80$1.20
401399Mistral Nemo 12B Inferor v0.0454±285651.7%<0.1%83 tps0.8s16K$0.80$1.20
402286Phi 4 Mini Reasoning447±153.4K12.0%9.7%30 tps0.9s128K$0.07$0.30
403402QwQ 32B RpR v1406±351K10.9%<0.1%34 tps3.3s33K$0.02$0.07
404404Seed Coder 8B Reasoning329±417004.1%<0.1%25 tpsN/A32K$0.99$0.99
Show Less