Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1596
Claude Opus 4.6
1594
Claude Sonnet 4.6
1594
GPT-5.4
1566
Claude Opus 4.6 (Thinking)
1506
Claude Sonnet 4.6 (Thinking)
1464
GPT-5.4 (High)
1446
Claude Opus 4.5 (Thinking)
1418
Gemini 3.1 Pro
1409
Claude Opus 4.5
1381
GPT-5.3 Codex (High)
1362
Claude Sonnet 4.5 (Thinking)
1358
GPT-5.2 Instant
1353
Claude Haiku 4.5 (Extended Thinking)
1352
Claude Opus 4 (Thinking)
1340
GPT-5.2

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.61596±621.6K1.1%2.1%48 tps1.7s200K$5.00$25.00
21Claude Sonnet 4.61594±1015.7K1.4%1.6%47 tps1.2s200K$3.00$15.00
31GPT-5.41594±144.4K1.6%2.6%55 tps0.8s1M$2.50$15.00
44Claude Opus 4.6 (Thinking)1566±816.5K1.6%2.5%56 tps1.6s200K$5.00$25.00
55Claude Sonnet 4.6 (Thinking)1506±816.2K3.5%4.7%57 tps1.1s200K$3.00$15.00
66GPT-5.4 (High)1464±124.9K3.9%4.6%68 tps7.9s1M$2.50$15.00
76Claude Opus 4.5 (Thinking)1446±460.8K1.9%1.8%49 tps1.4s200K$5.00$25.00
87Gemini 3.1 Pro1418±922K2.5%3.5%35 tps4.1s1M$2.00$12.00
97Claude Opus 4.51409±515.1K2.2%1.5%45 tps1.5s200K$5.00$25.00
109GPT-5.3 Codex (High)1381±93.2K1.2%2.0%61 tps17.8s400K$1.75$14.00
1110Claude Sonnet 4.5 (Thinking)1362±458.2K3.3%1.9%44 tps1.1s200K$3.00$15.00
1210GPT-5.2 Instant1358±615.7K3.3%1.7%52 tps2.0s400K$1.75$14.00
1312Claude Haiku 4.5 (Extended Thinking)1353±414.3K3.8%1.4%115 tps0.7s200K$1.00$5.00
1413Claude Opus 4 (Thinking)1352±52.6K2.6%<0.1%28 tps1.3s200K$15.00$75.00
1513GPT-5.21340±811.3K3.2%4.1%18 tps2.7s400K$1.75$14.00
1613Gemini 3 Pro1337±559.4K2.6%2.1%50 tps3.6s1M$2.00$12.00
1715GLM 51324±1411.7K3.3%3.4%36 tps2.7s200K$0.72$2.55
1815GPT-5.11319±712.9K3.4%2.3%71 tps1.4s400K$1.42$11.33
1917Claude Sonnet 4.51307±320.9K5.0%1.4%41 tps1.3s200K$1.80$9.00
2017GPT-5.2 (High)1297±830.7K2.8%6.7%18 tps16.3s400K$1.75$14.00
2121GPT-5.1 (Medium)1291±93.2K6.4%<0.1%86 tps3.8s400K$0.83$6.67
2219Gemini 3 Pro (Low)1291±611.9K4.2%2.4%51 tps3.5s1M$2.00$12.00
2319GPT-5.1 (High)1290±619.1K3.5%3.2%76 tps6.9s400K$1.25$10.00
2419Gemini 3 Flash Preview Thinking1286±632.7K3.3%1.6%3 tps6.2s1M$0.50$3.00
2519Claude Haiku 4.51283±316.4K4.5%1.1%100 tps0.9s200K$1.00$5.00
2619MiniMax M2.51283±285103.8%1.4%70 tps1.9s205K$0.28$1.20
2719GPT-5.3 Codex (Medium)1278±271.1K2.3%2.3%62 tps10.3s400K$1.75$14.00
2829Claude Opus 41274±412.4K2.7%<0.1%25 tps1.5s200K$15.00$75.00
2929Claude Opus 4.1 (Thinking)1272±57.7K5.2%<0.1%20 tps3.9s200K$15.00$75.00
3019GPT-5.3 Instant1271±124.2K2.5%0.9%63 tps0.8s400K$1.75$14.00
3127Claude Sonnet 4 (Thinking)1261±325.9K2.9%1.5%52 tps1.5s200K$3.00$13.67
3227GPT-5 Codex (High)1260±718.5K3.3%3.2%122 tps7.1s400K$1.25$10.00
3327GPT-5 (High)1259±416.2K3.5%4.5%81 tps35.9s400K$1.25$10.00
3427GPT-5.2 Codex (High)1257±123.1K2.8%8.8%41 tps12.9s400K$1.75$14.00
3536Claude Opus 4.11254±47.1K4.6%3.0%17 tps3.7s200K$15.00$75.00
3631GPT-5.1 Codex (High)1240±837K3.3%3.2%96 tps3.9s400K$1.25$10.00
3731Grok 4.1 Fast Non-Reasoning1239±69.4K5.4%0.9%101 tps0.5s2M$0.20$0.50
3831GPT-5 Chat1231±435K4.5%1.3%95 tps0.9s400K$1.25$10.00
3937Nova Experimental Chat 11-101230±85.2K6.3%0.4%84 tps8.9s98K$0$0
4037Polaris Alpha1226±147555.6%<0.1%48 tps1.1s256K$0$0
4144GPT-4.5 Preview1223±72.5K1.8%<0.1%36 tps3.0s200K$75.00$150.00
4244Nova Experimental Chat 10-201221±54.4K8.1%<0.1%30 tps0.5s98K$0$0
4336GPT-5.2 (Extra High) 1221±98K3.5%13.2%17 tps20.5s400K$1.75$14.00
4436Qwen3 VL 235B A22B Instruct1220±75.6K6.7%3.1%75 tps1.9s129K$0.37$1.81
4536GPT-5 Codex (Medium)1214±68.8K3.9%4.1%122 tps5.2s400K$1.25$10.00
4636GPT-5.2 Codex (Medium)1211±122.4K3.0%5.7%37 tps6.3s400K$1.75$14.00
4753Claude Sonnet 3.7 (Thinking)1210±313.6K3.1%<0.1%41 tps2.6s200K$3.00$15.00
4853Mistral Medium 3.11206±516.4K5.1%<0.1%77 tps0.7s128K$0.40$2.00
4943Claude Sonnet 41205±343.2K3.7%1.8%49 tps1.3s200K$3.00$15.00
5043Gemini 3 Flash Preview1205±117.2K3.7%1.3%138 tps1.4s1M$0.50$3.00
5143Gemini 2.5 Pro High1204±321.1K5.7%1.5%48 tps2.3s1M$1.25$10.00
5243Qwen3 Max Instruct Preview1203±616.1K4.6%1.1%31 tps1.7s256K$1.43$6.61
5358Claude Sonnet 3.71201±412.1K3.2%<0.1%39 tps1.6s200K$3.00$15.00
5443GPT-5.1 Codex Max1200±126.4K3.9%3.0%118 tps4.1s400K$1.25$10.00
5543MiniMax M2.1 Lightning1197±238753.3%1.7%52 tps2.1s205K$0.30$2.40
5649Qwen3 30B A3B Instruct 25071194±512.7K5.7%1.2%55 tps1.3s131K$0.13$0.72
5762OpenAI o1-mini1192±415K4.6%<0.1%118 tpsN/A128K$1.13$4.51
5849MiniMax M2.11192±819.4K3.6%2.1%66 tps2.6s205K$0.30$1.20
5949DeepSeek V3.21189±85.1K4.7%1.4%83 tps5.1s131K$0.43$1.09
6062Qwen Plus 07281189±82.1K7.5%<0.1%55 tps0.9s1M$0.40$1.20
6149MiniMax M2.5 FP81185±176103.2%3.6%33 tps1.7s205K$0.45$1.75
6249GPT-51185±421.3K5.3%3.1%78 tps23.1s400K$1.25$9.67
6349Grok 4 Fast Non-Reasoning1185±58.1K7.1%1.5%93 tps0.6s2M$0.27$0.67
6449MiniMax M21183±519.7K4.2%2.2%39 tps2.3s205K$0.21$0.85
6549Nova Experimental Chat 12-101182±92.9K3.8%2.4%84 tps12.9s98K$0$0
6649GLM 4.61182±717.2K4.4%5.4%39 tps1.5s200K$0.42$1.66
6749GPT-5.3 Codex (Low)1178±285101.0%1.8%61 tps4.3s400K$1.75$14.00
6860Grok 4.1 Fast Reasoning1178±739.5K4.4%1.5%58 tps7.3s2M$0.20$0.50
6960Grok 4 Fast Reasoning1177±314.5K5.0%2.1%102 tps3.1s2M$0.30$0.75
7060Gemini 2.5 Pro1176±337.9K4.8%2.3%45 tps2.6s1M$1.25$10.00
7175Gemini 2.5 Flash Thinking Preview 09251173±79.2K6.8%<0.1%111 tps4.7s1M$0.30$2.50
7260Qwen3 235B A22B Instruct 25071172±412.6K6.4%6.8%13 tps1.9s262K$0.13$0.52
7360Claude Sonnet 3.5 v21171±65.5K3.4%<0.1%46 tps1.4s200K$3.00$15.00
7460GPT-5.1 Codex (Medium)1171±143K3.2%4.6%71 tps3.7s400K$1.25$10.00
7560GPT-5.1 Instant1171±88.3K4.1%1.3%50 tps1.9s400K$1.25$10.00
7675Gemini 2.5 Pro Low1170±49.6K8.1%<0.1%89 tps2.4s1M$1.25$10.00
7760Grok 4.20 Beta Reasoning1167±221.2K4.1%1.1%77 tps4.5s2M$2.00$5.50
7869Qwen3.5 35B A3B1164±258653.9%2.1%116 tps2.1s256K$0.63$1.13
7969GPT-5 Codex (Low)1163±105K4.1%2.7%112 tps3.5s400K$1.25$10.00
8069GLM 4.71161±716.8K3.7%5.8%40 tps1.5s200K$0.77$1.73
8186Gemini 2.5 Flash Preview1161±83K1.1%<0.1%138 tps6.9s1M$0.15$0.60
8286GPT-5 (Minimal)1158±58.3K7.4%<0.1%67 tps1.4s400K$1.25$10.00
8369DeepSeek V3.1 Terminus Chat1158±56.5K6.9%3.4%27 tps1.5s131K$0.86$1.80
8474Qwen Plus (Aug'24)1146±517.2K4.7%1.4%53 tps1.3s30K$0.40$1.20
8574Qwen3.5 397B A17B1142±142.5K2.9%4.3%57 tps1.4s256K$0.52$3.00
8674Gemini 2.5 Flash Preview 09251140±67.6K6.0%1.2%5 tps0.9s1M$0.13$0.97
8793Gemini 2.5 Flash Preview Thinking1136±101.4K1.8%<0.1%26 tps1.8s1M$0.15$1.76
8877GPT-5 Mini1131±58.6K5.4%2.6%66 tps14.2s400K$0.25$2.00
8977DeepSeek V3.1 Turbo1130±74.8K5.3%0.9%173 tps1.3s164K$2.00$3.75
9077Grok 4.20 Multi Agent Beta1129±199453.6%1.2%56 tps8.8s2M$2.00$6.00
9177Qwen3 Max Thinking Preview1127±106.3K5.7%3.1%40 tps2.1s256K$1.20$6.00
9277Grok 41125±339.6K4.4%3.9%29 tps11.1s256K$3.00$15.00
9397Ministral 8B 25121125±155107.3%<0.1%174 tps0.5s128K$0.15$0.15
9477GPT-4.11123±532.8K5.2%3.7%112 tps1.3s1M$2.00$8.00
9577Gemini 2.5 Flash Lite Preview 09251122±78.5K6.6%1.2%209 tps0.7s1M$0.25$0.35
9697Gemini 2.5 Pro Preview 06051121±101.7K2.3%<0.1%0 tps3.7s1M$1.25$10.00
9785Gemini 2.5 Flash Thinking1118±413.7K3.6%2.2%88 tps6.4s1M$0.30$2.50
9885GPT-5 Mini Minimal1114±123.2K8.5%1.2%63 tps1.4s400K$0.25$2.00
9985GPT-5.2 Codex (Low)1113±191.2K3.2%4.5%41 tps5.0s400K$1.75$14.00
100108Gemini 2.5 Pro Preview 03251111±111.5K3.2%<0.1%3 tps16.6s1M$1.25$10.00
10185DeepSeek V3.1 Chat1110±74.9K6.6%2.8%21 tps1.6s131K$0.38$1.00
10285Qwen3 Omni 30B A3B Thinking1110±102.3K6.0%3.7%67 tps1.2s66K$0.97$1.79
10390Qwen Max1107±418.3K4.2%1.5%49 tps1.5s33K$1.60$6.40
104114GPT-5 Mini Low1104±82.8K7.2%<0.1%69 tps3.2s400K$0.25$2.00
10590Gemini 2.5 Flash Lite1103±521.3K6.2%1.3%210 tps0.7s1M$0.10$0.40
10690Grok 3 Fast1102±142.5K4.7%1.7%52 tps2.4s131K$5.00$25.00
10790GPT-4o1102±58.5K3.7%1.0%49 tps2.4s128K$3.71$12.57
10890DeepSeek V3 03241100±415.1K4.3%5.8%12 tps2.7s164K$0.38$0.93
10990Qwen3 Coder 480B A35B Instruct1099±83.1K4.5%3.3%61 tps2.0s262K$0.71$1.34
11098Gemini 2.5 Flash1098±435.9K3.2%1.3%2 tps3.7s1M$0.30$2.50
11198Grok 31098±419.1K5.5%1.5%53 tps0.6s1M$3.67$18.33
11298DeepSeek V3 0324 Turbo1093±515.5K5.7%6.3%12 tps2.4s164K$0.73$1.79
113123Nova Experimental Chat 10-091091±73.2K10.7%<0.1%59 tps6.1s98K$0$0
114123Sherlock Dash Alpha1090±198356.7%<0.1%68 tps0.7s2M$0$0
11598OpenAI o3-pro1090±85.4K4.3%5.2%22 tps70.8s200K$20.00$80.00
11698DeepSeek V3.11089±122.3K4.7%0.8%197 tps0.4s164K$0.55$1.60
117132Claude Sonnet 3.51088±102.9K4.9%1.0%40 tps2.7s200K$3.00$15.00
118132Qwen Plus 0728 (Thinking)1087±91.2K8.9%<0.1%56 tps1.1s1M$0.40$4.00
119105GPT-4.1 mini1087±519.7K4.2%1.1%67 tps0.9s1M$0.34$1.60
120105GPT-4.1 nano1085±517K5.0%0.6%175 tps0.5s1M$0.10$0.40
121105Qwen3 Omni 30B A3B Instruct1085±137754.3%3.9%65 tps1.2s66K$0.35$0.97
122105DeepSeek V3 (Turbo)1082±201.5K5.1%1.5%32 tps1.5s64K$0.40$1.30
123132Solar Pro 2 2507101081±510.6K6.9%<0.1%9 tpsN/A66K$0.50$0.50
124105Seed 1.8 2512281081±103.2K3.1%3.7%41 tps2.1s256K$0.25$2.00
125105Mistral Medium1080±49.6K5.6%1.8%48 tps0.6s33K$1.48$4.55
126105Qwen3 Max Thinking1080±181.5K2.0%13.5%32 tps2.3s256K$1.20$6.00
127112GLM 4.51075±56K7.0%3.7%46 tps1.4s131K$0.43$1.63
128112Kimi K2 09051074±78.7K4.3%4.0%30 tps1.4s262K$0.63$2.39
129112GPT-5 (Low)1070±146903.5%1.8%75 tps8.2s400K$1.25$10.00
130112Kimi K2 0905 Turbo1070±67.5K9.1%0.7%373 tps0.5s262K$1.70$6.50
131144Qwen Turbo1064±510K6.0%<0.1%53 tps1.1s1M$0.05$0.20
132112Grok 4.20 Beta Non-reasoning1063±365004.8%1.1%151 tps0.6s2M$2.00$6.00
133119OpenAI o11062±69.9K3.3%4.2%92 tps5.5s200K$15.00$60.00
134119OpenAI o1-pro1061±206807.5%5.2%33 tps72.8s200K$150.00$600.00
135119Gemini 2.5 Flash Lite Thinking1061±49.8K6.2%1.0%118 tps4.4s1M$0.03$0.13
136151GLM 4.5 FP81060±186108.3%<0.1%59 tps1.2s131K$0.41$1.65
137119Seed 2.0 Lite (Medium)1058±205253.7%6.6%33 tps1.6s256K$0.25$2.00
138119LongCat Flash Chat1058±122.7K5.9%0.8%85 tps0.9s131K$0.14$0.68
139151OpenAI Codex Mini1057±59.8K3.3%<0.1%46 tps2.1s200K$1.50$6.00
140119GPT-5.1 Codex Mini (Medium)1057±151.9K4.9%4.6%69 tps4.1s400K$0.25$2.00
141119GPT-5.1 Codex Mini (High)1054±152.2K3.9%5.9%70 tps4.6s400K$0.25$2.00
142151GLM 4.5 X1051±166455.8%<0.1%48 tps2.8s131K$2.20$8.90
143128ERNIE 4.5 300B A47B1049±413.5K3.9%4.7%23 tps2.3s123K$0.28$1.10
144164Arcee AI Maestro Reasoning1046±73.8K4.6%<0.1%85 tps0.3s131K$0.90$3.30
145128Cogito v2.1 671B1044±191.2K4.6%0.8%85 tps0.5s128K$1.25$1.25
146128Qwen3 32B1044±198506.6%3.9%30 tps3.1s41K$0.12$0.42
147164Grok 4 0709 EU1043±111.3K5.7%<0.1%33 tps8.2s128K$3.00$15.00
148128GLM 4.5 AirX1042±151.1K6.9%3.3%75 tps1.2s131K$1.10$4.50
149128Kimi K2 Thinking1042±103.3K5.1%4.2%61 tps5.9s262K$0.24$1.03
150128OpenAI o4-mini1042±58.5K6.4%1.4%97 tps7.0s128K$1.10$4.40
151164EXAONE Deep 32B1040±148801.7%<0.1%24 tpsN/A33K$0$0
152128Gemini 3.1 Flash Lite Preview Thinking1039±161.4K4.2%1.7%75 tps4.7s1M$0.25$1.50
153135Qwen3 Next 80B A3B Thinking1035±56.2K7.4%0.6%175 tps1.3s256K$0.21$2.26
154135Gemini 2.5 Flash Lite Thinking Preview 09251035±75.8K6.8%1.5%152 tps3.0s1M$0.10$0.40
155135Gemini 3.1 Flash Lite Preview1034±219804.4%1.0%8 tps1.2s1M$0.25$1.50
156135DeepSeek V3.2 Speciale1030±102.3K6.1%6.0%43 tps1.4s131K$0.84$1.52
157135Gemini 2.0 Flash Lite1029±514.7K9.5%<0.1%42 tps0.5s1M$0.08$0.30
158174Claude Haiku 3.51028±66.4K4.9%0.8%40 tps2.8s200K$0.80$4.00
159135Amazon Nova 2 Lite1026±103.6K6.0%1.0%137 tps0.6s300K$0.35$2.95
160144DeepSeek V3.1 Nex N11021±195655.0%3.4%24 tps7.2s131K$0.14$0.50
161144OpenAI o31020±75.9K4.0%0.9%85 tps6.8s128K$7.33$29.33
162144Gemini 2.0 Flash1018±78.2K3.8%<0.1%76 tps0.5s1M$0.14$0.56
163189GLM 4.5 Air1016±67.1K6.9%<0.1%22 tps1.4s131K$0.10$0.38
164148Qwen3 VL 235B A22B Thinking1009±64.6K8.3%4.3%47 tps3.0s127K$0.47$3.31
165148Qwen3 Coder Plus1007±226104.7%5.1%56 tps2.3s128K$1.80$9.80
166195GPT-5 Mini High1002±93K7.7%<0.1%33 tps3.9s400K$0.25$2.00
167148Qwen 2.5 VL 32B Instruct1001±218654.9%6.3%43 tps3.2s128K$0.35$0.62
168148Qwen3 235B A22B Thinking 25071000±102.8K4.4%2.5%53 tps1.6s131K$0.59$5.70
169148OpenAI o3-mini-high999±58.3K4.1%2.4%231 tps10.5s200K$1.10$4.40
170148OpenAI o3-mini999±415K5.5%0.8%143 tps3.3s200K$1.10$4.40
171148OpenAI o4-mini-high995±713.6K6.2%1.9%117 tps15.9s200K$1.10$4.40
172148Seed 1.6 250615995±211.6K6.0%3.1%46 tps2.2s256K$0.25$2.00
173195Arcee AI Virtuoso-Large994±83K5.7%<0.1%64 tps0.5s131K$0.75$1.20
174195Claude Haiku 3992±112.8K3.0%0.4%62 tps0.5s200K$0.25$1.25
175159GPT-5 Nano989±64.6K8.0%3.2%113 tps20.9s400K$0.05$0.40
176159OpenAI o3-mini-low988±612.2K6.4%0.7%139 tps1.5s200K$1.10$4.40
177159Grok Code Fast 1987±92.5K6.0%5.9%294 tps0.5s256K$0.20$1.50
178159GLM 4.6V986±83K5.5%6.4%21 tps1.8s128K$0.38$0.90
179195Cypher Alpha985±207358.7%<0.1%4 tpsN/A1M$0$0
180195GLM 4.6 FP8982±171.2K11.7%<0.1%56 tps1.8s200K$0.40$1.75
181159Kimi K2 0711981±67K4.5%1.6%29 tps1.3s131K$0.72$2.60
182159Seed 2.0 Mini (Medium)981±355705.8%11.9%33 tps1.7s256K$0.15$0.60
183167DeepSeek V3.1 Thinking976±95.2K9.5%7.1%18 tps1.8s131K$0.23$0.75
184211Grok 4 (Low Reasoning)975±215202.8%<0.1%18 tps9.5s256K$0$0
185167Nemotron 3 Nano974±465806.5%1.3%216 tps0.8s256K$0.05$4.94
186211Arcee AI Coder-Large972±159854.4%<0.1%60 tps1.6s33K$0.50$0.80
187219Arcee Coder Large971±73.6K2.6%<0.1%54 tps1.3s33K$0.50$0.80
188167Mistral Small 3.2 24B970±134.6K4.9%2.8%141 tps0.7s33K$0.02$0.08
189167Llama 4 Scout965±517.5K5.3%0.6%88 tps5.1s131K$0.18$0.46
190167Devstral Medium962±113.5K5.2%1.5%77 tps0.6s131K$0.40$2.00
191167Qwen 2.5 72B960±151.4K4.5%1.2%96 tps1.2s131K$0.14$0.26
192179Qwen3 30B A3B Thinking 2507953±103.5K4.7%0.5%124 tps1.2s131K$0.16$1.70
193230Magistral Medium952±161.3K10.8%<0.1%95 tps0.5s41K$2.00$5.00
194179Switchpoint Router949±102.7K3.6%1.7%71 tps4.9s131K$0.85$3.40
195179Qwen3 8B948±94.2K8.2%2.4%61 tps1.4s41K$0.02$0.07
196179Ministral 14B 3.0948±168058.5%2.0%119 tps0.5s128K$0.20$0.20
197179Grok 3 Mini Fast943±79K7.0%1.6%44 tps0.5s131K$0.60$4.00
198179ERNIE 4.5 21B A3B943±285406.9%2.3%78 tps1.5s120K$0.05$0.19
199179ERNIE 4.5 VL 424B A47B942±187256.5%4.9%36 tps3.5s123K$0.42$1.25
200179NVIDIA Llama 3.3 Nemotron Super 49B v1.5941±191.4K6.9%2.0%50 tps0.6s131K$0.09$0.33
201230Jamba 1.7 Mini936±241K8.4%<0.1%84 tps0.9s256K$0.20$0.40
202230Dobby Unhinged Llama 3.3 70B935±198602.8%<0.1%41 tps0.4s128K$0.90$0.90
203230R1 1776935±93.3K4.2%<0.1%61 tps1.0s128K$2.00$8.00
204189Llama 3.3 70B933±103K6.6%0.3%500 tps0.5s8K$0.48$0.66
205245Llama 3.1 70B Instruct Turbo933±114.1K3.8%<0.1%110 tps0.8s128K$0.88$0.88
206189Grok 3 Mini927±69.9K6.4%1.2%43 tps0.5s131K$0.30$0.50
207189Jamba 1.6 Large924±93.3K3.8%2.0%59 tps1.2s256K$1.33$5.33
208245Grok 3 Mini Beta922±141.8K1.9%<0.1%75 tps0.5s131K$0.45$2.25
209245GLM Z1 32B921±101.9K10.1%<0.1%18 tps9.3s33K$0.09$0.11
210245GPT-5 Nano Minimal920±131.4K10.8%<0.1%88 tps0.8s400K$0.05$0.40
211245Solar Pro 2 250710 (Reasoning)919±102.6K3.9%<0.1%9 tpsN/A66K$0.50$0.50
212189Llama 3.3 Swallow 70B Instruct919±83.5K5.5%1.4%153 tps1.3s131K$0.13$0.39
213189Jamba 1.7 Large917±169758.0%1.3%58 tps1.0s256K$1.33$5.33
214189Devstral Small916±171.4K5.0%2.4%180 tps0.6s131K$0.10$0.30
215189Rnj-1 Instruct915±219706.7%0.6%103 tps0.3s33K$0.15$0.15
216189Seed 1.6 Flash 250715913±161.2K5.7%2.5%108 tps1.6s256K$0.07$0.30
217245Solar Pro 3 (Reasoning)913±185954.8%3.2%118 tps1.2s131K$0.15$0.60
218189Inception Mercury Coder Small Beta912±206103.2%1.7%270 tps1.4s32K$0.25$1.00
219264YouTube910±132K5.5%<0.1%34 tps2.7s32K$0.99$0.99
220201Magistral Small 2506908±114.3K3.1%1.6%156 tps0.5s40K$0.37$1.10
221201GPT-3.5 Turbo908±151.2K2.5%1.3%74 tps0.9s16K$0.75$1.75
222264Fauna Fox908±103.4K8.2%<0.1%194 tps0.3s128K$0.04$0.15
223201Llama 3 8B904±103.2K3.5%6.0%85 tps0.7s8K$0.12$0.16
224201Mistral Small 3.2 24B Instruct903±208508.6%1.9%113 tps1.1s131K$0.02$0.08
225264DeepSeek R1T Chimera901±92.9K6.7%<0.1%46 tps1.1s164K$0.09$0.36
226264Grok 2901±72.3K2.7%<0.1%55 tps1.1s131K$2.00$10.00
227201GPT-4o mini900±142.8K5.3%2.1%71 tps1.7s128K$0.15$0.60
228201Moonshot V1 Auto899±229304.1%1.2%54 tps1.5s8K$2.00$5.00
229201Amazon Nova Pro 1.0894±105.7K4.0%0.9%96 tps0.7s300K$0.80$1.70
230264Exaone 3.5 32B Instruct893±216503.0%<0.1%17 tpsN/A33K$0$0
231280Venice Uncensored891±296156.1%<0.1%59 tps3.9s33K$0$0
232280Gemini 1.5 Pro887±82.3K2.3%<0.1%15 tps0.0s2M$0.78$3.13
233210Magistral Medium 2509883±162.6K9.5%4.0%58 tps0.9s131K$2.00$5.00
234280AFM 4.5B Preview882±162.5K3.1%<0.1%32 tps0.0s66K$0$0
235210Gemma 3n E4B882±76K4.5%2.0%30 tps0.5s8K$0.01$0.02
236280Refuel LLM 2 Small881±74.2K3.9%<0.1%116 tps0.5s8K$0.20$0.20
237210Qwen3 4B880±85.1K9.6%1.9%94 tps1.5s128K$0.01$0.01
238280Magistral Small 2507880±1973013.1%<0.1%148 tps0.4s41K$0.50$1.50
239210Moonshot V1 128k879±191.1K4.6%1.4%54 tps1.5s131K$2.00$5.00
240210Inception Mercury878±56.9K3.7%0.4%257 tps1.1s32K$0.25$1.00
241210DeepSeek R1T2 Chimera876±102.1K5.9%3.0%28 tps1.8s164K$0.13$0.45
242210Mistral Medium 3875±234856.7%2.4%47 tps0.8s33K$0.40$2.00
243210Solar Mini 250422874±171.3K5.9%1.8%90 tps1.7s33K$0.15$0.15
244210GLM 4.7 Flash871±286104.7%5.8%61 tps2.8s128K$0.07$0.39
245210Mixtral 8x22B871±221.3K5.0%1.2%140 tps0.6s64K$2.00$6.00
246293Devstral Small 2505871±151.7K6.2%<0.1%141 tps1.3s33K$0.03$0.09
247293AFM 4.5B869±74.4K8.9%<0.1%81 tps0.3s66K$0.05$0.20
248293Claude Sonnet 3869±179001.6%<0.1%35 tps1.0s200K$3.00$15.00
249210Krutrim Spectre V2868±161.3K3.6%<0.1%33 tps3.1s4K$0.19$0.19
250210GLM 4 32B868±122.9K4.9%2.6%40 tps1.6s33K$0.14$0.14
251210Gemma 3 12B867±112.5K4.9%4.2%73 tps0.8s131K$0.05$0.12
252210Moonshot V1 8k863±139155.2%1.0%55 tps1.5s8K$0.20$2.00
253210Qwen 2.5 14B Instruct861±162.4K5.7%2.4%40 tps1.6s1M$0.40$1.61
254312DeepSeek-R1 0528 Qwen3 8B856±84.9K6.5%<0.1%45 tps2.4s128K$0.05$0.09
255210Ministral 3B 2512854±575158.0%2.8%339 tps0.6s131K$0.10$0.10
256234Jamba 1.5 Large851±92.9K4.0%1.7%48 tps0.9s256K$1.50$6.00
257312Wikipedia846±79.8K4.9%<0.1%47 tps2.1s32K$0$0
258324MAI-DS-R1842±73.5K11.7%<0.1%73 tps3.2s64K$0.10$0.40
259234GPT-3.5 Turbo 16k838±102.7K3.6%<0.1%22 tps0.6s16K$3.00$4.00
260324Cogito V2 671B838±171.6K5.9%<0.1%41 tps0.6s164K$1.25$1.25
261234ERNIE 4.5 21B A3B Thinking838±231.1K6.9%1.8%87 tps1.5s120K$0.07$0.28
262240GLM 4.5 Flash834±375208.8%12.2%15 tps2.2s131K$0$0
263324NVIDIA Llama 3.1 Nemotron Ultra 253B v1832±162.2K4.1%<0.1%40 tps0.8s128K$0.30$0.90
264240LFM2 2.6B826±2681010.0%6.7%184 tps0.4s33K$0.01$0.02
265240Krutrim 2825±102.3K2.3%12.5%33 tps2.1s128K$1.00$1.00
266240Moonshot V1 32k820±179503.1%1.4%53 tps1.4s33K$1.00$3.00
267240LFM2 8B A1B818±1882511.3%<0.1%142 tps0.3s33K$0.01$0.02
268337GLM 4.1V 9B Thinking813±161.1K4.2%<0.1%69 tps1.3s66K$0.04$0.14
269337Qwen 2 72B Instruct805±191K3.3%<0.1%3 tpsN/A33K$0.90$0.90
270346Magistral Medium (Thinking)804±102.2K5.7%<0.1%67 tps0.8s41K$2.00$5.00
271252Magistral Small 2509802±181.8K7.5%2.7%116 tps0.6s131K$0.50$1.50
272252WizardLM-2 8x22B801±121.9K3.1%11.6%11 tps2.5s66K$0.77$0.77
273252Hermes 4 405B FP8797±218158.4%3.5%31 tps0.9s131K$0.52$1.73
274346Magistral Medium 2507795±2566514.2%<0.1%86 tps0.7s41K$2.00$5.00
275252Mercury Coder793±275103.8%<0.1%247 tps2.2s32K$0.25$1.00
276252GPT-3.5 Turbo Instruct787±92K2.7%<0.1%46 tps1.2s4K$1.50$2.00
277252Mistral Large785±161.1K5.8%1.5%54 tps0.7s33K$2.00$6.00
278252Hermes 4 70B781±294608.9%1.1%67 tps0.6s131K$0.12$0.39
279262Baichuan-M2-32B770±3074010.8%<0.1%32 tps3.3s131K$0.07$0.07
280354OLMo 3 7B Think763±217707.8%4.2%77 tps0.4s66K$0.12$0.20
281262Open Mistral 7B762±181.3K4.7%0.7%176 tps0.4s33K$0.25$0.25
282361DeepSeek-R1 Distill Qwen 14B756±161.9K6.3%<0.1%44 tps1.7s64K$0.63$0.63
283361Seed Coder 8B Instruct751±226052.4%<0.1%35 tpsN/A32K$0.99$0.99
284262Qwen 2.5 VL 72B Instruct746±202.1K6.0%5.3%25 tps3.7s128K$1.01$2.79
285269Inflection 3 Productivity737±241.5K5.0%0.6%50 tps3.2s8K$2.50$10.00
286361Meridian734±399659.8%<0.1%92 tps1.2s131K$0$0
287361Zenith730±288509.6%<0.1%36 tps1.8s131K$0$0
288374Mistral Nemo 12B Celeste V1.9725±181.1K3.5%<0.1%6 tps10.2s8K$0.80$1.20
289374Solar Pro 250422720±195306.2%<0.1%13 tps0.6s33K$0$0
290269Inflection 3 Pi719±181.5K4.1%1.1%33 tps3.4s8K$2.50$10.00
291269DeepHermes 3 Mistral 24B Preview706±307155.9%2.5%50 tps1.0s33K$0.06$0.25
292374Dolphin 3.0 R1 Mistral 24B701±168907.8%<0.1%13 tps0.1s33K$0.03$0.09
293276DeepSeek-R1 Distill Qwen 32B696±202K5.5%6.2%22 tps1.8s131K$0.37$0.39
294383ArliAI QwQ 32B Arliai RpR V1686±406359.3%<0.1%34 tps1.8s33K$0.02$0.07
295276MiniMax M1686±133.8K5.3%<0.1%31 tps2.8s1M$0.55$2.20
296386MiMo 7B RL655±131.2K3.5%<0.1%31 tps0.4s32K$0.49$0.49
297386Dolphin 2.9.2 Mixtral 8x22B652±191.1K2.6%<0.1%20 tps1.5s16K$0.90$0.90
298386Shisa V2 Llama 3.3 70B623±245859.3%<0.1%8 tps2.0s33K$0.03$0.09
299279UI-TARS 1.5 7B610±4053011.7%4.0%75 tps0.9s128K$0.10$0.20
300390ERNIE 4.5 0.3B602±4068511.0%<0.1%85 tps2.2s120K$0$0
301390DeepSeek-R1 Distill Llama 8B595±191.2K5.6%<0.1%17 tpsN/A32K$0.04$0.04
302279Hunyuan A13B Instruct588±221.6K9.2%2.3%67 tps2.0s33K$0.01$0.01
303399Mistral Nemo 12B Inferor v0.0454±285651.7%<0.1%83 tps0.8s16K$0.80$1.20
304402QwQ 32B RpR v1406±351K10.9%<0.1%34 tps3.3s33K$0.02$0.07
305404Seed Coder 8B Reasoning329±417004.1%<0.1%25 tpsN/A32K$0.99$0.99
Show Less