Models

Last updated about 1 month ago

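| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | 1 | Claude Opus 4.6 | 1596 | ±6 | 21.6K | 1.1% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 |
| 2 | 1 | Claude Sonnet 4.6 | 1594 | ±10 | 15.7K | 1.4% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 |
| 3 | 1 | GPT-5.4 | 1594 | ±14 | 4.4K | 1.6% | 2.6% | 55 tps | 0.8s | 1M | $2.50 | $15.00 |
| 4 | 4 | Claude Opus 4.6 (Thinking) | 1566 | ±8 | 16.5K | 1.6% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00 |
| 5 | 5 | Claude Sonnet 4.6 (Thinking) | 1506 | ±8 | 16.2K | 3.5% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00 |
| 6 | 6 | Claude Opus 4.5 (Thinking) | 1446 | ±4 | 60.8K | 1.9% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 |
| 7 | 7 | Gemini 3.1 Pro | 1418 | ±9 | 22K | 2.5% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |
| 8 | 7 | Claude Opus 4.5 | 1409 | ±5 | 15.1K | 2.2% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 9 | 9 | GPT-5.3 Codex (High) | 1381 | ±9 | 3.2K | 1.2% | 2.0% | 61 tps | 17.8s | 400K | $1.75 | $14.00 |
| 10 | 10 | Claude Sonnet 4.5 (Thinking) | 1362 | ±4 | 58.2K | 3.3% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 11 | 10 | GPT-5.2 Instant | 1358 | ±6 | 15.7K | 3.3% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 12 | 12 | Claude Haiku 4.5 (Extended Thinking) | 1353 | ±4 | 14.3K | 3.8% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 13 | 13 | GPT-5.2 | 1340 | ±8 | 11.3K | 3.2% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 14 | 13 | Gemini 3 Pro | 1337 | ±5 | 59.4K | 2.6% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 15 | 15 | GLM 5 | 1324 | ±14 | 11.7K | 3.3% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 16 | 15 | GPT-5.1 | 1319 | ±7 | 12.9K | 3.4% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 17 | 17 | Claude Sonnet 4.5 | 1307 | ±3 | 20.9K | 5.0% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 18 | 17 | GPT-5.2 (High) | 1297 | ±8 | 30.7K | 2.8% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 19 | 19 | Gemini 3 Pro (Low) | 1291 | ±6 | 11.9K | 4.2% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 20 | 19 | GPT-5.1 (High) | 1290 | ±6 | 19.1K | 3.5% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |
| 21 | 19 | Gemini 3 Flash Preview Thinking | 1286 | ±6 | 32.7K | 3.3% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 22 | 19 | Claude Haiku 4.5 | 1283 | ±3 | 16.4K | 4.5% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 23 | 19 | MiniMax M2.5 | 1283 | ±28 | 510 | 3.8% | 1.4% | 70 tps | 1.9s | 205K | $0.28 | $1.20 |
| 24 | 19 | GPT-5.3 Codex (Medium) | 1278 | ±27 | 1.1K | 2.3% | 2.3% | 62 tps | 10.3s | 400K | $1.75 | $14.00 |
| 25 | 19 | GPT-5.3 Instant | 1271 | ±12 | 4.2K | 2.5% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 26 | 27 | Claude Sonnet 4 (Thinking) | 1261 | ±3 | 25.9K | 2.9% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 27 | 27 | GPT-5 Codex (High) | 1260 | ±7 | 18.5K | 3.3% | 3.2% | 122 tps | 7.1s | 400K | $1.25 | $10.00 |
| 28 | 27 | GPT-5 (High) | 1259 | ±4 | 16.2K | 3.5% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 29 | 27 | GPT-5.2 Codex (High) | 1257 | ±12 | 3.1K | 2.8% | 8.8% | 41 tps | 12.9s | 400K | $1.75 | $14.00 |
| 30 | 31 | GPT-5.1 Codex (High) | 1240 | ±8 | 37K | 3.3% | 3.2% | 96 tps | 3.9s | 400K | $1.25 | $10.00 |
| 31 | 31 | Grok 4.1 Fast Non-Reasoning | 1239 | ±6 | 9.4K | 5.4% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 32 | 31 | GPT-5 Chat | 1231 | ±4 | 35K | 4.5% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 33 | 36 | GPT-5.2 (Extra High) | 1221 | ±9 | 8K | 3.5% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 34 | 36 | Qwen3 VL 235B A22B Instruct | 1220 | ±7 | 5.6K | 6.7% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 35 | 36 | GPT-5 Codex (Medium) | 1214 | ±6 | 8.8K | 3.9% | 4.1% | 122 tps | 5.2s | 400K | $1.25 | $10.00 |
| 36 | 36 | GPT-5.2 Codex (Medium) | 1211 | ±12 | 2.4K | 3.0% | 5.7% | 37 tps | 6.3s | 400K | $1.75 | $14.00 |
| 37 | 43 | Claude Sonnet 4 | 1205 | ±3 | 43.2K | 3.7% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 38 | 43 | Gemini 3 Flash Preview | 1205 | ±11 | 7.2K | 3.7% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 39 | 43 | Gemini 2.5 Pro High | 1204 | ±3 | 21.1K | 5.7% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 40 | 43 | Qwen3 Max Instruct Preview | 1203 | ±6 | 16.1K | 4.6% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |
| 41 | 43 | GPT-5.1 Codex Max | 1200 | ±12 | 6.4K | 3.9% | 3.0% | 118 tps | 4.1s | 400K | $1.25 | $10.00 |
| 42 | 43 | MiniMax M2.1 Lightning | 1197 | ±23 | 875 | 3.3% | 1.7% | 52 tps | 2.1s | 205K | $0.30 | $2.40 |
| 43 | 49 | Qwen3 30B A3B Instruct 2507 | 1194 | ±5 | 12.7K | 5.7% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 44 | 49 | MiniMax M2.1 | 1192 | ±8 | 19.4K | 3.6% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 45 | 49 | DeepSeek V3.2 | 1189 | ±8 | 5.1K | 4.7% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09 |
| 46 | 49 | MiniMax M2.5 FP8 | 1185 | ±17 | 610 | 3.2% | 3.6% | 33 tps | 1.7s | 205K | $0.45 | $1.75 |
| 47 | 49 | GPT-5 | 1185 | ±4 | 21.3K | 5.3% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 48 | 49 | Grok 4 Fast Non-Reasoning | 1185 | ±5 | 8.1K | 7.1% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 |
| 49 | 49 | MiniMax M2 | 1183 | ±5 | 19.7K | 4.2% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 50 | 49 | Nova Experimental Chat 12-10 | 1182 | ±9 | 2.9K | 3.8% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0 |
| 51 | 49 | GLM 4.6 | 1182 | ±7 | 17.2K | 4.4% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 52 | 49 | GPT-5.3 Codex (Low) | 1178 | ±28 | 510 | 1.0% | 1.8% | 61 tps | 4.3s | 400K | $1.75 | $14.00 |
| 53 | 60 | Grok 4.1 Fast Reasoning | 1178 | ±7 | 39.5K | 4.4% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 |
| 54 | 60 | Grok 4 Fast Reasoning | 1177 | ±3 | 14.5K | 5.0% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 55 | 60 | Gemini 2.5 Pro | 1176 | ±3 | 37.9K | 4.8% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 56 | 60 | Qwen3 235B A22B Instruct 2507 | 1172 | ±4 | 12.6K | 6.4% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 57 | 60 | Claude Sonnet 3.5 v2 | 1171 | ±6 | 5.5K | 3.4% | <0.1% | 46 tps | 1.4s | 200K | $3.00 | $15.00 |
| 58 | 60 | GPT-5.1 Codex (Medium) | 1171 | ±14 | 3K | 3.2% | 4.6% | 71 tps | 3.7s | 400K | $1.25 | $10.00 |
| 59 | 60 | GPT-5.1 Instant | 1171 | ±8 | 8.3K | 4.1% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 60 | 60 | Grok 4.20 Beta Reasoning | 1167 | ±22 | 1.2K | 4.1% | 1.1% | 77 tps | 4.5s | 2M | $2.00 | $5.50 |
| 61 | 69 | Qwen3.5 35B A3B | 1164 | ±25 | 865 | 3.9% | 2.1% | 116 tps | 2.1s | 256K | $0.63 | $1.13 |
| 62 | 69 | GPT-5 Codex (Low) | 1163 | ±10 | 5K | 4.1% | 2.7% | 112 tps | 3.5s | 400K | $1.25 | $10.00 |
| 63 | 69 | GLM 4.7 | 1161 | ±7 | 16.8K | 3.7% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 64 | 69 | DeepSeek V3.1 Terminus Chat | 1158 | ±5 | 6.5K | 6.9% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 65 | 74 | Qwen Plus (Aug'24) | 1146 | ±5 | 17.2K | 4.7% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 66 | 74 | Qwen3.5 397B A17B | 1142 | ±14 | 2.5K | 2.9% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 67 | 74 | Gemini 2.5 Flash Preview 0925 | 1140 | ±6 | 7.6K | 6.0% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 68 | 77 | GPT-5 Mini | 1131 | ±5 | 8.6K | 5.4% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 69 | 77 | DeepSeek V3.1 Turbo | 1130 | ±7 | 4.8K | 5.3% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 70 | 77 | Grok 4.20 Multi Agent Beta | 1129 | ±19 | 945 | 3.6% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 |
| 71 | 77 | Qwen3 Max Thinking Preview | 1127 | ±10 | 6.3K | 5.7% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 72 | 77 | Grok 4 | 1125 | ±3 | 39.6K | 4.4% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 73 | 77 | GPT-4.1 | 1123 | ±5 | 32.8K | 5.2% | 3.7% | 112 tps | 1.3s | 1M | $2.00 | $8.00 |
| 74 | 77 | Gemini 2.5 Flash Lite Preview 0925 | 1122 | ±7 | 8.5K | 6.6% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 75 | 85 | Gemini 2.5 Flash Thinking | 1118 | ±4 | 13.7K | 3.6% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 76 | 85 | GPT-5 Mini Minimal | 1114 | ±12 | 3.2K | 8.5% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 77 | 85 | GPT-5.2 Codex (Low) | 1113 | ±19 | 1.2K | 3.2% | 4.5% | 41 tps | 5.0s | 400K | $1.75 | $14.00 |
| 78 | 85 | DeepSeek V3.1 Chat | 1110 | ±7 | 4.9K | 6.6% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 79 | 85 | Qwen3 Omni 30B A3B Thinking | 1110 | ±10 | 2.3K | 6.0% | 3.7% | 67 tps | 1.2s | 66K | $0.97 | $1.79 |
| 80 | 90 | Qwen Max | 1107 | ±4 | 18.3K | 4.2% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 81 | 90 | Gemini 2.5 Flash Lite | 1103 | ±5 | 21.3K | 6.2% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 82 | 90 | Grok 3 Fast | 1102 | ±14 | 2.5K | 4.7% | 1.7% | 52 tps | 2.4s | 131K | $5.00 | $25.00 |
| 83 | 90 | GPT-4o | 1102 | ±5 | 8.5K | 3.7% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 84 | 90 | DeepSeek V3 0324 | 1100 | ±4 | 15.1K | 4.3% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93 |
| 85 | 90 | Qwen3 Coder 480B A35B Instruct | 1099 | ±8 | 3.1K | 4.5% | 3.3% | 61 tps | 2.0s | 262K | $0.71 | $1.34 |
| 86 | 98 | Gemini 2.5 Flash | 1098 | ±4 | 35.9K | 3.2% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 87 | 98 | Grok 3 | 1098 | ±4 | 19.1K | 5.5% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 88 | 98 | DeepSeek V3 0324 Turbo | 1093 | ±5 | 15.5K | 5.7% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 89 | 98 | OpenAI o3-pro | 1090 | ±8 | 5.4K | 4.3% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 90 | 98 | DeepSeek V3.1 | 1089 | ±12 | 2.3K | 4.7% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 91 | 105 | GPT-4.1 mini | 1087 | ±5 | 19.7K | 4.2% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 92 | 105 | GPT-4.1 nano | 1085 | ±5 | 17K | 5.0% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 93 | 105 | Qwen3 Omni 30B A3B Instruct | 1085 | ±13 | 775 | 4.3% | 3.9% | 65 tps | 1.2s | 66K | $0.35 | $0.97 |
| 94 | 105 | DeepSeek V3 (Turbo) | 1082 | ±20 | 1.5K | 5.1% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 95 | 105 | Seed 1.8 251228 | 1081 | ±10 | 3.2K | 3.1% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 96 | 105 | Mistral Medium | 1080 | ±4 | 9.6K | 5.6% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 97 | 105 | Qwen3 Max Thinking | 1080 | ±18 | 1.5K | 2.0% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 98 | 112 | GLM 4.5 | 1075 | ±5 | 6K | 7.0% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 99 | 112 | Kimi K2 0905 | 1074 | ±7 | 8.7K | 4.3% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 100 | 112 | GPT-5 (Low) | 1070 | ±14 | 690 | 3.5% | 1.8% | 75 tps | 8.2s | 400K | $1.25 | $10.00 |
| 101 | 112 | Kimi K2 0905 Turbo | 1070 | ±6 | 7.5K | 9.1% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50 |
| 102 | 112 | Grok 4.20 Beta Non-reasoning | 1063 | ±36 | 500 | 4.8% | 1.1% | 151 tps | 0.6s | 2M | $2.00 | $6.00 |
| 103 | 119 | OpenAI o1 | 1062 | ±6 | 9.9K | 3.3% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 104 | 119 | OpenAI o1-pro | 1061 | ±20 | 680 | 7.5% | 5.2% | 33 tps | 72.8s | 200K | $150.00 | $600.00 |
| 105 | 119 | Gemini 2.5 Flash Lite Thinking | 1061 | ±4 | 9.8K | 6.2% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13 |
| 106 | 119 | Seed 2.0 Lite (Medium) | 1058 | ±20 | 525 | 3.7% | 6.6% | 33 tps | 1.6s | 256K | $0.25 | $2.00 |
| 107 | 119 | LongCat Flash Chat | 1058 | ±12 | 2.7K | 5.9% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 108 | 119 | GPT-5.1 Codex Mini (Medium) | 1057 | ±15 | 1.9K | 4.9% | 4.6% | 69 tps | 4.1s | 400K | $0.25 | $2.00 |
| 109 | 119 | GPT-5.1 Codex Mini (High) | 1054 | ±15 | 2.2K | 3.9% | 5.9% | 70 tps | 4.6s | 400K | $0.25 | $2.00 |
| 110 | 128 | ERNIE 4.5 300B A47B | 1049 | ±4 | 13.5K | 3.9% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 111 | 128 | Cogito v2.1 671B | 1044 | ±19 | 1.2K | 4.6% | 0.8% | 85 tps | 0.5s | 128K | $1.25 | $1.25 |
| 112 | 128 | Qwen3 32B | 1044 | ±19 | 850 | 6.6% | 3.9% | 30 tps | 3.1s | 41K | $0.12 | $0.42 |
| 113 | 128 | GLM 4.5 AirX | 1042 | ±15 | 1.1K | 6.9% | 3.3% | 75 tps | 1.2s | 131K | $1.10 | $4.50 |
| 114 | 128 | Kimi K2 Thinking | 1042 | ±10 | 3.3K | 5.1% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 115 | 128 | OpenAI o4-mini | 1042 | ±5 | 8.5K | 6.4% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 116 | 128 | Gemini 3.1 Flash Lite Preview Thinking | 1039 | ±16 | 1.4K | 4.2% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 117 | 135 | Qwen3 Next 80B A3B Thinking | 1035 | ±5 | 6.2K | 7.4% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 118 | 135 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1035 | ±7 | 5.8K | 6.8% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40 |
| 119 | 135 | Gemini 3.1 Flash Lite Preview | 1034 | ±21 | 980 | 4.4% | 1.0% | 8 tps | 1.2s | 1M | $0.25 | $1.50 |
| 120 | 135 | DeepSeek V3.2 Speciale | 1030 | ±10 | 2.3K | 6.1% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 121 | 135 | Gemini 2.0 Flash Lite | 1029 | ±5 | 14.7K | 9.5% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 122 | 135 | Amazon Nova 2 Lite | 1026 | ±10 | 3.6K | 6.0% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 123 | 144 | DeepSeek V3.1 Nex N1 | 1021 | ±19 | 565 | 5.0% | 3.4% | 24 tps | 7.2s | 131K | $0.14 | $0.50 |
| 124 | 144 | OpenAI o3 | 1020 | ±7 | 5.9K | 4.0% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 125 | 144 | Gemini 2.0 Flash | 1018 | ±7 | 8.2K | 3.8% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 126 | 148 | Qwen3 VL 235B A22B Thinking | 1009 | ±6 | 4.6K | 8.3% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 127 | 148 | Qwen3 Coder Plus | 1007 | ±22 | 610 | 4.7% | 5.1% | 56 tps | 2.3s | 128K | $1.80 | $9.80 |
| 128 | 148 | Qwen 2.5 VL 32B Instruct | 1001 | ±21 | 865 | 4.9% | 6.3% | 43 tps | 3.2s | 128K | $0.35 | $0.62 |
| 129 | 148 | Qwen3 235B A22B Thinking 2507 | 1000 | ±10 | 2.8K | 4.4% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 130 | 148 | OpenAI o3-mini-high | 999 | ±5 | 8.3K | 4.1% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 131 | 148 | OpenAI o3-mini | 999 | ±4 | 15K | 5.5% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 132 | 148 | OpenAI o4-mini-high | 995 | ±7 | 13.6K | 6.2% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 133 | 148 | Seed 1.6 250615 | 995 | ±21 | 1.6K | 6.0% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 134 | 159 | GPT-5 Nano | 989 | ±6 | 4.6K | 8.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 135 | 159 | OpenAI o3-mini-low | 988 | ±6 | 12.2K | 6.4% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 136 | 159 | Grok Code Fast 1 | 987 | ±9 | 2.5K | 6.0% | 5.9% | 294 tps | 0.5s | 256K | $0.20 | $1.50 |
| 137 | 159 | GLM 4.6V | 986 | ±8 | 3K | 5.5% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 138 | 159 | Kimi K2 0711 | 981 | ±6 | 7K | 4.5% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 139 | 159 | Seed 2.0 Mini (Medium) | 981 | ±35 | 570 | 5.8% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 140 | 167 | DeepSeek V3.1 Thinking | 976 | ±9 | 5.2K | 9.5% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 141 | 167 | Nemotron 3 Nano | 974 | ±46 | 580 | 6.5% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 142 | 167 | Mistral Small 3.2 24B | 970 | ±13 | 4.6K | 4.9% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 143 | 167 | Llama 4 Scout | 965 | ±5 | 17.5K | 5.3% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 144 | 167 | Devstral Medium | 962 | ±11 | 3.5K | 5.2% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 145 | 167 | Qwen 2.5 72B | 960 | ±15 | 1.4K | 4.5% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 146 | 179 | Qwen3 30B A3B Thinking 2507 | 953 | ±10 | 3.5K | 4.7% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 147 | 179 | Switchpoint Router | 949 | ±10 | 2.7K | 3.6% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 148 | 179 | Qwen3 8B | 948 | ±9 | 4.2K | 8.2% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 149 | 179 | Ministral 14B 3.0 | 948 | ±16 | 805 | 8.5% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 150 | 179 | Grok 3 Mini Fast | 943 | ±7 | 9K | 7.0% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 151 | 179 | ERNIE 4.5 21B A3B | 943 | ±28 | 540 | 6.9% | 2.3% | 78 tps | 1.5s | 120K | $0.05 | $0.19 |
| 152 | 179 | ERNIE 4.5 VL 424B A47B | 942 | ±18 | 725 | 6.5% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 |
| 153 | 179 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 941 | ±19 | 1.4K | 6.9% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 154 | 189 | Llama 3.3 70B | 933 | ±10 | 3K | 6.6% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 155 | 189 | Grok 3 Mini | 927 | ±6 | 9.9K | 6.4% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 156 | 189 | Jamba 1.6 Large | 924 | ±9 | 3.3K | 3.8% | 2.0% | 59 tps | 1.2s | 256K | $1.33 | $5.33 |
| 157 | 189 | Llama 3.3 Swallow 70B Instruct | 919 | ±8 | 3.5K | 5.5% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 158 | 189 | Jamba 1.7 Large | 917 | ±16 | 975 | 8.0% | 1.3% | 58 tps | 1.0s | 256K | $1.33 | $5.33 |
| 159 | 189 | Devstral Small | 916 | ±17 | 1.4K | 5.0% | 2.4% | 180 tps | 0.6s | 131K | $0.10 | $0.30 |
| 160 | 189 | Rnj-1 Instruct | 915 | ±21 | 970 | 6.7% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 |
| 161 | 189 | Seed 1.6 Flash 250715 | 913 | ±16 | 1.2K | 5.7% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 162 | 189 | Inception Mercury Coder Small Beta | 912 | ±20 | 610 | 3.2% | 1.7% | 270 tps | 1.4s | 32K | $0.25 | $1.00 |
| 163 | 201 | Magistral Small 2506 | 908 | ±11 | 4.3K | 3.1% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 |
| 164 | 201 | GPT-3.5 Turbo | 908 | ±15 | 1.2K | 2.5% | 1.3% | 74 tps | 0.9s | 16K | $0.75 | $1.75 |
| 165 | 201 | Llama 3 8B | 904 | ±10 | 3.2K | 3.5% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 166 | 201 | Mistral Small 3.2 24B Instruct | 903 | ±20 | 850 | 8.6% | 1.9% | 113 tps | 1.1s | 131K | $0.02 | $0.08 |
| 167 | 201 | GPT-4o mini | 900 | ±14 | 2.8K | 5.3% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 168 | 201 | Moonshot V1 Auto | 899 | ±22 | 930 | 4.1% | 1.2% | 54 tps | 1.5s | 8K | $2.00 | $5.00 |
| 169 | 201 | Amazon Nova Pro 1.0 | 894 | ±10 | 5.7K | 4.0% | 0.9% | 96 tps | 0.7s | 300K | $0.80 | $1.70 |
| 170 | 210 | Magistral Medium 2509 | 883 | ±16 | 2.6K | 9.5% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 171 | 210 | Gemma 3n E4B | 882 | ±7 | 6K | 4.5% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 |
| 172 | 210 | Qwen3 4B | 880 | ±8 | 5.1K | 9.6% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 |
| 173 | 210 | Moonshot V1 128k | 879 | ±19 | 1.1K | 4.6% | 1.4% | 54 tps | 1.5s | 131K | $2.00 | $5.00 |
| 174 | 210 | Inception Mercury | 878 | ±5 | 6.9K | 3.7% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 |
| 175 | 210 | DeepSeek R1T2 Chimera | 876 | ±10 | 2.1K | 5.9% | 3.0% | 28 tps | 1.8s | 164K | $0.13 | $0.45 |
| 176 | 210 | Mistral Medium 3 | 875 | ±23 | 485 | 6.7% | 2.4% | 47 tps | 0.8s | 33K | $0.40 | $2.00 |
| 177 | 210 | Solar Mini 250422 | 874 | ±17 | 1.3K | 5.9% | 1.8% | 90 tps | 1.7s | 33K | $0.15 | $0.15 |
| 178 | 210 | GLM 4.7 Flash | 871 | ±28 | 610 | 4.7% | 5.8% | 61 tps | 2.8s | 128K | $0.07 | $0.39 |
| 179 | 210 | Mixtral 8x22B | 871 | ±22 | 1.3K | 5.0% | 1.2% | 140 tps | 0.6s | 64K | $2.00 | $6.00 |
| 180 | 210 | Krutrim Spectre V2 | 868 | ±16 | 1.3K | 3.6% | <0.1% | 33 tps | 3.1s | 4K | $0.19 | $0.19 |
| 181 | 210 | GLM 4 32B | 868 | ±12 | 2.9K | 4.9% | 2.6% | 40 tps | 1.6s | 33K | $0.14 | $0.14 |
| 182 | 210 | Gemma 3 12B | 867 | ±11 | 2.5K | 4.9% | 4.2% | 73 tps | 0.8s | 131K | $0.05 | $0.12 |
| 183 | 210 | Moonshot V1 8k | 863 | ±13 | 915 | 5.2% | 1.0% | 55 tps | 1.5s | 8K | $0.20 | $2.00 |
| 184 | 210 | Qwen 2.5 14B Instruct | 861 | ±16 | 2.4K | 5.7% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 |
| 185 | 210 | Ministral 3B 2512 | 854 | ±57 | 515 | 8.0% | 2.8% | 339 tps | 0.6s | 131K | $0.10 | $0.10 |
| 186 | 234 | Jamba 1.5 Large | 851 | ±9 | 2.9K | 4.0% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 187 | 234 | GPT-3.5 Turbo 16k | 838 | ±10 | 2.7K | 3.6% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |
| 188 | 234 | ERNIE 4.5 21B A3B Thinking | 838 | ±23 | 1.1K | 6.9% | 1.8% | 87 tps | 1.5s | 120K | $0.07 | $0.28 |
| 189 | 240 | GLM 4.5 Flash | 834 | ±37 | 520 | 8.8% | 12.2% | 15 tps | 2.2s | 131K | $0 | $0 |
| 190 | 240 | LFM2 2.6B | 826 | ±26 | 810 | 10.0% | 6.7% | 184 tps | 0.4s | 33K | $0.01 | $0.02 |
| 191 | 240 | Krutrim 2 | 825 | ±10 | 2.3K | 2.3% | 12.5% | 33 tps | 2.1s | 128K | $1.00 | $1.00 |
| 192 | 240 | Moonshot V1 32k | 820 | ±17 | 950 | 3.1% | 1.4% | 53 tps | 1.4s | 33K | $1.00 | $3.00 |
| 193 | 240 | LFM2 8B A1B | 818 | ±18 | 825 | 11.3% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 |
| 194 | 252 | Magistral Small 2509 | 802 | ±18 | 1.8K | 7.5% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 195 | 252 | WizardLM-2 8x22B | 801 | ±12 | 1.9K | 3.1% | 11.6% | 11 tps | 2.5s | 66K | $0.77 | $0.77 |
| 196 | 252 | Hermes 4 405B FP8 | 797 | ±21 | 815 | 8.4% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 |
| 197 | 252 | Mercury Coder | 793 | ±27 | 510 | 3.8% | <0.1% | 247 tps | 2.2s | 32K | $0.25 | $1.00 |
| 198 | 252 | GPT-3.5 Turbo Instruct | 787 | ±9 | 2K | 2.7% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 |
| 199 | 252 | Mistral Large | 785 | ±16 | 1.1K | 5.8% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 |
| 200 | 252 | Hermes 4 70B | 781 | ±29 | 460 | 8.9% | 1.1% | 67 tps | 0.6s | 131K | $0.12 | $0.39 |
| 201 | 262 | Baichuan-M2-32B | 770 | ±30 | 740 | 10.8% | <0.1% | 32 tps | 3.3s | 131K | $0.07 | $0.07 |
| 202 | 262 | Open Mistral 7B | 762 | ±18 | 1.3K | 4.7% | 0.7% | 176 tps | 0.4s | 33K | $0.25 | $0.25 |
| 203 | 262 | Qwen 2.5 VL 72B Instruct | 746 | ±20 | 2.1K | 6.0% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 204 | 269 | Inflection 3 Productivity | 737 | ±24 | 1.5K | 5.0% | 0.6% | 50 tps | 3.2s | 8K | $2.50 | $10.00 |
| 205 | 269 | Inflection 3 Pi | 719 | ±18 | 1.5K | 4.1% | 1.1% | 33 tps | 3.4s | 8K | $2.50 | $10.00 |
| 206 | 269 | DeepHermes 3 Mistral 24B Preview | 706 | ±30 | 715 | 5.9% | 2.5% | 50 tps | 1.0s | 33K | $0.06 | $0.25 |
| 207 | 276 | DeepSeek-R1 Distill Qwen 32B | 696 | ±20 | 2K | 5.5% | 6.2% | 22 tps | 1.8s | 131K | $0.37 | $0.39 |
| 208 | 276 | MiniMax M1 | 686 | ±13 | 3.8K | 5.3% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 |
| 209 | 279 | UI-TARS 1.5 7B | 610 | ±40 | 530 | 11.7% | 4.0% | 75 tps | 0.9s | 128K | $0.10 | $0.20 |
| 210 | 279 | Hunyuan A13B Instruct | 588 | ±22 | 1.6K | 9.2% | 2.3% | 67 tps | 2.0s | 33K | $0.01 | $0.01 |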