Last updated about 1 month ago

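| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1 | 4 | GPT-5.4 (High) | 1398 | ±10 | 2.6K | 1.2% | 4.6% | 68 tps | 7.9s | 1M | $2.50 | $15.00 |
| 2 | 2 | GPT-5.4 | 1364 | ±7 | 2.3K | 1.1% | 2.6% | 55 tps | 0.8s | 1M | $2.50 | $15.00 |
| 3 | 1 | Claude Opus 4.6 (Thinking) | 1342 | ±5 | 5.7K | 0.9% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00 |
| 4 | 22 | Grok 4.20 Beta Non-reasoning | 1316 | ±13 | 630 | 3.8% | 1.1% | 151 tps | 0.6s | 2M | $2.00 | $6.00 |
| 5 | 8 | GPT-5.1 (High) | 1289 | ±2 | 23K | 1.4% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |
| 6 | 8 | GPT-5.1 | 1288 | ±2 | 19.7K | 1.3% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 7 | 8 | GPT-5.1 (Medium) | 1286 | ±3 | 6.6K | 1.7% | <0.1% | 86 tps | 3.8s | 400K | $0.83 | $6.67 |
| 8 | 2 | Claude Opus 4.6 | 1274 | ±4 | 7.1K | 1.4% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 |
| 9 | 6 | Gemini 3.1 Pro | 1270 | ±5 | 12.3K | 1.1% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |
| 10 | 10 | GPT-5.2 Instant | 1260 | ±3 | 27.1K | 0.8% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 11 | 5 | Claude Sonnet 4.6 (Thinking) | 1256 | ±5 | 5.8K | 1.4% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00 |
| 12 | 16 | Nova Experimental Chat 11-10 | 1249 | ±3 | 14.2K | 1.6% | 0.4% | 84 tps | 8.9s | 98K | $0 | $0 |
| 13 | 19 | Mistral Medium 3.1 | 1249 | ±2 | 25.6K | 1.6% | <0.1% | 77 tps | 0.7s | 128K | $0.40 | $2.00 |
| 14 | 10 | Gemini 3 Pro | 1222 | ±3 | 44.5K | 1.1% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 15 | 37 | Nova Experimental Chat 10-20 | 1219 | ±3 | 10.2K | 2.8% | <0.1% | 30 tps | 0.5s | 98K | $0 | $0 |
| 16 | 17 | Grok 4.20 Beta Reasoning | 1216 | ±7 | 2.1K | 1.7% | 1.1% | 77 tps | 4.5s | 2M | $2.00 | $5.50 |
| 17 | 28 | Ministral 8B 2512 | 1213 | ±7 | 1.5K | 2.6% | <0.1% | 174 tps | 0.5s | 128K | $0.15 | $0.15 |
| 18 | 33 | Qwen Plus 0728 | 1213 | ±3 | 5.2K | 2.3% | <0.1% | 55 tps | 0.9s | 1M | $0.40 | $1.20 |
| 19 | 4 | Claude Sonnet 4.6 | 1212 | ±5 | 6K | 1.1% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 |
| 20 | 29 | Qwen3 VL 235B A22B Instruct | 1211 | ±3 | 10.2K | 2.5% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 21 | 22 | GPT-5 Chat | 1208 | ±2 | 58K | 1.4% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 22 | 26 | Grok 4.1 Fast Non-Reasoning | 1207 | ±3 | 20.1K | 1.7% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 23 | 13 | GPT-5.3 Instant | 1206 | ±4 | 5.5K | 1.0% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 24 | 29 | Nova Experimental Chat 12-10 | 1206 | ±3 | 9K | 1.2% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0 |
| 25 | 37 | Sherlock Dash Alpha | 1205 | ±5 | 1.7K | 1.7% | <0.1% | 68 tps | 0.7s | 2M | $0 | $0 |
| 26 | 33 | Grok 4.20 Multi Agent Beta | 1197 | ±7 | 1.7K | 1.8% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 |
| 27 | 14 | Gemini 3 Flash Preview Thinking | 1195 | ±3 | 19.7K | 1.0% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 28 | 14 | Gemini 3 Pro (Low) | 1195 | ±3 | 20.3K | 1.1% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 29 | 16 | GPT-5.2 | 1193 | ±2 | 16.3K | 0.9% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 30 | 37 | Qwen3 Omni 30B A3B Thinking | 1188 | ±5 | 5.3K | 1.2% | 3.7% | 67 tps | 1.2s | 66K | $0.97 | $1.79 |
| 31 | 33 | Qwen3 Next 80B A3B Instruct | 1185 | ±2 | 19.8K | 1.9% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 32 | 48 | gpt-oss-120b | 1182 | ±2 | 26.3K | 1.3% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 33 | 32 | Gemini 2.5 Pro High | 1175 | ±1 | 27.6K | 2.0% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 34 | 17 | Gemini 3 Flash Preview | 1173 | ±3 | 12.8K | 0.7% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 35 | 17 | GPT-5.2 (High) | 1168 | ±2 | 31.4K | 1.1% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 36 | 33 | Qwen3 30B A3B Instruct 2507 | 1167 | ±2 | 24.4K | 1.7% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 37 | 56 | Gemini 2.5 Pro Low | 1166 | ±2 | 16K | 2.1% | <0.1% | 89 tps | 2.4s | 1M | $1.25 | $10.00 |
| 38 | 48 | OpenAI o1-mini | 1160 | ±2 | 17K | 1.8% | <0.1% | 118 tps | N/A | 128K | $1.13 | $4.51 |
| 39 | 44 | Grok 4.1 Fast Reasoning | 1160 | ±2 | 23.1K | 1.8% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 |
| 40 | 37 | Kimi K2.5 Instant | 1158 | ±6 | 4K | 1.5% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 41 | 62 | Qwen3 Omni 30B A3B Instruct | 1157 | ±6 | 2.3K | 1.9% | 3.9% | 65 tps | 1.2s | 66K | $0.35 | $0.97 |
| 42 | 26 | GPT-5 (High) | 1156 | ±3 | 12.3K | 2.3% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 43 | 52 | Grok 4 Fast Non-Reasoning | 1156 | ±2 | 16.7K | 2.2% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 |
| 44 | 44 | Gemini 2.5 Pro | 1153 | ±2 | 38.9K | 1.8% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 45 | 44 | DeepSeek V3.1 Terminus Chat | 1152 | ±2 | 14.5K | 1.8% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 46 | 40 | DeepSeek V3.2 | 1152 | ±3 | 16.5K | 1.1% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09 |
| 47 | 84 | Nova Experimental Chat 10-09 | 1151 | ±4 | 5.3K | 4.0% | <0.1% | 59 tps | 6.1s | 98K | $0 | $0 |
| 48 | 43 | Gemini 2.5 Flash Thinking Preview 0925 | 1151 | ±3 | 13.9K | 2.1% | <0.1% | 111 tps | 4.7s | 1M | $0.30 | $2.50 |
| 49 | 42 | Qwen3 Max Instruct Preview | 1151 | ±2 | 26.4K | 2.0% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |
| 50 | 48 | Step 3.5 Flash | 1150 | ±6 | 2.9K | 1.7% | 2.2% | 109 tps | 0.6s | 256K | $0.05 | $0.15 |
| 51 | 22 | GLM 5 | 1149 | ±4 | 6.4K | 1.2% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 52 | 10 | Claude Sonnet 4.5 (Thinking) | 1148 | ±2 | 27.2K | 2.5% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 53 | 182 | MAI-DS-R1 FP8 | 1148 | ±10 | 605 | 2.4% | <0.1% | 79 tps | 2.8s | 164K | $0.25 | $1.00 |
| 54 | 33 | Kimi K2.5 | 1147 | ±3 | 16.1K | 1.2% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 55 | 7 | Claude Opus 4.5 (Thinking) | 1147 | ±4 | 21.9K | 1.6% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 |
| 56 | 40 | Qwen3 235B A22B Instruct 2507 | 1147 | ±2 | 24.5K | 1.4% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 57 | 42 | GPT-5.2 (Extra High) | 1147 | ±2 | 15.6K | 1.4% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 58 | 48 | Polaris Alpha | 1146 | ±5 | 1.6K | 1.9% | <0.1% | 48 tps | 1.1s | 256K | $0 | $0 |
| 59 | 44 | Kimi K2 Thinking Turbo | 1145 | ±2 | 10.9K | 1.6% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 60 | 56 | DeepSeek V3.2 Thinking | 1144 | ±4 | 16.9K | 1.3% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 61 | 29 | MiniMax M2.7 | 1142 | ±13 | 700 | 1.4% | 3.0% | 34 tps | 2.5s | 205K | $0.30 | $1.20 |
| 62 | 48 | Grok 4 Fast Reasoning | 1142 | ±3 | 14.5K | 2.0% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 63 | 17 | GPT-5.4 mini | 1141 | ±14 | 545 | 1.8% | 0.8% | 148 tps | 0.5s | 400K | $0.75 | $4.50 |
| 64 | 56 | DeepSeek V3.1 Turbo | 1134 | ±3 | 9.5K | 1.2% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 65 | 65 | Mistral Large 3 | 1133 | ±4 | 10.8K | 2.6% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 66 | 80 | GPT-5 (Minimal) | 1132 | ±3 | 12.9K | 2.2% | <0.1% | 67 tps | 1.4s | 400K | $1.25 | $10.00 |
| 67 | 84 | Claude Sonnet 3.7 (Thinking) | 1130 | ±4 | 2.3K | 2.7% | <0.1% | 41 tps | 2.6s | 200K | $3.00 | $15.00 |
| 68 | 56 | MiniMax M2.1 Lightning | 1129 | ±5 | 3.6K | 1.4% | 1.7% | 52 tps | 2.1s | 205K | $0.30 | $2.40 |
| 69 | 86 | Qwen3 235B A22B | 1129 | ±3 | 7.8K | 2.1% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 70 | 71 | DeepSeek V3.1 | 1125 | ±4 | 4.4K | 1.1% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 71 | 65 | DeepSeek V3.2 Exp Chat | 1125 | ±3 | 11.5K | 1.9% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 72 | 60 | MiniMax M2.1 | 1124 | ±3 | 24.4K | 1.0% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 73 | 86 | Nemotron 3 Nano (Thinking) | 1123 | ±3 | 5.9K | 1.5% | 2.0% | 200 tps | 0.5s | 256K | $0 | $0 |
| 74 | 52 | Qwen3.5 122B A17B | 1123 | ±5 | 2.6K | 1.3% | 1.5% | 82 tps | 1.4s | 256K | $0.40 | $3.20 |
| 75 | 100 | Qwen Plus 0728 (Thinking) | 1123 | ±5 | 3K | 2.1% | <0.1% | 56 tps | 1.1s | 1M | $0.40 | $4.00 |
| 76 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1121 | ±3 | 14.1K | 1.8% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 77 | 60 | Gemini 2.5 Flash Preview 0925 | 1118 | ±3 | 14.4K | 2.2% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 78 | 52 | GPT-5 | 1117 | ±2 | 31.1K | 1.7% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 79 | 81 | OpenAI o3-pro | 1116 | ±5 | 3.2K | 2.8% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 80 | 68 | Grok 4 | 1110 | ±1 | 98.8K | 0.9% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 81 | 17 | Claude Opus 4.5 | 1110 | ±4 | 12.9K | 2.2% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 82 | 62 | MiniMax M2 | 1110 | ±3 | 17.2K | 2.5% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 83 | 133 | Gemini 2.5 Pro Preview 0605 | 1109 | ±13 | 530 | 1.9% | <0.1% | 0 tps | 3.7s | 1M | $1.25 | $10.00 |
| 84 | 77 | GPT-4.5 Preview | 1108 | ±5 | 4.8K | 0.8% | <0.1% | 36 tps | 3.0s | 200K | $75.00 | $150.00 |
| 85 | 71 | Qwen3.5 397B A17B | 1107 | ±6 | 5.1K | 1.6% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 86 | 86 | DeepSeek V3.1 Nex N1 | 1107 | ±8 | 1.5K | 1.3% | 3.4% | 24 tps | 7.2s | 131K | $0.14 | $0.50 |
| 87 | 79 | Qwen3 Max Thinking Preview | 1106 | ±4 | 13.3K | 2.0% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 88 | 101 | DeepSeek V3 (Turbo) | 1105 | ±5 | 3.7K | 1.5% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 89 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1105 | ±8 | 2K | 1.7% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 90 | 68 | GLM 4.7 | 1105 | ±3 | 21K | 1.0% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 91 | 95 | DeepSeek-R1 Turbo | 1104 | ±5 | 4.8K | 1.5% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 92 | 108 | GPT-5 Mini Low | 1100 | ±4 | 4.3K | 2.4% | <0.1% | 69 tps | 3.2s | 400K | $0.25 | $2.00 |
| 93 | 86 | Amazon Nova 2 Lite | 1099 | ±4 | 10.5K | 2.7% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 94 | 68 | Qwen Plus (Aug'24) | 1098 | ±2 | 50.5K | 1.1% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 95 | 101 | GPT-5 (Low) | 1097 | ±7 | 1.5K | 1.0% | 1.8% | 75 tps | 8.2s | 400K | $1.25 | $10.00 |
| 96 | 62 | GPT-5.1 Instant | 1096 | ±3 | 14.9K | 1.5% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 97 | 84 | GPT-5 Mini Minimal | 1094 | ±3 | 4.9K | 3.0% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 98 | 111 | Solar Pro 3 (Reasoning) | 1093 | ±7 | 2.6K | 1.5% | 3.2% | 118 tps | 1.2s | 131K | $0.15 | $0.60 |
| 99 | 95 | Kimi K2 Thinking | 1092 | ±4 | 5.4K | 2.0% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 100 | 37 | Claude Sonnet 4.5 | 1092 | ±2 | 25.2K | 2.2% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 101 | 71 | MiniMax M2.5 FP8 | 1092 | ±10 | 2.1K | 1.6% | 3.6% | 33 tps | 1.7s | 205K | $0.45 | $1.75 |
| 102 | 81 | GPT-4o | 1091 | ±2 | 23.5K | 0.7% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 103 | 71 | Seed 1.8 251228 | 1090 | ±3 | 14.9K | 1.5% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 104 | 52 | Claude Haiku 4.5 | 1089 | ±3 | 20.4K | 2.1% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 105 | 133 | Qwen3 14B | 1088 | ±4 | 8.2K | 2.3% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 106 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1087 | ±2 | 15.1K | 2.5% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 107 | 86 | DeepSeek V3.1 Chat | 1087 | ±3 | 10.7K | 1.8% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 108 | 71 | GPT-5 Mini | 1087 | ±3 | 11.3K | 2.1% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 109 | 51 | GPT-5.2 (Medium) | 1087 | ±10 | 685 | 2.1% | <0.1% | 39 tps | 2.5s | 400K | $1.75 | $14.00 |
| 110 | 95 | Qwen3 32B | 1085 | ±5 | 2.6K | 1.5% | 3.9% | 30 tps | 3.1s | 41K | $0.12 | $0.42 |
| 111 | 93 | Qwen Max | 1084 | ±2 | 54.8K | 0.9% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 112 | 95 | DeepSeek V3.2 Exp Thinking | 1084 | ±5 | 4.8K | 1.9% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 113 | 100 | Gemini 2.5 Flash Preview | 1082 | ±5 | 8.8K | 0.6% | <0.1% | 138 tps | 6.9s | 1M | $0.15 | $0.60 |
| 114 | 133 | Solar Pro 2 250710 | 1081 | ±2 | 21.6K | 1.4% | <0.1% | 9 tps | N/A | 66K | $0.50 | $0.50 |
| 115 | 93 | DeepSeek V3 0324 Turbo | 1081 | ±3 | 50.9K | 1.4% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 116 | 101 | gpt-oss-20b | 1080 | ±2 | 14.2K | 1.7% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 117 | 147 | Grok 4 0709 EU | 1077 | ±7 | 2.4K | 1.8% | <0.1% | 33 tps | 8.2s | 128K | $3.00 | $15.00 |
| 118 | 133 | Nemotron 3 Nano | 1076 | ±8 | 1.6K | 1.9% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 119 | 106 | DeepSeek V3.1 Terminus Thinking | 1075 | ±4 | 6.7K | 1.9% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 120 | 161 | DeepSeek Prover v2 | 1075 | ±10 | 1.4K | 1.4% | 5.2% | 14 tps | 1.3s | 164K | $0.40 | $1.56 |
| 121 | 65 | GLM 4.6 | 1075 | ±3 | 11.7K | 2.8% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 122 | 121 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 1074 | ±6 | 3.5K | 2.2% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 123 | 71 | Gemini 3.1 Flash Lite Preview | 1073 | ±11 | 1.3K | 2.2% | 1.0% | 8 tps | 1.2s | 1M | $0.25 | $1.50 |
| 124 | 126 | Qwen3 30B A3B | 1073 | ±4 | 9.6K | 2.1% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 125 | 22 | MiniMax M2.7-highspeed | 1071 | ±11 | 675 | 2.2% | 2.3% | 50 tps | 2.1s | 205K | $0.60 | $2.40 |
| 126 | 111 | LongCat Flash Chat | 1071 | ±4 | 3.8K | 1.9% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 127 | 133 | DeepSeek-R1 0528 | 1070 | ±6 | 4.4K | 1.6% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 128 | 106 | DeepSeek V3 0324 | 1067 | ±2 | 37.1K | 1.0% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93 |
| 129 | 147 | GLM 4.5 Air | 1067 | ±3 | 14.8K | 2.0% | <0.1% | 22 tps | 1.4s | 131K | $0.10 | $0.38 |
| 130 | 113 | Kimi K2 Fast | 1067 | ±2 | 94.2K | 1.2% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 131 | 148 | DeepSeek-R1 | 1067 | ±5 | 4.9K | 1.4% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 132 | 113 | GLM 4.5 AirX | 1067 | ±5 | 3.5K | 1.8% | 3.3% | 75 tps | 1.2s | 131K | $1.10 | $4.50 |
| 133 | 101 | Gemini 2.5 Flash Lite | 1066 | ±2 | 36K | 1.7% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 134 | 81 | Qwen3.5 27B | 1065 | ±13 | 1.2K | 2.4% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 135 | 129 | DeepSeek V3.1 Thinking | 1063 | ±4 | 9.3K | 2.2% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 136 | 124 | Qwen3 235B A22B Thinking 2507 | 1061 | ±4 | 3.6K | 1.2% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 137 | 79 | MiniMax M2.5 Lightning | 1060 | ±5 | 4.1K | 1.5% | 1.5% | 51 tps | 2.0s | 205K | $0.60 | $2.40 |
| 138 | 95 | Gemini 2.5 Flash | 1060 | ±2 | 97.6K | 1.0% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 139 | 121 | Qwen3 32B Fast | 1060 | ±4 | 11.4K | 2.3% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 140 | 153 | Apriel 1.5 15B Thinker | 1059 | ±8 | 1.5K | 2.5% | 2.4% | 146 tps | 0.4s | 131K | $0 | $0 |
| 141 | 153 | GLM 4.5 FP8 | 1059 | ±9 | 1.4K | 2.7% | <0.1% | 59 tps | 1.2s | 131K | $0.41 | $1.65 |
| 142 | 121 | QwQ 32B | 1058 | ±3 | 15.3K | 2.1% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 143 | 119 | GLM 4.7 FP8 | 1057 | ±6 | 2.2K | 1.6% | 6.9% | 40 tps | 1.3s | 200K | $0.30 | $1.20 |
| 144 | 71 | Gemini 2.5 Flash Thinking | 1054 | ±4 | 7.9K | 1.4% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 145 | 143 | Solar Pro 2 251215 | 1053 | ±9 | 755 | 1.3% | 1.8% | 107 tps | 1.5s | 66K | $0.15 | $0.60 |
| 146 | 95 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1052 | ±2 | 9.4K | 2.4% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40 |
| 147 | 113 | GLM 4.5 | 1050 | ±2 | 12.3K | 1.6% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 148 | 106 | Grok 3 | 1049 | ±2 | 56K | 1.4% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 149 | 119 | ERNIE 4.5 300B A47B | 1049 | ±2 | 44.6K | 1.0% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 150 | 113 | Gemini 2.5 Flash Lite Thinking | 1048 | ±3 | 12.2K | 2.2% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13 |
| 151 | 111 | Claude Sonnet 3.7 | 1046 | ±2 | 28K | 0.8% | <0.1% | 39 tps | 1.6s | 200K | $3.00 | $15.00 |
| 152 | 104 | Grok 3 Beta | 1045 | ±4 | 5.8K | 0.6% | <0.1% | 58 tps | 0.8s | 131K | $3.00 | $15.00 |
| 153 | 113 | Mistral Medium | 1044 | ±2 | 33.2K | 1.3% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 154 | 86 | Seed 2.0 Lite (Medium) | 1043 | ±9 | 1.2K | 2.0% | 6.6% | 33 tps | 1.6s | 256K | $0.25 | $2.00 |
| 155 | 84 | MiniMax M2.5 | 1038 | ±11 | 1.5K | 2.0% | 1.4% | 70 tps | 1.9s | 205K | $0.28 | $1.20 |
| 156 | 56 | Claude Opus 4.1 (Thinking) | 1037 | ±4 | 6.7K | 2.0% | <0.1% | 20 tps | 3.9s | 200K | $15.00 | $75.00 |
| 157 | 124 | Kimi K2 0905 Turbo | 1037 | ±3 | 18.8K | 2.4% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50 |
| 158 | 106 | Claude Sonnet 3.5 v2 | 1035 | ±4 | 16.6K | 1.0% | <0.1% | 46 tps | 1.4s | 200K | $3.00 | $15.00 |
| 159 | 148 | Qwen3 30B A3B Thinking 2507 | 1035 | ±4 | 4K | 1.4% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 160 | 159 | Gemini 2.5 Pro Preview 0325 | 1034 | ±8 | 1.7K | 2.5% | <0.1% | 3 tps | 16.6s | 1M | $1.25 | $10.00 |
| 161 | 147 | Arcee AI Maestro Reasoning | 1033 | ±4 | 11.8K | 1.6% | <0.1% | 85 tps | 0.3s | 131K | $0.90 | $3.30 |
| 162 | 159 | Llama 3.1 405B Instruct | 1032 | ±8 | 1.3K | 1.5% | <0.1% | 52 tps | 0.5s | 128K | $2.60 | $4.27 |
| 163 | 177 | Llama 3 70B Turbo | 1031 | ±3 | 15.7K | 1.3% | <0.1% | 31 tps | 0.0s | 8K | $0.73 | $0.83 |
| 164 | 118 | GPT-4.1 mini | 1031 | ±2 | 57.7K | 1.3% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 165 | 153 | Ministral 14B 3.0 | 1031 | ±6 | 2.3K | 3.1% | 2.0% | 119 tps | 0.5s | 128K | $0.20 | $0.20 |
| 166 | 129 | Command A | 1030 | ±2 | 67.4K | 1.3% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 167 | 21 | Claude Opus 4 (Thinking) | 1030 | ±6 | 1.3K | 3.0% | <0.1% | 28 tps | 1.3s | 200K | $15.00 | $75.00 |
| 168 | 111 | Grok 3 Fast | 1030 | ±3 | 12K | 1.1% | 1.7% | 52 tps | 2.4s | 131K | $5.00 | $25.00 |
| 169 | 126 | Qwen3 VL 235B A22B Thinking | 1027 | ±4 | 7.3K | 2.9% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 170 | 133 | DeepSeek V3.2 Speciale | 1027 | ±5 | 5.9K | 2.2% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 171 | 165 | Qwen3 4B | 1027 | ±4 | 9.4K | 3.3% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 |
| 172 | 126 | DeepSeek V3 | 1027 | ±2 | 41.8K | 1.0% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 173 | 165 | Qwen3 VL 30B A3B Thinking | 1027 | ±7 | 2.3K | 4.6% | 4.5% | 84 tps | 2.9s | 127K | $0.20 | $1.47 |
| 174 | 86 | Claude Sonnet 4 | 1026 | ±2 | 88.9K | 1.5% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 175 | 182 | GLM 4.5 Turbo | 1025 | ±11 | 1K | 2.9% | <0.1% | 46 tps | 1.6s | 131K | $1.00 | $3.00 |
| 176 | 159 | Qwen Turbo | 1025 | ±3 | 32.8K | 1.3% | <0.1% | 53 tps | 1.1s | 1M | $0.05 | $0.20 |
| 177 | 143 | Mistral Medium 3 | 1023 | ±9 | 1.2K | 1.7% | 2.4% | 47 tps | 0.8s | 33K | $0.40 | $2.00 |
| 178 | 182 | GLM 4.6 FP8 | 1022 | ±5 | 2.2K | 3.7% | <0.1% | 56 tps | 1.8s | 200K | $0.40 | $1.75 |
| 179 | 77 | Claude Opus 4.1 | 1022 | ±3 | 6.5K | 2.5% | 3.0% | 17 tps | 3.7s | 200K | $15.00 | $75.00 |
| 180 | 161 | Qwen3 8B | 1020 | ±5 | 6.1K | 2.6% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 181 | 143 | Seed 1.6 250615 | 1018 | ±4 | 3.6K | 1.6% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 182 | 182 | Fauna Fox | 1018 | ±4 | 10.7K | 2.4% | <0.1% | 194 tps | 0.3s | 128K | $0.04 | $0.15 |
| 183 | 148 | OpenAI o3 | 1018 | ±6 | 4.2K | 1.8% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 184 | 133 | Kimi K2 0905 | 1016 | ±4 | 9.2K | 2.1% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 185 | 157 | Qwen3 Next 80B A3B Thinking | 1015 | ±3 | 12.3K | 2.3% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 186 | 133 | GPT-4.1 nano | 1014 | ±2 | 52.1K | 1.3% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 187 | 101 | Qwen3.5 35B A3B | 1011 | ±15 | 1.1K | 1.4% | 2.1% | 116 tps | 2.1s | 256K | $0.63 | $1.13 |
| 188 | 179 | Baichuan-M2-32B | 1011 | ±8 | 1.4K | 2.7% | <0.1% | 32 tps | 3.3s | 131K | $0.07 | $0.07 |
| 189 | 139 | Seed 2.0 Mini (Medium) | 1010 | ±10 | 1.3K | 2.6% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 190 | 148 | OpenAI o4-mini-high | 1009 | ±2 | 19.5K | 2.1% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 191 | 175 | MiMo V2 Flash | 1009 | ±10 | 645 | 3.7% | 7.2% | 24 tps | 1.9s | 262K | $0.07 | $0.23 |
| 192 | 139 | Qwen3 VL 30B A3B Instruct | 1009 | ±9 | 1.2K | 4.5% | 1.8% | 80 tps | 2.6s | 129K | $0.18 | $0.67 |
| 193 | 193 | GPT-5 Nano High | 1008 | ±10 | 600 | 1.6% | <0.1% | 23 tps | 25.7s | 400K | $0.05 | $0.40 |
| 194 | 129 | Qwen3 Max Thinking | 1007 | ±5 | 6.9K | 1.4% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 195 | 139 | OpenAI o4-mini | 1007 | ±3 | 13.6K | 2.2% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 196 | 165 | ERNIE 4.5 21B A3B | 1006 | ±7 | 1.4K | 1.8% | 2.3% | 78 tps | 1.5s | 120K | $0.05 | $0.19 |
| 197 | 186 | Mistral Small 3.2 24B Instruct | 1006 | ±7 | 1.6K | 3.4% | 1.9% | 113 tps | 1.1s | 131K | $0.02 | $0.08 |
| 198 | 233 | TNG Tech DeepSeek R1T Chimera | 1006 | ±12 | 600 | 0.8% | <0.1% | 78 tps | 1.5s | 164K | $0.11 | $0.44 |
| 199 | 139 | GLM 4.6V | 1005 | ±3 | 7.4K | 1.4% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 200 | 153 | Qwen 2.5 32B Instruct | 1005 | ±3 | 15.7K | 1.0% | 2.5% | 48 tps | 1.0s | 131K | $0.21 | $0.25 |
| 201 | 194 | GLM 4.5 Flash | 1005 | ±11 | 1.1K | 2.3% | 12.2% | 15 tps | 2.2s | 131K | $0 | $0 |
| 202 | 161 | Mistral Small 3.1 | 1005 | ±3 | 9.4K | 1.0% | 7.4% | 13 tps | 2.6s | 32K | $0.17 | $0.28 |
| 203 | 182 | Qwen 2.5 72B Turbo | 1004 | ±6 | 2.9K | 1.2% | <0.1% | 84 tps | 0.8s | 131K | $0.60 | $0.60 |
| 204 | 143 | Gemini 2.0 Flash | 1004 | ±3 | 23.8K | 0.9% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 205 | 193 | Solar Pro 2 250909 | 1003 | ±14 | 725 | 2.7% | <0.1% | 84 tps | 1.1s | 66K | $0.15 | $0.15 |
| 206 | 170 | Devstral Small 2507 | 1002 | ±8 | 980 | 2.0% | 2.2% | 186 tps | 0.5s | 131K | $0.10 | $0.30 |
| 207 | 253 | R1 1776 | 1001 | ±5 | 2.2K | 2.2% | <0.1% | 61 tps | 1.0s | 128K | $2.00 | $8.00 |
| 208 | 246 | Amazon Nova Micro 1.0 | 1000 | ±23 | 630 | 2.3% | 4.1% | 193 tps | 0.6s | 128K | $0.04 | $0.07 |
| 209 | 200 | Llama 3 8B Turbo | 999 | ±7 | 2.5K | 1.4% | <0.1% | 97 tps | 0.1s | 8K | $0.12 | $0.13 |
| 210 | 219 | EXAONE Deep 32B | 999 | ±5 | 3.4K | 1.7% | <0.1% | 24 tps | N/A | 33K | $0 | $0 |
| 211 | 165 | DeepSeek R1T2 Chimera | 998 | ±5 | 5.6K | 1.7% | 3.0% | 28 tps | 1.8s | 164K | $0.13 | $0.45 |
| 212 | 165 | Pixtral Large | 997 | ±5 | 7.6K | 1.8% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 213 | 21 | Claude Opus 4 | 997 | ±5 | 3.4K | 2.1% | <0.1% | 25 tps | 1.5s | 200K | $15.00 | $75.00 |
| 214 | 186 | Jamba 1.7 Large | 994 | ±9 | 2.1K | 2.5% | 1.3% | 58 tps | 1.0s | 256K | $1.33 | $5.33 |
| 215 | 143 | Gemini 2.0 Flash Lite | 994 | ±3 | 55.3K | 2.8% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 216 | 170 | Devstral Medium | 992 | ±4 | 10.6K | 0.9% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 217 | 200 | K2 Think | 991 | ±6 | 3.6K | 1.2% | <0.1% | 418 tps | 2.8s | N/A | $0 | $0 |
| 218 | 219 | NVIDIA Llama 3.3 Nemotron Super 49B v1 | 991 | ±2 | 14.5K | 0.5% | <0.1% | 13 tps | N/A | 131K | $0.07 | $0.20 |
| 219 | 160 | Llama 4 Scout | 990 | ±2 | 55K | 1.5% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 220 | 200 | NVIDIA Llama 3.1 Nemotron 70B | 990 | ±3 | 20.1K | 0.8% | <0.1% | 9 tps | 0.1s | 128K | $0.33 | $0.39 |
| 221 | 186 | Gemma 3 27B | 990 | ±6 | 3.1K | 1.8% | 1.8% | 35 tps | 1.1s | 66K | $0.06 | $0.10 |
| 222 | 170 | Llama 3.1 8B Turbo | 989 | ±5 | 7.3K | 1.5% | 2.1% | 650 tps | 0.5s | 128K | $0.13 | $0.14 |
| 223 | 157 | GPT-5 Nano | 989 | ±3 | 6.6K | 3.1% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 224 | 161 | Llama 4 Maverick | 987 | ±2 | 59.4K | 1.5% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 225 | 170 | Kimi K2 0711 | 987 | ±3 | 19K | 1.1% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 226 | 48 | Claude Sonnet 4 (Thinking) | 986 | ±3 | 8.7K | 1.8% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 227 | 157 | Cogito v2.1 671B | 986 | ±4 | 3.6K | 1.2% | 0.8% | 85 tps | 0.5s | 128K | $1.25 | $1.25 |
| 228 | 170 | Mistral Small 3.2 24B | 986 | ±3 | 13K | 1.1% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 229 | 219 | Arcee AI Virtuoso-Large | 985 | ±2 | 11.7K | 1.1% | <0.1% | 64 tps | 0.5s | 131K | $0.75 | $1.20 |
| 230 | 213 | DeepSeek R1T Chimera | 980 | ±6 | 2.7K | 2.0% | <0.1% | 46 tps | 1.1s | 164K | $0.09 | $0.36 |
| 231 | 277 | GLM Z1 32B | 979 | ±7 | 3K | 1.9% | <0.1% | 18 tps | 9.3s | 33K | $0.09 | $0.11 |
| 232 | 200 | Claude Sonnet 3.5 | 976 | ±5 | 8.2K | 1.2% | 1.0% | 40 tps | 2.7s | 200K | $3.00 | $15.00 |
| 233 | 314 | DeepSeek-R1 0528 Qwen3 8B | 976 | ±6 | 3.5K | 3.2% | <0.1% | 45 tps | 2.4s | 128K | $0.05 | $0.09 |
| 234 | 213 | Claude Haiku 3.5 | 976 | ±3 | 15.2K | 1.5% | 0.8% | 40 tps | 2.8s | 200K | $0.80 | $4.00 |
| 235 | 211 | Gemini 1.5 Pro | 976 | ±4 | 6.7K | 1.8% | <0.1% | 15 tps | 0.0s | 2M | $0.78 | $3.13 |
| 236 | 314 | MAI-DS-R1 | 974 | ±4 | 5.1K | 2.9% | <0.1% | 73 tps | 3.2s | 64K | $0.10 | $0.40 |
| 237 | 246 | DeepSeek-R1 Distill Llama 70B | 973 | ±11 | 2.1K | 3.0% | 3.6% | 27 tps | 1.6s | 32K | $0.73 | $0.95 |
| 238 | 241 | OLMo 3 7B Think | 973 | ±6 | 2.6K | 2.4% | 4.2% | 77 tps | 0.4s | 66K | $0.12 | $0.20 |
| 239 | 179 | Amazon Nova Pro 1.0 | 971 | ±2 | 22.3K | 0.8% | 0.9% | 96 tps | 0.7s | 300K | $0.80 | $1.70 |
| 240 | 179 | Qwen 2.5 72B | 970 | ±4 | 5.3K | 1.0% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 |
| 241 | 233 | Llama 3.1 70B Instruct Turbo | 969 | ±3 | 16.5K | 0.9% | <0.1% | 110 tps | 0.8s | 128K | $0.88 | $0.88 |
| 242 | 179 | Llama 3.1 70B Instruct | 969 | ±15 | 845 | 1.7% | 6.3% | 30 tps | 0.8s | 128K | $0.17 | $0.22 |
| 243 | 241 | Arcee AI Blitz | 967 | ±3 | 14.9K | 0.7% | <0.1% | 6 tps | N/A | 33K | $0.45 | $0.75 |
| 244 | 179 | Inception Mercury | 967 | ±3 | 24.4K | 0.9% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 |
| 245 | 253 | Magistral Medium | 965 | ±6 | 2K | 3.8% | <0.1% | 95 tps | 0.5s | 41K | $2.00 | $5.00 |
| 246 | 186 | Gemma 3n E4B | 965 | ±4 | 22.2K | 1.2% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 |
| 247 | 177 | Mistral Small 3.1 24B Instruct | 962 | ±4 | 10.6K | 1.3% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 |
| 248 | 270 | AFM 4.5B Preview | 961 | ±5 | 9.7K | 2.1% | <0.1% | 32 tps | 0.0s | 66K | $0 | $0 |
| 249 | 241 | Claude Haiku 3 | 960 | ±3 | 10.8K | 1.0% | 0.4% | 62 tps | 0.5s | 200K | $0.25 | $1.25 |
| 250 | 186 | Jamba 1.6 Large | 957 | ±3 | 15.3K | 0.9% | 2.0% | 59 tps | 1.2s | 256K | $1.33 | $5.33 |
| 251 | 182 | Gemini 2.5 Flash Preview Thinking | 956 | ±10 | 960 | 1.5% | <0.1% | 26 tps | 1.8s | 1M | $0.15 | $1.76 |
| 252 | 219 | Grok 3 Mini Beta | 956 | ±3 | 7.7K | 0.6% | <0.1% | 75 tps | 0.5s | 131K | $0.45 | $2.25 |
| 253 | 194 | Mistral Small 3 24B Instruct | 955 | ±4 | 7.2K | 0.9% | 2.6% | 77 tps | 0.6s | 33K | $0.07 | $0.14 |
| 254 | 194 | Llama 3.2 11B Instruct | 955 | ±4 | 9.2K | 1.0% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 |
| 255 | 153 | OpenAI o1 | 954 | ±4 | 3.9K | 1.4% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 256 | 277 | Dobby Unhinged Llama 3.3 70B | 951 | ±5 | 3.9K | 1.4% | <0.1% | 41 tps | 0.4s | 128K | $0.90 | $0.90 |
| 257 | 179 | GLM 4.7 Flash | 949 | ±6 | 4K | 1.9% | 5.8% | 61 tps | 2.8s | 128K | $0.07 | $0.39 |
| 258 | 229 | ERNIE 4.5 21B A3B Thinking | 947 | ±9 | 1.7K | 2.3% | 1.8% | 87 tps | 1.5s | 120K | $0.07 | $0.28 |
| 259 | 194 | Magistral Small 2506 | 945 | ±4 | 15.6K | 1.0% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 |
| 260 | 292 | GPT-5 Nano Minimal | 944 | ±9 | 2.4K | 4.6% | <0.1% | 88 tps | 0.8s | 400K | $0.05 | $0.40 |
| 261 | 201 | Qwen 2.5 7B Turbo | 944 | ±9 | 2.4K | 1.5% | 0.5% | 125 tps | 0.4s | 131K | $0.30 | $0.30 |
| 262 | 194 | Llama 3.3 70B | 943 | ±4 | 9.1K | 2.7% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 263 | 253 | Grok 4 (Low Reasoning) | 942 | ±5 | 1.5K | 1.3% | <0.1% | 18 tps | 9.5s | 256K | $0 | $0 |
| 264 | 314 | GLM 4 32B 0414 128K | 942 | ±13 | 685 | 4.9% | <0.1% | 48 tps | 3.5s | 131K | $0.10 | $0.10 |
| 265 | 209 | Qwen 2.5 14B Instruct | 941 | ±6 | 8.3K | 1.4% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 |
| 266 | 201 | Llama 3 8B | 941 | ±3 | 12.1K | 0.9% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 267 | 302 | OLMo 3 32B Think | 939 | ±12 | 1.4K | 1.4% | <0.1% | 84 tps | 0.6s | 66K | $0.15 | $0.50 |
| 268 | 270 | Arcee AI Virtuoso-Medium | 939 | ±3 | 10.1K | 0.7% | <0.1% | 3 tps | N/A | 131K | $0.50 | $0.80 |
| 269 | 302 | Cogito V2 Preview Llama 109B | 938 | ±14 | 745 | 2.0% | <0.1% | 84 tps | 1.4s | 33K | $0.18 | $0.59 |
| 270 | 265 | Llama 3.1 405B Instruct Turbo | 938 | ±4 | 8K | 0.9% | <0.1% | 26 tps | 0.8s | 131K | $3.50 | $3.50 |
| 271 | 186 | GLM 4.6V Flash | 937 | ±4 | 5.8K | 2.0% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 |
| 272 | 292 | Arcee AI Spotlight | 937 | ±3 | 18.1K | 0.9% | <0.1% | 121 tps | 0.4s | 131K | $0.18 | $0.18 |
| 273 | 179 | Switchpoint Router | 936 | ±4 | 8.2K | 1.0% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 274 | 201 | ERNIE 4.5 VL 424B A47B | 936 | ±12 | 805 | 5.3% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 |
| 275 | 201 | Mistral Small 24B Instruct | 935 | ±4 | 6.3K | 1.2% | 1.5% | 84 tps | 0.4s | 33K | $0.80 | $0.80 |
| 276 | 194 | Llama 3 70B | 934 | ±7 | 1.7K | 1.1% | 4.5% | 21 tps | 1.7s | 8K | $1.08 | $1.38 |
| 277 | 214 | Krutrim 2 | 933 | ±3 | 10.7K | 0.7% | 12.5% | 33 tps | 2.1s | 128K | $1.00 | $1.00 |
| 278 | 277 | Cypher Alpha | 932 | ±4 | 3K | 2.9% | <0.1% | 4 tps | N/A | 1M | $0 | $0 |
| 279 | 277 | Grok 2 | 931 | ±3 | 9.2K | 0.9% | <0.1% | 55 tps | 1.1s | 131K | $2.00 | $10.00 |
| 280 | 186 | Grok 3 Mini | 931 | ±4 | 23K | 1.4% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 281 | 214 | C4AI Aya Expanse 32B | 931 | ±3 | 17.1K | 0.8% | 1.5% | 43 tps | 0.5s | 128K | $0.50 | $1.50 |
| 282 | 292 | NVIDIA Llama 3.1 Nemotron Ultra 253B v1 | 930 | ±5 | 8.5K | 0.9% | <0.1% | 40 tps | 0.8s | 128K | $0.30 | $0.90 |
| 283 | 225 | GPT-3.5 Turbo 16k | 929 | ±3 | 9.2K | 0.6% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |
| 284 | 225 | Command R 7B | 928 | ±3 | 12.7K | 1.1% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 |
| 285 | 209 | Llama 3.3 Swallow 70B Instruct | 928 | ±3 | 9.7K | 1.1% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 286 | 209 | GPT-3.5 Turbo | 928 | ±4 | 5.8K | 0.6% | 1.3% | 74 tps | 0.9s | 16K | $0.75 | $1.75 |
| 287 | 186 | Grok 3 Mini Fast | 927 | ±2 | 21.1K | 1.5% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 288 | 201 | Gemma 3 27B IT | 927 | ±3 | 8.9K | 0.9% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 |
| 289 | 314 | Cogito V2 Preview Llama 405B | 927 | ±8 | 800 | 2.4% | <0.1% | 23 tps | 2.1s | 33K | $1.17 | $1.17 |
| 290 | 175 | OpenAI o3-mini-low | 925 | ±3 | 18K | 2.6% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 291 | 214 | Qwen 2.5 7B | 924 | ±4 | 6.9K | 1.4% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 |
| 292 | 292 | Exaone 3.5 32B Instruct | 924 | ±4 | 2.8K | 1.7% | <0.1% | 17 tps | N/A | 33K | $0 | $0 |
| 293 | 222 | Jamba 1.5 Large | 924 | ±4 | 13K | 1.0% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 294 | 214 | Moonshot V1 128k | 922 | ±5 | 4.4K | 0.9% | 1.4% | 54 tps | 1.5s | 131K | $2.00 | $5.00 |
| 295 | 209 | Seed 1.6 Flash 250715 | 921 | ±5 | 2.5K | 1.9% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 296 | 177 | OpenAI o3-mini | 921 | ±3 | 19.4K | 2.3% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 297 | 324 | Solar Pro 3 | 921 | ±10 | 1.7K | 2.6% | 2.0% | 99 tps | 1.3s | 131K | $0.15 | $0.60 |
| 298 | 214 | Gemma 3 12B | 920 | ±4 | 9K | 1.3% | 4.2% | 73 tps | 0.8s | 131K | $0.05 | $0.12 |
| 299 | 277 | Claude Sonnet 3 | 920 | ±4 | 3.9K | 0.8% | <0.1% | 35 tps | 1.0s | 200K | $3.00 | $15.00 |
| 300 | 214 | Llama 3.3 70B Instruct Turbo | 919 | ±8 | 3.7K | 1.5% | 2.0% | 78 tps | 1.0s | 131K | $0.88 | $0.88 |
| 301 | 277 | Wikipedia | 917 | ±2 | 67K | 1.7% | <0.1% | 47 tps | 2.1s | 32K | $0 | $0 |
| 302 | 302 | Yi Large | 917 | ±3 | 8.6K | 0.3% | <0.1% | 34 tps | N/A | 33K | $1.50 | $1.50 |
| 303 | 201 | Devstral Small | 917 | ±4 | 5.1K | 1.3% | 2.4% | 180 tps | 0.6s | 131K | $0.10 | $0.30 |
| 304 | 233 | Cogito V2 Preview Llama 70B | 915 | ±14 | 940 | 3.1% | <0.1% | 44 tps | 1.6s | 33K | $0.44 | $0.44 |
| 305 | 324 | Qwen 2 72B Instruct | 914 | ±3 | 4.7K | 0.7% | <0.1% | 3 tps | N/A | 33K | $0.90 | $0.90 |
| 306 | 277 | Jamba 1.7 Mini | 913 | ±8 | 2.5K | 2.6% | <0.1% | 84 tps | 0.9s | 256K | $0.20 | $0.40 |
| 307 | 292 | AFM 4.5B | 913 | ±3 | 9.1K | 2.6% | <0.1% | 81 tps | 0.3s | 66K | $0.05 | $0.20 |
| 308 | 225 | Command R | 913 | ±3 | 9.6K | 1.5% | 5.8% | 54 tps | 0.6s | 128K | $0.30 | $0.99 |
| 309 | 324 | Typhoon 2 70B Instruct | 911 | ±4 | 8.1K | 0.9% | <0.1% | 19 tps | 0.1s | 8K | $0.88 | $0.88 |
| 310 | 225 | Open Mistral Nemo | 910 | ±5 | 6.7K | 1.1% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 |
| 311 | 240 | Mistral Nemo | 910 | ±5 | 3.8K | 0.5% | <0.1% | 112 tps | 0.4s | 131K | $0.07 | $0.13 |
| 312 | 235 | Gemma 3 4B | 909 | ±4 | 11.3K | 1.0% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 |
| 313 | 194 | INTELLECT-3 | 909 | ±14 | 570 | 2.6% | 1.5% | 114 tps | 0.6s | 131K | $0.20 | $1.10 |
| 314 | 235 | Hermes 2 Pro Llama 3 8B | 908 | ±3 | 8.3K | 0.7% | <0.1% | 76 tps | 1.0s | 131K | $0.08 | $0.09 |
| 315 | 339 | Refuel LLM 2 Small | 907 | ±3 | 17.9K | 1.0% | <0.1% | 116 tps | 0.5s | 8K | $0.20 | $0.20 |
| 316 | 222 | Sky T1 32B Preview | 905 | ±4 | 10.5K | 1.1% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 317 | 229 | Ministral 8B | 905 | ±4 | 6.8K | 1.4% | 1.4% | 177 tps | 0.4s | 128K | $0.14 | $0.14 |
| 318 | 235 | Command R+ | 904 | ±5 | 6.3K | 1.3% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 |
| 319 | 240 | GPT-3.5 Turbo Instruct | 903 | ±3 | 7K | 0.6% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 |
| 320 | 331 | Marin 8B Instruct | 902 | ±9 | 935 | 2.6% | <0.1% | 170 tps | 0.2s | 131K | $0.18 | $0.18 |
| 321 | 331 | Hermes 2 Mixtral 8x7B DPO | 902 | ±3 | 5.6K | 0.6% | <0.1% | 1 tps | N/A | 33K | $0.60 | $0.60 |
| 322 | 235 | Mixtral 8x7B | 899 | ±6 | 4.7K | 1.3% | 2.2% | 142 tps | 0.6s | 33K | $0.23 | $0.23 |
| 323 | 229 | Moonshot V1 Auto | 898 | ±8 | 3.8K | 1.0% | 1.2% | 54 tps | 1.5s | 8K | $2.00 | $5.00 |
| 324 | 302 | OLMo 2 0425 1B Instruct | 897 | ±6 | 2.5K | 2.9% | <0.1% | 68 tps | 0.0s | 4K | $0 | $0 |
| 325 | 246 | Hermes 4 70B | 897 | ±10 | 1K | 1.9% | 1.1% | 67 tps | 0.6s | 131K | $0.12 | $0.39 |
| 326 | 274 | DeepSeek-R1 Distill Qwen 32B | 896 | ±10 | 1.4K | 2.1% | 6.2% | 22 tps | 1.8s | 131K | $0.37 | $0.39 |
| 327 | 235 | GLM 4 32B | 894 | ±3 | 10.8K | 1.2% | 2.6% | 40 tps | 1.6s | 33K | $0.14 | $0.14 |
| 328 | 246 | WizardLM-2 8x22B | 894 | ±3 | 9.3K | 0.8% | 11.6% | 11 tps | 2.5s | 66K | $0.77 | $0.77 |
| 329 | 240 | Moonshot V1 8k | 893 | ±6 | 3.8K | 0.9% | 1.0% | 55 tps | 1.5s | 8K | $0.20 | $2.00 |
| 330 | 229 | Krutrim Spectre V2 | 893 | ±3 | 6.8K | 1.1% | <0.1% | 33 tps | 3.1s | 4K | $0.19 | $0.19 |
| 331 | 256 | Gemma 3 1B | 892 | ±5 | 6.1K | 1.9% | 0.6% | 176 tps | 1.0s | 33K | $0.06 | $0.10 |
| 332 | 240 | Hermes 4 405B FP8 | 891 | ±11 | 1.6K | 3.0% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 |
| 333 | 201 | GPT-4o mini | 891 | ±4 | 7K | 1.9% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 334 | 361 | Zenith | 891 | ±12 | 1.2K | 3.2% | <0.1% | 36 tps | 1.8s | 131K | $0 | $0 |
| 335 | 241 | GPT-5 Mini High | 890 | ±7 | 3.7K | 3.3% | <0.1% | 33 tps | 3.9s | 400K | $0.25 | $2.00 |
| 336 | 253 | Gemma 2 27B | 889 | ±4 | 6.5K | 1.2% | 1.4% | 44 tps | 1.4s | 8K | $0.80 | $0.80 |
| 337 | 229 | Magistral Medium 2509 | 889 | ±5 | 4.4K | 3.3% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 338 | 222 | Rnj-1 Instruct | 888 | ±8 | 2.4K | 4.5% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 |
| 339 | 246 | Ministral 3B | 886 | ±5 | 7.5K | 1.4% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 |
| 340 | 246 | Mixtral 8x22B | 886 | ±6 | 4.7K | 1.1% | 1.2% | 140 tps | 0.6s | 64K | $2.00 | $6.00 |
| 341 | 292 | Kimi K2 Instruct | 885 | ±9 | 815 | 1.8% | <0.1% | 31 tps | 0.9s | 131K | $0.66 | $2.34 |
| 342 | 314 | Weather | 879 | ±5 | 5.5K | 1.9% | <0.1% | 36 tps | 1.1s | 32K | $0 | $0 |
| 343 | 265 | Ministral 3B 2512 | 876 | ±9 | 1.3K | 3.4% | 2.8% | 339 tps | 0.6s | 131K | $0.10 | $0.10 |
| 344 | 361 | Venice Uncensored | 876 | ±10 | 1.1K | 3.2% | <0.1% | 59 tps | 3.9s | 33K | $0 | $0 |
| 345 | 240 | Moonshot V1 32k | 872 | ±4 | 3.8K | 0.8% | 1.4% | 53 tps | 1.4s | 33K | $1.00 | $3.00 |
| 346 | 246 | Mixtral 8x22B Instruct | 871 | ±4 | 5.4K | 1.6% | 1.8% | 142 tps | 0.7s | 66K | $0.45 | $0.45 |
| 347 | 229 | Llama 3.1 8B | 868 | ±9 | 1.2K | 2.4% | 1.9% | 61 tps | 1.0s | 8K | $0.07 | $0.09 |
| 348 | 256 | Phi 4 | 867 | ±3 | 7.7K | 1.2% | 5.1% | 28 tps | 1.3s | 128K | $0.10 | $0.32 |
| 349 | 361 | Meridian | 860 | ±11 | 1.4K | 3.4% | <0.1% | 92 tps | 1.2s | 131K | $0 | $0 |
| 350 | 374 | GLM 4.1V 9B Thinking | 859 | ±3 | 4.3K | 0.8% | <0.1% | 69 tps | 1.3s | 66K | $0.04 | $0.14 |
| 351 | 260 | Mistral Small | 859 | ±6 | 4.4K | 1.6% | 1.7% | 142 tps | 0.6s | 32K | $0.43 | $1.30 |
| 352 | 265 | Inflection 3 Productivity | 858 | ±4 | 6.5K | 0.8% | 0.6% | 50 tps | 3.2s | 8K | $2.50 | $10.00 |
| 353 | 378 | Command Light | 858 | ±5 | 4.4K | 1.6% | <0.1% | 23 tps | N/A | 4K | $0.10 | $0.20 |
| 354 | 361 | Cogito V2 Preview Llama 109B MoE | 856 | ±12 | 915 | 2.1% | <0.1% | 66 tps | 1.6s | 33K | $0.18 | $0.59 |
| 355 | 256 | Solar Mini 250422 | 856 | ±8 | 3.2K | 2.0% | 1.8% | 90 tps | 1.7s | 33K | $0.15 | $0.15 |
| 356 | 260 | Open Mistral 7B | 849 | ±6 | 4.8K | 1.2% | 0.7% | 176 tps | 0.4s | 33K | $0.25 | $0.25 |
| 357 | 265 | Mixtral-8x7B Instruct v0.1 | 849 | ±5 | 5K | 1.5% | 1.3% | 54 tps | 0.4s | 33K | $0.60 | $0.60 |
| 358 | 214 | Qwen 2.5 VL 32B Instruct | 847 | ±16 | 840 | 4.0% | 6.3% | 43 tps | 3.2s | 128K | $0.35 | $0.62 |
| 359 | 302 | YouTube | 844 | ±8 | 6.3K | 2.9% | <0.1% | 34 tps | 2.7s | 32K | $0.99 | $0.99 |
| 360 | 256 | Mixtral 8x7B Instruct | 844 | ±5 | 5.7K | 1.3% | 0.2% | 79 tps | 0.7s | 33K | $0.23 | $0.31 |
| 361 | 284 | MiniMax M1 | 843 | ±5 | 2.7K | 1.6% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 |
| 362 | 369 | Magistral Medium 2507 | 843 | ±9 | 1.1K | 4.7% | <0.1% | 86 tps | 0.7s | 41K | $2.00 | $5.00 |
| 363 | 399 | Gemini 1.5 Flash 8B | 842 | ±12 | 685 | 2.8% | <0.1% | 11 tps | 0.0s | 1M | $0.02 | $0.10 |
| 364 | 260 | Apriel 1.6 15B Thinker | 840 | ±12 | 855 | 2.3% | 2.6% | 92 tps | 0.4s | 131K | $0 | $0 |
| 365 | 214 | OpenAI o3-mini-high | 839 | ±6 | 3.6K | 1.6% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 366 | 240 | Llama 3.3 70B Instruct | 839 | ±18 | 675 | 2.2% | 5.3% | 28 tps | 1.3s | 128K | $0.38 | $0.55 |
| 367 | 271 | Inflection 3 Pi | 837 | ±4 | 6.8K | 0.9% | 1.1% | 33 tps | 3.4s | 8K | $2.50 | $10.00 |
| 368 | 392 | Mythalion 13B | 835 | ±3 | 5.2K | 0.8% | <0.1% | 63 tps | 0.5s | 4K | $0.56 | $1.13 |
| 369 | 406 | DeepSeek-R1 Distill Qwen 14B | 833 | ±15 | 975 | 3.0% | <0.1% | 44 tps | 1.7s | 64K | $0.63 | $0.63 |
| 370 | 399 | Phi 4 Multimodal Instruct | 832 | ±5 | 5.9K | 2.3% | <0.1% | 17 tps | 1.4s | 128K | $0.03 | $0.05 |
| 371 | 361 | Magistral Small 2507 | 830 | ±8 | 1.4K | 4.1% | <0.1% | 148 tps | 0.4s | 41K | $0.50 | $1.50 |
| 372 | 260 | Hermes 4 405B Reasoning FP8 | 826 | ±4 | 5K | 3.5% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 |
| 373 | 392 | MiMo 7B RL | 825 | ±3 | 6.7K | 1.1% | <0.1% | 31 tps | 0.4s | 32K | $0.49 | $0.49 |
| 374 | 392 | Mistral Nemo 12B Celeste V1.9 | 825 | ±4 | 5.6K | 1.1% | <0.1% | 6 tps | 10.2s | 8K | $0.80 | $1.20 |
| 375 | 274 | DeepHermes 3 Mistral 24B Preview | 824 | ±9 | 995 | 3.4% | 2.5% | 50 tps | 1.0s | 33K | $0.06 | $0.25 |
| 376 | 392 | Phi 3 Mini 128k Instruct | 821 | ±9 | 1K | 2.4% | <0.1% | 16 tps | 0.5s | 128K | $0.12 | $0.31 |
| 377 | 406 | Command | 821 | ±6 | 3.1K | 1.6% | <0.1% | 25 tps | N/A | 4K | $0.83 | $1.33 |
| 378 | 406 | Solar Pro 250422 | 820 | ±7 | 1.2K | 1.2% | <0.1% | 13 tps | 0.6s | 33K | $0 | $0 |
| 379 | 271 | Mistral Large | 818 | ±6 | 4.3K | 1.6% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 |
| 380 | 265 | LFM2 2.6B | 816 | ±8 | 1.7K | 5.2% | 6.7% | 184 tps | 0.4s | 33K | $0.01 | $0.02 |
| 381 | 271 | Hermes 3 405B Instruct | 814 | ±4 | 5.5K | 1.2% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 |
| 382 | 265 | Qwen 2.5 VL 72B Instruct | 808 | ±12 | 1.3K | 5.5% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 383 | 412 | ArliAI QwQ 32B Arliai RpR V1 | 799 | ±16 | 655 | 5.1% | <0.1% | 34 tps | 1.8s | 33K | $0.02 | $0.07 |
| 384 | 281 | MythoMax L2 13B | 796 | ±4 | 9K | 1.6% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 |
| 385 | 412 | Dolphin 3.0 R1 Mistral 24B | 795 | ±7 | 2.4K | 1.4% | <0.1% | 13 tps | 0.1s | 33K | $0.03 | $0.09 |
| 386 | 412 | Dolphin 2.9.2 Mixtral 8x22B | 791 | ±4 | 5.4K | 0.8% | <0.1% | 20 tps | 1.5s | 16K | $0.90 | $0.90 |
| 387 | 412 | Shisa V2 Llama 3.3 70B | 790 | ±9 | 1.1K | 2.2% | <0.1% | 8 tps | 2.0s | 33K | $0.03 | $0.09 |
| 388 | 399 | Phi 3 Medium 128k Instruct | 789 | ±10 | 895 | 2.2% | <0.1% | 40 tps | 1.3s | 128K | $0.58 | $0.84 |
| 389 | 265 | Magistral Small 2509 | 786 | ±11 | 1.7K | 3.7% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 390 | 281 | Gemma 2 9B | 779 | ±7 | 1.3K | 3.6% | <0.1% | 100 tps | 0.4s | 8K | $0.09 | $0.09 |
| 391 | 274 | C4AI Aya Expanse 8B | 778 | ±15 | 885 | 2.7% | 0.9% | 61 tps | 0.4s | 8K | $0.50 | $1.50 |
| 392 | 285 | Hunyuan A13B Instruct | 777 | ±5 | 4.1K | 2.4% | 2.3% | 67 tps | 2.0s | 33K | $0.01 | $0.01 |
| 393 | 285 | Phi 4 Mini Instruct | 774 | ±5 | 3.7K | 2.0% | 7.4% | 40 tps | 1.1s | 128K | $0.07 | $0.30 |
| 394 | 274 | MiniMax M2-her | 767 | ±11 | 880 | 1.7% | <0.1% | 108 tps | 0.7s | 205K | $0.30 | $1.20 |
| 395 | 274 | LFM2 8B A1B | 757 | ±9 | 1.9K | 4.8% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 |
| 396 | 274 | Pixtral 12B | 748 | ±15 | 2.3K | 5.1% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 397 | 424 | ERNIE 4.5 0.3B | 744 | ±10 | 1.1K | 5.7% | <0.1% | 85 tps | 2.2s | 120K | $0 | $0 |
| 398 | 421 | Llema 7B | 742 | ±4 | 4.3K | 1.7% | <0.1% | 1 tps | 15.0s | 4K | $0.80 | $1.20 |
| 399 | 281 | Goliath 120B | 741 | ±9 | 2.6K | 2.1% | 2.7% | 21 tps | 2.2s | 6K | $6.56 | $9.38 |
| 400 | 287 | Phi 4 Reasoning | 707 | ±10 | 1K | 2.8% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 |
| 401 | 419 | Kimi Dev 72B | 694 | ±19 | 880 | 2.2% | <0.1% | 17 tps | 13.5s | 131K | $0.12 | $0.47 |
| 402 | 274 | Moonshot V1 128k Vision | 649 | ±31 | 515 | 6.4% | 3.1% | 44 tps | 3.8s | 131K | $2.00 | $5.00 |
| 403 | 289 | UI-TARS 1.5 7B | 646 | ±17 | 915 | 4.2% | 4.0% | 75 tps | 0.9s | 128K | $0.10 | $0.20 |
| 404 | 430 | OpenHands LM 32B V0.1 | 633 | ±7 | 1.8K | 0.8% | <0.1% | 11 tps | N/A | 16K | $2.60 | $3.40 |
| 405 | 291 | Phi 4 Mini Reasoning | 626 | ±8 | 4.4K | 4.9% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |
| 406 | 288 | Qwen 2.5 VL 3B Instruct | 624 | ±17 | 2.2K | 7.8% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |
| 407 | 434 | QwQ 32B RpR v1 | 596 | ±9 | 1.9K | 4.1% | <0.1% | 34 tps | 3.3s | 33K | $0.02 | $0.07 |
| 408 | 430 | Phi 3.5 Mini 128k Instruct | 569 | ±13 | 745 | 2.6% | <0.1% | 14 tps | 0.7s | 128K | $0.10 | $0.10 |
| 409 | 291 | LFM2.5 1.2B Thinking | 508 | ±26 | 545 | 5.2% | 2.6% | 258 tps | 0.4s | 33K | $0 | $0 |
| 410 | 439 | Mistral Nemo 12B Inferor v0.0 | 344 | ±7 | 2.2K | 0.9% | <0.1% | 83 tps | 0.8s | 16K | $0.80 | $1.20 |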