Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

546
Qwen 2.5 VL 3B Instruct
726
GLM 4.6V Flash
739
Grok 3 Mini
744
Pixtral 12B
745
Llama 3.3 70B
778
Pixtral Large
781
Gemma 3n E4B
790
Magistral Small 2509
797
Magistral Medium 2509
804
Qwen 2.5 VL 72B Instruct
818
Qwen3 30B A3B Thinking 2507
821
Nemotron 3 Nano (Thinking)
826
GPT-4o mini
827
Qwen3 8B
830
DeepSeek V3.2 Speciale

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1288Qwen 2.5 VL 3B Instruct546±311K9.1%3.0%44 tps2.5s128K$0.21$0.63
2186GLM 4.6V Flash726±295752.5%3.7%64 tps2.1s128K$0.04$0.40
3186Grok 3 Mini739±201.4K2.5%1.2%43 tps0.5s131K$0.30$0.50
4274Pixtral 12B744±339409.6%2.2%101 tps1.2s131K$0.08$0.08
5194Llama 3.3 70B745±305254.5%0.3%500 tps0.5s8K$0.48$0.66
6165Pixtral Large778±181.1K7.2%2.5%57 tps1.3s128K$1.50$4.50
7186Gemma 3n E4B781±275354.5%2.0%30 tps0.5s8K$0.01$0.02
8265Magistral Small 2509790±295306.2%2.7%116 tps0.6s131K$0.50$1.50
9229Magistral Medium 2509797±175705.0%4.0%58 tps0.9s131K$2.00$5.00
10265Qwen 2.5 VL 72B Instruct804±297156.5%5.3%25 tps3.7s128K$1.01$2.79
11148Qwen3 30B A3B Thinking 2507818±187953.0%0.5%124 tps1.2s131K$0.16$1.70
1286Nemotron 3 Nano (Thinking)821±235404.4%2.0%200 tps0.5s256K$0$0
13201GPT-4o mini826±186456.5%2.1%71 tps1.7s128K$0.15$0.60
14161Qwen3 8B827±366004.0%2.4%61 tps1.4s41K$0.02$0.07
15133DeepSeek V3.2 Speciale830±285403.6%6.0%43 tps1.4s131K$0.84$1.52
16186Grok 3 Mini Fast832±231K3.3%1.6%44 tps0.5s131K$0.60$4.00
1784GPT-5 Mini Minimal835±131.1K6.6%1.2%63 tps1.4s400K$0.25$2.00
18175OpenAI o3-mini-low838±211.7K2.6%0.7%139 tps1.5s200K$1.10$4.40
19157GPT-5 Nano843±142K6.0%3.2%113 tps20.9s400K$0.05$0.40
20157Qwen3 Next 80B A3B Thinking846±151.3K3.9%0.6%175 tps1.3s256K$0.21$2.26
21121Qwen3 32B Fast853±131.8K4.2%11.6%30 tps3.1s41K$0.10$0.25
22133Qwen3 14B856±247452.6%1.7%109 tps0.8s41K$0.04$0.15
23170Kimi K2 0711858±248904.3%1.6%29 tps1.3s131K$0.72$2.60
24139GLM 4.6V865±248902.7%6.4%21 tps1.8s128K$0.38$0.90
25129Qwen3 Max Thinking866±141.5K1.7%13.5%32 tps2.3s256K$1.20$6.00
26214OpenAI o3-mini-high868±131.4K3.8%2.4%231 tps10.5s200K$1.10$4.40
27246DeepSeek-R1 Distill Llama 70B869±285904.8%3.6%27 tps1.6s32K$0.73$0.95
28179GLM 4.7 Flash874±248552.8%5.8%61 tps2.8s128K$0.07$0.39
29160Llama 4 Scout875±152.3K2.9%0.6%88 tps5.1s131K$0.18$0.46
30165Qwen3 4B878±237353.9%1.9%94 tps1.5s128K$0.01$0.01
31161Llama 4 Maverick883±123.6K4.4%1.2%88 tps2.4s1M$0.23$0.83
32148DeepSeek-R1894±121.1K3.8%0.8%133 tps0.6s64K$0.91$3.07
3386Qwen3 235B A22B898±237253.3%5.3%71 tps0.9s41K$0.23$0.63
34139Seed 2.0 Mini (Medium)900±305153.7%11.9%33 tps1.7s256K$0.15$0.60
35177OpenAI o3-mini901±122.5K3.1%0.8%143 tps3.3s200K$1.10$4.40
3662MiniMax M2905±181.4K3.5%2.2%39 tps2.3s205K$0.21$0.85
37121QwQ 32B910±131.8K3.1%5.4%41 tps2.1s16K$0.43$0.56
38143Gemini 2.0 Flash Lite917±112.5K6.7%<0.1%42 tps0.5s1M$0.08$0.30
39126Qwen3 30B A3B918±158653.4%5.1%163 tps1.0s41K$0.06$0.21
40119ERNIE 4.5 300B A47B918±171.6K2.7%4.7%23 tps2.3s123K$0.28$1.10
41126Qwen3 VL 235B A22B Thinking922±187454.5%4.3%47 tps3.0s127K$0.47$3.31
42133Kimi K2 0905922±218054.2%4.0%30 tps1.4s262K$0.63$2.39
43124Kimi K2 0905 Turbo925±131.5K4.7%0.7%373 tps0.5s262K$1.70$6.50
4495Kimi K2 Thinking926±177402.0%4.2%61 tps5.9s262K$0.24$1.03
45143Seed 1.6 250615928±216355.2%3.1%46 tps2.2s256K$0.25$2.00
4665Mistral Large 3947±201.3K4.4%2.1%51 tps1.0s256K$0.50$1.50
47101Qwen3.5 35B A3B949±275302.8%2.1%116 tps2.1s256K$0.63$1.13
48101gpt-oss-20b950±181.4K4.7%0.5%216 tps0.5s131K$0.06$0.26
4981OpenAI o3-pro951±191.6K3.4%5.2%22 tps70.8s200K$20.00$80.00
5079Qwen3 Max Thinking Preview952±201.1K2.2%3.1%40 tps2.1s256K$1.20$6.00
51129DeepSeek V3.1 Thinking955±141.1K2.2%7.1%18 tps1.8s131K$0.23$0.75
52139OpenAI o4-mini956±161.4K2.8%1.4%97 tps7.0s128K$1.10$4.40
53148OpenAI o4-mini-high958±112.2K3.1%1.9%117 tps15.9s200K$1.10$4.40
54126DeepSeek V3960±73.4K2.3%0.9%69 tps1.1s64K$0.59$1.49
55153OpenAI o1960±112.3K2.4%4.2%92 tps5.5s200K$15.00$60.00
56111LongCat Flash Chat963±255604.3%0.8%85 tps0.9s131K$0.14$0.68
57129Command A965±83K2.9%2.2%42 tps0.8s256K$2.00$7.33
58148OpenAI o3970±101.2K3.1%0.9%85 tps6.8s128K$7.33$29.33
59133GPT-4.1 nano974±112.3K3.4%0.6%175 tps0.5s1M$0.10$0.40
60143Gemini 2.0 Flash974±191.9K4.7%<0.1%76 tps0.5s1M$0.14$0.56
61113Kimi K2 Fast975±104.8K2.3%0.8%365 tps0.5s131K$1.00$3.00
6271Seed 1.8 251228983±103K2.6%3.7%41 tps2.1s256K$0.25$2.00
6365GLM 4.6991±159453.6%5.4%39 tps1.5s200K$0.42$1.66
64106DeepSeek V3.1 Terminus Thinking1000±147452.6%5.9%27 tps1.8s131K$0.56$1.68
65133DeepSeek-R1 05281001±151.1K4.1%1.3%93 tps0.5s64K$1.60$3.67
6693Qwen Max1009±112.7K2.7%1.5%49 tps1.5s33K$1.60$6.40
6795DeepSeek-R1 Turbo1009±206603.6%2.6%29 tps1.8s64K$2.85$4.75
68124Qwen3 235B A22B Thinking 25071010±167453.2%2.5%53 tps1.6s131K$0.59$5.70
69106DeepSeek V3 03241013±112.1K3.1%5.8%12 tps2.7s164K$0.38$0.93
7071GPT-5 Mini1017±103.1K5.2%2.6%66 tps14.2s400K$0.25$2.00
7156MiniMax M2.1 Lightning1019±248301.8%1.7%52 tps2.1s205K$0.30$2.40
7244Grok 4.1 Fast Reasoning1020±73.7K3.0%1.5%58 tps7.3s2M$0.20$0.50
7356DeepSeek V3.2 Thinking1021±131.9K1.8%9.0%30 tps2.6s131K$0.28$0.42
7468Qwen Plus (Aug'24)1023±92.4K2.9%1.4%53 tps1.3s30K$0.40$1.20
7552Grok 4 Fast Non-Reasoning1030±171.5K4.1%1.5%93 tps0.6s2M$0.27$0.67
7679MiniMax M2.5 Lightning1031±208201.8%1.5%51 tps2.0s205K$0.60$2.40
77106Grok 31034±82.8K2.8%1.5%53 tps0.6s1M$3.67$18.33
7829Qwen3 VL 235B A22B Instruct1036±161.3K4.2%3.1%75 tps1.9s129K$0.37$1.81
7986DeepSeek V3.1 Chat1038±139752.5%2.8%21 tps1.6s131K$0.38$1.00
8095DeepSeek V3.2 Exp Thinking1038±176553.7%7.2%26 tps3.0s131K$0.28$0.42
8193DeepSeek V3 0324 Turbo1038±92.2K1.8%6.3%12 tps2.4s164K$0.73$1.79
82113GLM 4.51038±129153.2%3.7%46 tps1.4s131K$0.43$1.63
8337Qwen3 Omni 30B A3B Thinking1040±207502.0%3.7%67 tps1.2s66K$0.97$1.79
84101Gemini 2.5 Flash Lite1042±67.8K4.3%1.3%210 tps0.7s1M$0.10$0.40
8586Amazon Nova 2 Lite1042±236902.1%1.0%137 tps0.6s300K$0.35$2.95
86113Mistral Medium1043±111.1K3.1%1.8%48 tps0.6s33K$1.48$4.55
87118GPT-4.1 mini1045±83.4K2.5%1.1%67 tps0.9s1M$0.34$1.60
8848Grok 4 Fast Reasoning1049±102.3K3.6%2.1%102 tps3.1s2M$0.30$0.75
89106Claude Sonnet 3.5 v21055±227703.8%<0.1%46 tps1.4s200K$3.00$15.00
9040DeepSeek V3.21056±151.4K1.4%1.4%83 tps5.1s131K$0.43$1.09
9181Qwen3.5 27B1056±176652.9%3.7%55 tps2.6s256K$0.30$2.40
9244DeepSeek V3.1 Terminus Chat1056±139552.1%3.4%27 tps1.5s131K$0.86$1.80
9326Grok 4.1 Fast Non-Reasoning1058±192K4.1%0.9%101 tps0.5s2M$0.20$0.50
9471Gemini 3.1 Flash Lite Preview1060±221.2K3.3%1.0%8 tps1.2s1M$0.25$1.50
9552Qwen3.5 122B A17B1063±149801.5%1.5%82 tps1.4s256K$0.40$3.20
9671Qwen3.5 397B A17B1067±151.3K2.2%4.3%57 tps1.4s256K$0.52$3.00
9748Step 3.5 Flash1067±206302.3%2.2%109 tps0.6s256K$0.05$0.15
9856DeepSeek V3.1 Turbo1070±121.3K2.6%0.9%173 tps1.3s164K$2.00$3.75
99113Gemini 2.5 Flash Lite Thinking1071±102.3K3.2%1.0%118 tps4.4s1M$0.03$0.13
10068GLM 4.71071±121.9K2.1%5.8%40 tps1.5s200K$0.77$1.73
10165DeepSeek V3.2 Exp Chat1072±127552.6%2.6%29 tps1.5s131K$0.27$0.39
10244Kimi K2 Thinking Turbo1072±171.3K2.2%2.0%75 tps1.4s262K$1.15$8.00
10348gpt-oss-120b1074±63K2.6%0.7%213 tps0.5s131K$0.11$0.50
10433Qwen3 Next 80B A3B Instruct1083±161.5K2.6%0.6%84 tps1.1s256K$0.20$1.42
10533Qwen3 30B A3B Instruct 25071084±82.3K3.2%1.2%55 tps1.3s131K$0.13$0.72
10671DeepSeek V3.11085±146903.5%0.8%197 tps0.4s164K$0.55$1.60
10795Gemini 2.5 Flash Lite Thinking Preview 09251086±83.4K2.7%1.5%152 tps3.0s1M$0.10$0.40
10833Kimi K2.51090±134.3K2.1%6.5%33 tps1.7s262K$0.34$2.57
10937Kimi K2.5 Instant1093±121.4K2.7%2.9%32 tps3.0s262K$0.50$3.00
11068Grok 41102±77.8K3.3%3.9%29 tps11.1s256K$3.00$15.00
11142Qwen3 Max Instruct Preview1103±132.2K2.0%1.1%31 tps1.7s256K$1.43$6.61
11295Gemini 2.5 Flash1104±79.8K2.7%1.3%2 tps3.7s1M$0.30$2.50
11329Nova Experimental Chat 12-101110±257101.4%2.4%84 tps12.9s98K$0$0
11481GPT-4o1113±82.2K3.6%1.0%49 tps2.4s128K$3.71$12.57
11552GPT-51115±85.4K3.7%3.1%78 tps23.1s400K$1.25$9.67
11626GPT-5 (High)1115±73.7K3.6%4.5%81 tps35.9s400K$1.25$10.00
11748Claude Sonnet 4 (Thinking)1116±75.3K4.2%1.5%52 tps1.5s200K$3.00$13.67
11860Gemini 2.5 Flash Preview 09251124±93.4K3.4%1.2%5 tps0.9s1M$0.13$0.97
11940Qwen3 235B A22B Instruct 25071126±82.1K2.5%6.8%13 tps1.9s262K$0.13$0.52
12022GLM 51132±121.7K1.4%3.4%36 tps2.7s200K$0.72$2.55
12156Gemini 3.1 Flash Lite Preview Thinking1132±121.7K3.9%1.7%75 tps4.7s1M$0.25$1.50
12271Gemini 2.5 Flash Lite Preview 09251134±93.5K3.4%1.2%209 tps0.7s1M$0.25$0.35
12386Claude Sonnet 41138±77.4K2.3%1.8%49 tps1.3s200K$3.00$15.00
12442GPT-5.2 (Extra High) 1146±103.5K2.3%13.2%17 tps20.5s400K$1.75$14.00
12533Grok 4.20 Multi Agent Beta1158±236651.5%1.2%56 tps8.8s2M$2.00$6.00
12660MiniMax M2.11159±112.1K2.1%2.1%66 tps2.6s205K$0.30$1.20
12771Gemini 2.5 Flash Thinking1161±46.8K3.0%2.2%88 tps6.4s1M$0.30$2.50
12852Claude Haiku 4.51163±85.3K4.1%1.1%100 tps0.9s200K$1.00$5.00
12962GPT-5.1 Instant1168±74.3K2.5%1.3%50 tps1.9s400K$1.25$10.00
13044Gemini 2.5 Pro1184±513.8K2.9%2.3%45 tps2.6s1M$1.25$10.00
13132Gemini 2.5 Pro High1191±66.1K3.2%1.5%48 tps2.3s1M$1.25$10.00
13226Claude Haiku 4.5 (Extended Thinking)1196±63K2.6%1.4%115 tps0.7s200K$1.00$5.00
13317Grok 4.20 Beta Reasoning1209±219152.1%1.1%77 tps4.5s2M$2.00$5.50
13437Claude Sonnet 4.51230±64.7K4.1%1.4%41 tps1.3s200K$1.80$9.00
13517GPT-5.2 (High)1236±811.2K1.8%6.7%18 tps16.3s400K$1.75$14.00
13613GPT-5.3 Instant1240±144.5K1.9%0.9%63 tps0.8s400K$1.75$14.00
13722GPT-5 Chat1243±711.4K2.5%1.3%95 tps0.9s400K$1.25$10.00
13817Gemini 3 Flash Preview1248±123.9K2.2%1.3%138 tps1.4s1M$0.50$3.00
1398GPT-5.1 (High)1252±86.4K1.9%3.2%76 tps6.9s400K$1.25$10.00
14016GPT-5.21254±114.5K1.8%4.1%18 tps2.7s400K$1.75$14.00
14117Claude Opus 4.51259±74.1K2.9%1.5%45 tps1.5s200K$5.00$25.00
14210GPT-5.2 Instant1262±76.9K1.8%1.7%52 tps2.0s400K$1.75$14.00
14314Gemini 3 Pro (Low)1262±66.1K2.2%2.4%51 tps3.5s1M$2.00$12.00
14414Gemini 3 Flash Preview Thinking1272±97.9K1.8%1.6%3 tps6.2s1M$0.50$3.00
1457Claude Opus 4.5 (Thinking)1272±711.3K2.0%1.8%49 tps1.4s200K$5.00$25.00
14610Gemini 3 Pro1285±917.6K1.5%2.1%50 tps3.6s1M$2.00$12.00
1478GPT-5.11295±74.3K2.2%2.3%71 tps1.4s400K$1.42$11.33
1486Gemini 3.1 Pro1317±87.9K1.6%3.5%35 tps4.1s1M$2.00$12.00
14910Claude Sonnet 4.5 (Thinking)1319±46.7K2.4%1.9%44 tps1.1s200K$3.00$15.00
1504Claude Sonnet 4.61345±114.7K1.3%1.6%47 tps1.2s200K$3.00$15.00
1515Claude Sonnet 4.6 (Thinking)1377±94.9K1.3%4.7%57 tps1.1s200K$3.00$15.00
1522Claude Opus 4.61420±116.5K1.1%2.1%48 tps1.7s200K$5.00$25.00
1531Claude Opus 4.6 (Thinking)1440±95.1K1.2%2.5%56 tps1.6s200K$5.00$25.00
1542GPT-5.41446±141.7K1.7%2.6%55 tps0.8s1M$2.50$15.00
Show Less