Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1446
GPT-5.4
1440
Claude Opus 4.6 (Thinking)
1420
Claude Opus 4.6
1415
GPT-5.4 (High)
1377
Claude Sonnet 4.6 (Thinking)
1346
GPT-5.1 (Medium)
1345
Claude Sonnet 4.6
1319
Claude Sonnet 4.5 (Thinking)
1317
Gemini 3.1 Pro
1295
GPT-5.1
1285
Gemini 3 Pro
1272
Claude Opus 4.5 (Thinking)
1272
Gemini 3 Flash Preview Thinking
1262
Gemini 3 Pro (Low)
1262
GPT-5.2 Instant

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
12GPT-5.41446±141.7K1.7%2.6%55 tps0.8s1M$2.50$15.00
21Claude Opus 4.6 (Thinking)1440±95.1K1.2%2.5%56 tps1.6s200K$5.00$25.00
32Claude Opus 4.61420±116.5K1.1%2.1%48 tps1.7s200K$5.00$25.00
44GPT-5.4 (High)1415±122.1K1.4%4.6%68 tps7.9s1M$2.50$15.00
55Claude Sonnet 4.6 (Thinking)1377±94.9K1.3%4.7%57 tps1.1s200K$3.00$15.00
68GPT-5.1 (Medium)1346±147851.9%<0.1%86 tps3.8s400K$0.83$6.67
74Claude Sonnet 4.61345±114.7K1.3%1.6%47 tps1.2s200K$3.00$15.00
810Claude Sonnet 4.5 (Thinking)1319±46.7K2.4%1.9%44 tps1.1s200K$3.00$15.00
96Gemini 3.1 Pro1317±87.9K1.6%3.5%35 tps4.1s1M$2.00$12.00
108GPT-5.11295±74.3K2.2%2.3%71 tps1.4s400K$1.42$11.33
1110Gemini 3 Pro1285±917.6K1.5%2.1%50 tps3.6s1M$2.00$12.00
127Claude Opus 4.5 (Thinking)1272±711.3K2.0%1.8%49 tps1.4s200K$5.00$25.00
1314Gemini 3 Flash Preview Thinking1272±97.9K1.8%1.6%3 tps6.2s1M$0.50$3.00
1414Gemini 3 Pro (Low)1262±66.1K2.2%2.4%51 tps3.5s1M$2.00$12.00
1510GPT-5.2 Instant1262±76.9K1.8%1.7%52 tps2.0s400K$1.75$14.00
1617Claude Opus 4.51259±74.1K2.9%1.5%45 tps1.5s200K$5.00$25.00
1716GPT-5.21254±114.5K1.8%4.1%18 tps2.7s400K$1.75$14.00
188GPT-5.1 (High)1252±86.4K1.9%3.2%76 tps6.9s400K$1.25$10.00
1917Gemini 3 Flash Preview1248±123.9K2.2%1.3%138 tps1.4s1M$0.50$3.00
2022GPT-5 Chat1243±711.4K2.5%1.3%95 tps0.9s400K$1.25$10.00
2113GPT-5.3 Instant1240±144.5K1.9%0.9%63 tps0.8s400K$1.75$14.00
2217GPT-5.2 (High)1236±811.2K1.8%6.7%18 tps16.3s400K$1.75$14.00
2337Claude Sonnet 4.51230±64.7K4.1%1.4%41 tps1.3s200K$1.80$9.00
2443Gemini 2.5 Flash Thinking Preview 09251211±72.6K2.9%<0.1%111 tps4.7s1M$0.30$2.50
2517Grok 4.20 Beta Reasoning1209±219152.1%1.1%77 tps4.5s2M$2.00$5.50
2626Claude Haiku 4.5 (Extended Thinking)1196±63K2.6%1.4%115 tps0.7s200K$1.00$5.00
2732Gemini 2.5 Pro High1191±66.1K3.2%1.5%48 tps2.3s1M$1.25$10.00
2856Gemini 2.5 Pro Low1186±72.9K3.3%<0.1%89 tps2.4s1M$1.25$10.00
2944Gemini 2.5 Pro1184±513.8K2.9%2.3%45 tps2.6s1M$1.25$10.00
3062GPT-5.1 Instant1168±74.3K2.5%1.3%50 tps1.9s400K$1.25$10.00
3152Claude Haiku 4.51163±85.3K4.1%1.1%100 tps0.9s200K$1.00$5.00
3271Gemini 2.5 Flash Thinking1161±46.8K3.0%2.2%88 tps6.4s1M$0.30$2.50
3360MiniMax M2.11159±112.1K2.1%2.1%66 tps2.6s205K$0.30$1.20
3433Grok 4.20 Multi Agent Beta1158±236651.5%1.2%56 tps8.8s2M$2.00$6.00
3521Claude Opus 4 (Thinking)1156±187153.4%<0.1%28 tps1.3s200K$15.00$75.00
3642GPT-5.2 (Extra High) 1146±103.5K2.3%13.2%17 tps20.5s400K$1.75$14.00
3786Claude Sonnet 41138±77.4K2.3%1.8%49 tps1.3s200K$3.00$15.00
3871Gemini 2.5 Flash Lite Preview 09251134±93.5K3.4%1.2%209 tps0.7s1M$0.25$0.35
3956Gemini 3.1 Flash Lite Preview Thinking1132±121.7K3.9%1.7%75 tps4.7s1M$0.25$1.50
4022GLM 51132±121.7K1.4%3.4%36 tps2.7s200K$0.72$2.55
4140Qwen3 235B A22B Instruct 25071126±82.1K2.5%6.8%13 tps1.9s262K$0.13$0.52
4260Gemini 2.5 Flash Preview 09251124±93.4K3.4%1.2%5 tps0.9s1M$0.13$0.97
4356Claude Opus 4.1 (Thinking)1119±102.1K5.0%<0.1%20 tps3.9s200K$15.00$75.00
4448Claude Sonnet 4 (Thinking)1116±75.3K4.2%1.5%52 tps1.5s200K$3.00$13.67
4526GPT-5 (High)1115±73.7K3.6%4.5%81 tps35.9s400K$1.25$10.00
4652GPT-51115±85.4K3.7%3.1%78 tps23.1s400K$1.25$9.67
4781GPT-4o1113±82.2K3.6%1.0%49 tps2.4s128K$3.71$12.57
4829Nova Experimental Chat 12-101110±257101.4%2.4%84 tps12.9s98K$0$0
4977GPT-4.5 Preview1106±165202.8%<0.1%36 tps3.0s200K$75.00$150.00
5095Gemini 2.5 Flash1104±79.8K2.7%1.3%2 tps3.7s1M$0.30$2.50
5142Qwen3 Max Instruct Preview1103±132.2K2.0%1.1%31 tps1.7s256K$1.43$6.61
5268Grok 41102±77.8K3.3%3.9%29 tps11.1s256K$3.00$15.00
5319Mistral Medium 3.11101±101.9K3.5%<0.1%77 tps0.7s128K$0.40$2.00
5421Claude Opus 41101±82.3K3.2%<0.1%25 tps1.5s200K$15.00$75.00
5537Kimi K2.5 Instant1093±121.4K2.7%2.9%32 tps3.0s262K$0.50$3.00
5633Kimi K2.51090±134.3K2.1%6.5%33 tps1.7s262K$0.34$2.57
5716Nova Experimental Chat 11-101089±237502.6%0.4%84 tps8.9s98K$0$0
5895Gemini 2.5 Flash Lite Thinking Preview 09251086±83.4K2.7%1.5%152 tps3.0s1M$0.10$0.40
5948OpenAI o1-mini1086±82.2K2.2%<0.1%118 tpsN/A128K$1.13$4.51
6071DeepSeek V3.11085±146903.5%0.8%197 tps0.4s164K$0.55$1.60
6177Claude Opus 4.11084±62.4K4.3%3.0%17 tps3.7s200K$15.00$75.00
6233Qwen3 30B A3B Instruct 25071084±82.3K3.2%1.2%55 tps1.3s131K$0.13$0.72
6333Qwen3 Next 80B A3B Instruct1083±161.5K2.6%0.6%84 tps1.1s256K$0.20$1.42
6484Claude Sonnet 3.7 (Thinking)1078±83.8K4.7%<0.1%41 tps2.6s200K$3.00$15.00
6548gpt-oss-120b1074±63K2.6%0.7%213 tps0.5s131K$0.11$0.50
6644Kimi K2 Thinking Turbo1072±171.3K2.2%2.0%75 tps1.4s262K$1.15$8.00
6765DeepSeek V3.2 Exp Chat1072±127552.6%2.6%29 tps1.5s131K$0.27$0.39
6868GLM 4.71071±121.9K2.1%5.8%40 tps1.5s200K$0.77$1.73
69113Gemini 2.5 Flash Lite Thinking1071±102.3K3.2%1.0%118 tps4.4s1M$0.03$0.13
7056DeepSeek V3.1 Turbo1070±121.3K2.6%0.9%173 tps1.3s164K$2.00$3.75
7148Step 3.5 Flash1067±206302.3%2.2%109 tps0.6s256K$0.05$0.15
7271Qwen3.5 397B A17B1067±151.3K2.2%4.3%57 tps1.4s256K$0.52$3.00
7352Qwen3.5 122B A17B1063±149801.5%1.5%82 tps1.4s256K$0.40$3.20
7471Gemini 3.1 Flash Lite Preview1060±221.2K3.3%1.0%8 tps1.2s1M$0.25$1.50
7526Grok 4.1 Fast Non-Reasoning1058±192K4.1%0.9%101 tps0.5s2M$0.20$0.50
7644DeepSeek V3.1 Terminus Chat1056±139552.1%3.4%27 tps1.5s131K$0.86$1.80
7781Qwen3.5 27B1056±176652.9%3.7%55 tps2.6s256K$0.30$2.40
7840DeepSeek V3.21056±151.4K1.4%1.4%83 tps5.1s131K$0.43$1.09
79106Claude Sonnet 3.5 v21055±227703.8%<0.1%46 tps1.4s200K$3.00$15.00
80100Gemini 2.5 Flash Preview1050±185652.6%<0.1%138 tps6.9s1M$0.15$0.60
8148Grok 4 Fast Reasoning1049±102.3K3.6%2.1%102 tps3.1s2M$0.30$0.75
82118GPT-4.1 mini1045±83.4K2.5%1.1%67 tps0.9s1M$0.34$1.60
83113Mistral Medium1043±111.1K3.1%1.8%48 tps0.6s33K$1.48$4.55
8486Amazon Nova 2 Lite1042±236902.1%1.0%137 tps0.6s300K$0.35$2.95
85101Gemini 2.5 Flash Lite1042±67.8K4.3%1.3%210 tps0.7s1M$0.10$0.40
8637Qwen3 Omni 30B A3B Thinking1040±207502.0%3.7%67 tps1.2s66K$0.97$1.79
87113GLM 4.51038±129153.2%3.7%46 tps1.4s131K$0.43$1.63
8893DeepSeek V3 0324 Turbo1038±92.2K1.8%6.3%12 tps2.4s164K$0.73$1.79
8995DeepSeek V3.2 Exp Thinking1038±176553.7%7.2%26 tps3.0s131K$0.28$0.42
9086DeepSeek V3.1 Chat1038±139752.5%2.8%21 tps1.6s131K$0.38$1.00
9129Qwen3 VL 235B A22B Instruct1036±161.3K4.2%3.1%75 tps1.9s129K$0.37$1.81
92106Grok 31034±82.8K2.8%1.5%53 tps0.6s1M$3.67$18.33
9379MiniMax M2.5 Lightning1031±208201.8%1.5%51 tps2.0s205K$0.60$2.40
9452Grok 4 Fast Non-Reasoning1030±171.5K4.1%1.5%93 tps0.6s2M$0.27$0.67
95111Claude Sonnet 3.71027±94K4.9%<0.1%39 tps1.6s200K$3.00$15.00
9668Qwen Plus (Aug'24)1023±92.4K2.9%1.4%53 tps1.3s30K$0.40$1.20
9756DeepSeek V3.2 Thinking1021±131.9K1.8%9.0%30 tps2.6s131K$0.28$0.42
9844Grok 4.1 Fast Reasoning1020±73.7K3.0%1.5%58 tps7.3s2M$0.20$0.50
9956MiniMax M2.1 Lightning1019±248301.8%1.7%52 tps2.1s205K$0.30$2.40
10071GPT-5 Mini1017±103.1K5.2%2.6%66 tps14.2s400K$0.25$2.00
101106DeepSeek V3 03241013±112.1K3.1%5.8%12 tps2.7s164K$0.38$0.93
102124Qwen3 235B A22B Thinking 25071010±167453.2%2.5%53 tps1.6s131K$0.59$5.70
10395DeepSeek-R1 Turbo1009±206603.6%2.6%29 tps1.8s64K$2.85$4.75
10493Qwen Max1009±112.7K2.7%1.5%49 tps1.5s33K$1.60$6.40
10580GPT-5 (Minimal)1003±101.9K5.4%<0.1%67 tps1.4s400K$1.25$10.00
106133DeepSeek-R1 05281001±151.1K4.1%1.3%93 tps0.5s64K$1.60$3.67
107106DeepSeek V3.1 Terminus Thinking1000±147452.6%5.9%27 tps1.8s131K$0.56$1.68
10865GLM 4.6991±159453.6%5.4%39 tps1.5s200K$0.42$1.66
109147GLM 4.5 Air991±161.1K2.7%<0.1%22 tps1.4s131K$0.10$0.38
11037Nova Experimental Chat 10-20984±205555.1%<0.1%30 tps0.5s98K$0$0
11171Seed 1.8 251228983±103K2.6%3.7%41 tps2.1s256K$0.25$2.00
112113Kimi K2 Fast975±104.8K2.3%0.8%365 tps0.5s131K$1.00$3.00
113143Gemini 2.0 Flash974±191.9K4.7%<0.1%76 tps0.5s1M$0.14$0.56
114133GPT-4.1 nano974±112.3K3.4%0.6%175 tps0.5s1M$0.10$0.40
115148OpenAI o3970±101.2K3.1%0.9%85 tps6.8s128K$7.33$29.33
116129Command A965±83K2.9%2.2%42 tps0.8s256K$2.00$7.33
117111LongCat Flash Chat963±255604.3%0.8%85 tps0.9s131K$0.14$0.68
118111Solar Pro 3 (Reasoning)960±235051.0%3.2%118 tps1.2s131K$0.15$0.60
119153OpenAI o1960±112.3K2.4%4.2%92 tps5.5s200K$15.00$60.00
120126DeepSeek V3960±73.4K2.3%0.9%69 tps1.1s64K$0.59$1.49
121148OpenAI o4-mini-high958±112.2K3.1%1.9%117 tps15.9s200K$1.10$4.40
122139OpenAI o4-mini956±161.4K2.8%1.4%97 tps7.0s128K$1.10$4.40
123129DeepSeek V3.1 Thinking955±141.1K2.2%7.1%18 tps1.8s131K$0.23$0.75
12479Qwen3 Max Thinking Preview952±201.1K2.2%3.1%40 tps2.1s256K$1.20$6.00
12581OpenAI o3-pro951±191.6K3.4%5.2%22 tps70.8s200K$20.00$80.00
126101gpt-oss-20b950±181.4K4.7%0.5%216 tps0.5s131K$0.06$0.26
127101Qwen3.5 35B A3B949±275302.8%2.1%116 tps2.1s256K$0.63$1.13
12865Mistral Large 3947±201.3K4.4%2.1%51 tps1.0s256K$0.50$1.50
129143Seed 1.6 250615928±216355.2%3.1%46 tps2.2s256K$0.25$2.00
130241GPT-5 Mini High926±158804.9%<0.1%33 tps3.9s400K$0.25$2.00
13195Kimi K2 Thinking926±177402.0%4.2%61 tps5.9s262K$0.24$1.03
132124Kimi K2 0905 Turbo925±131.5K4.7%0.7%373 tps0.5s262K$1.70$6.50
133211Gemini 1.5 Pro925±207504.5%<0.1%15 tps0.0s2M$0.78$3.13
134133Kimi K2 0905922±218054.2%4.0%30 tps1.4s262K$0.63$2.39
135126Qwen3 VL 235B A22B Thinking922±187454.5%4.3%47 tps3.0s127K$0.47$3.31
136133Solar Pro 2 250710918±141.4K3.4%<0.1%9 tpsN/A66K$0.50$0.50
137119ERNIE 4.5 300B A47B918±171.6K2.7%4.7%23 tps2.3s123K$0.28$1.10
138126Qwen3 30B A3B918±158653.4%5.1%163 tps1.0s41K$0.06$0.21
139143Gemini 2.0 Flash Lite917±112.5K6.7%<0.1%42 tps0.5s1M$0.08$0.30
140121QwQ 32B910±131.8K3.1%5.4%41 tps2.1s16K$0.43$0.56
14162MiniMax M2905±181.4K3.5%2.2%39 tps2.3s205K$0.21$0.85
142213Claude Haiku 3.5901±102.7K5.9%0.8%40 tps2.8s200K$0.80$4.00
143177OpenAI o3-mini901±122.5K3.1%0.8%143 tps3.3s200K$1.10$4.40
144139Seed 2.0 Mini (Medium)900±305153.7%11.9%33 tps1.7s256K$0.15$0.60
14586Qwen3 235B A22B898±237253.3%5.3%71 tps0.9s41K$0.23$0.63
146302YouTube896±162.4K5.6%<0.1%34 tps2.7s32K$0.99$0.99
147148DeepSeek-R1894±121.1K3.8%0.8%133 tps0.6s64K$0.91$3.07
148161Llama 4 Maverick883±123.6K4.4%1.2%88 tps2.4s1M$0.23$0.83
149165Qwen3 4B878±237353.9%1.9%94 tps1.5s128K$0.01$0.01
150270Solar Pro 2 250710 (Reasoning)878±255053.8%<0.1%9 tpsN/A66K$0.50$0.50
151160Llama 4 Scout875±152.3K2.9%0.6%88 tps5.1s131K$0.18$0.46
152179GLM 4.7 Flash874±248552.8%5.8%61 tps2.8s128K$0.07$0.39
153246DeepSeek-R1 Distill Llama 70B869±285904.8%3.6%27 tps1.6s32K$0.73$0.95
154214OpenAI o3-mini-high868±131.4K3.8%2.4%231 tps10.5s200K$1.10$4.40
155129Qwen3 Max Thinking866±141.5K1.7%13.5%32 tps2.3s256K$1.20$6.00
156139GLM 4.6V865±248902.7%6.4%21 tps1.8s128K$0.38$0.90
157170Kimi K2 0711858±248904.3%1.6%29 tps1.3s131K$0.72$2.60
158133Qwen3 14B856±247452.6%1.7%109 tps0.8s41K$0.04$0.15
159121Qwen3 32B Fast853±131.8K4.2%11.6%30 tps3.1s41K$0.10$0.25
160157Qwen3 Next 80B A3B Thinking846±151.3K3.9%0.6%175 tps1.3s256K$0.21$2.26
161157GPT-5 Nano843±142K6.0%3.2%113 tps20.9s400K$0.05$0.40
162175OpenAI o3-mini-low838±211.7K2.6%0.7%139 tps1.5s200K$1.10$4.40
16384GPT-5 Mini Minimal835±131.1K6.6%1.2%63 tps1.4s400K$0.25$2.00
164186Grok 3 Mini Fast832±231K3.3%1.6%44 tps0.5s131K$0.60$4.00
165133DeepSeek V3.2 Speciale830±285403.6%6.0%43 tps1.4s131K$0.84$1.52
166161Qwen3 8B827±366004.0%2.4%61 tps1.4s41K$0.02$0.07
167201GPT-4o mini826±186456.5%2.1%71 tps1.7s128K$0.15$0.60
16886Nemotron 3 Nano (Thinking)821±235404.4%2.0%200 tps0.5s256K$0$0
169148Qwen3 30B A3B Thinking 2507818±187953.0%0.5%124 tps1.2s131K$0.16$1.70
170213DeepSeek R1T Chimera805±255103.8%<0.1%46 tps1.1s164K$0.09$0.36
171265Qwen 2.5 VL 72B Instruct804±297156.5%5.3%25 tps3.7s128K$1.01$2.79
172147Arcee AI Maestro Reasoning802±174804.0%<0.1%85 tps0.3s131K$0.90$3.30
173229Magistral Medium 2509797±175705.0%4.0%58 tps0.9s131K$2.00$5.00
174108GPT-5 Mini Low796±168657.0%<0.1%69 tps3.2s400K$0.25$2.00
175265Magistral Small 2509790±295306.2%2.7%116 tps0.6s131K$0.50$1.50
176186Gemma 3n E4B781±275354.5%2.0%30 tps0.5s8K$0.01$0.02
177159Qwen Turbo779±158602.8%<0.1%53 tps1.1s1M$0.05$0.20
178165Pixtral Large778±181.1K7.2%2.5%57 tps1.3s128K$1.50$4.50
179194Llama 3.3 70B745±305254.5%0.3%500 tps0.5s8K$0.48$0.66
180274Pixtral 12B744±339409.6%2.2%101 tps1.2s131K$0.08$0.08
181186Grok 3 Mini739±201.4K2.5%1.2%43 tps0.5s131K$0.30$0.50
182186GLM 4.6V Flash726±295752.5%3.7%64 tps2.1s128K$0.04$0.40
183182Fauna Fox715±284904.9%<0.1%194 tps0.3s128K$0.04$0.15
184277Wikipedia682±345509.8%<0.1%47 tps2.1s32K$0$0
185292GPT-5 Nano Minimal621±185907.8%<0.1%88 tps0.8s400K$0.05$0.40
186314DeepSeek-R1 0528 Qwen3 8B552±235655.8%<0.1%45 tps2.4s128K$0.05$0.09
187288Qwen 2.5 VL 3B Instruct546±311K9.1%3.0%44 tps2.5s128K$0.21$0.63
188292AFM 4.5B305±465855.6%<0.1%81 tps0.3s66K$0.05$0.20
Show Less