Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1485
Claude Opus 4.6 (Thinking)
1483
Claude Opus 4.6
1466
GPT-5.4 (High)
1344
Gemini 3.1 Pro
1295
Claude Sonnet 4.6
1280
GPT-5.1 (Medium)
1274
GPT-5.2 Instant
1274
GPT-5.1 (High)
1270
Gemini 3 Pro (Low)
1268
GPT-5.1
1265
Gemini 3 Pro
1264
Nova Experimental Chat 11-10
1260
Claude Sonnet 4.5 (Thinking)
1248
Claude Opus 4.5 (Thinking)
1235
Claude Sonnet 4.6 (Thinking)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1485±112.3K1.3%2.5%56 tps1.6s200K$5.00$25.00
22Claude Opus 4.61483±113.2K0.9%2.1%48 tps1.7s200K$5.00$25.00
34GPT-5.4 (High)1466±195152.8%4.6%68 tps7.9s1M$2.50$15.00
46Gemini 3.1 Pro1344±162.8K2.4%3.5%35 tps4.1s1M$2.00$12.00
54Claude Sonnet 4.61295±161.8K0.8%1.6%47 tps1.2s200K$3.00$15.00
68GPT-5.1 (Medium)1280±127401.3%<0.1%86 tps3.8s400K$0.83$6.67
710GPT-5.2 Instant1274±83.9K2.2%1.7%52 tps2.0s400K$1.75$14.00
88GPT-5.1 (High)1274±64.2K2.5%3.2%76 tps6.9s400K$1.25$10.00
914Gemini 3 Pro (Low)1270±123.5K2.4%2.4%51 tps3.5s1M$2.00$12.00
108GPT-5.11268±63.3K1.5%2.3%71 tps1.4s400K$1.42$11.33
1110Gemini 3 Pro1265±512K1.3%2.1%50 tps3.6s1M$2.00$12.00
1216Nova Experimental Chat 11-101264±131.1K1.4%0.4%84 tps8.9s98K$0$0
1310Claude Sonnet 4.5 (Thinking)1260±48.3K1.8%1.9%44 tps1.1s200K$3.00$15.00
147Claude Opus 4.5 (Thinking)1248±69.3K2.3%1.8%49 tps1.4s200K$5.00$25.00
155Claude Sonnet 4.6 (Thinking)1235±181.7K2.0%4.7%57 tps1.1s200K$3.00$15.00
1617Claude Opus 4.51228±92.9K1.8%1.5%45 tps1.5s200K$5.00$25.00
1722GPT-5 Chat1220±610.4K2.1%1.3%95 tps0.9s400K$1.25$10.00
1816GPT-5.21213±122.7K2.2%4.1%18 tps2.7s400K$1.75$14.00
1914Gemini 3 Flash Preview Thinking1212±84.2K1.8%1.6%3 tps6.2s1M$0.50$3.00
2019Mistral Medium 3.11198±102.2K2.8%<0.1%77 tps0.7s128K$0.40$2.00
2117Gemini 3 Flash Preview1198±142K2.0%1.3%138 tps1.4s1M$0.50$3.00
2232Gemini 2.5 Pro High1189±64.5K2.5%1.5%48 tps2.3s1M$1.25$10.00
2322GLM 51184±227952.5%3.4%36 tps2.7s200K$0.72$2.55
2417GPT-5.2 (High)1175±106.4K1.7%6.7%18 tps16.3s400K$1.75$14.00
2542GPT-5.2 (Extra High) 1168±122.4K1.8%13.2%17 tps20.5s400K$1.75$14.00
2637Nova Experimental Chat 10-201162±91.1K3.1%<0.1%30 tps0.5s98K$0$0
2756Gemini 2.5 Pro Low1162±82.4K3.3%<0.1%89 tps2.4s1M$1.25$10.00
2813GPT-5.3 Instant1160±191.5K1.9%0.9%63 tps0.8s400K$1.75$14.00
2984Claude Sonnet 3.7 (Thinking)1159±71.9K5.3%<0.1%41 tps2.6s200K$3.00$15.00
3044Gemini 2.5 Pro1158±57.8K3.3%2.3%45 tps2.6s1M$1.25$10.00
3152GPT-51157±75.3K2.9%3.1%78 tps23.1s400K$1.25$9.67
3237Claude Sonnet 4.51156±55.1K2.8%1.4%41 tps1.3s200K$1.80$9.00
3371Gemini 2.5 Flash Thinking1148±72.6K4.2%2.2%88 tps6.4s1M$0.30$2.50
3448Claude Sonnet 4 (Thinking)1144±64.2K3.5%1.5%52 tps1.5s200K$3.00$13.67
3533Qwen3 Next 80B A3B Instruct1142±91.8K2.5%0.6%84 tps1.1s256K$0.20$1.42
3680GPT-5 (Minimal)1141±111.9K3.0%<0.1%67 tps1.4s400K$1.25$10.00
37111Claude Sonnet 3.71140±92K5.0%<0.1%39 tps1.6s200K$3.00$15.00
3843Gemini 2.5 Flash Thinking Preview 09251138±72.3K3.4%<0.1%111 tps4.7s1M$0.30$2.50
3929Nova Experimental Chat 12-101135±197202.0%2.4%84 tps12.9s98K$0$0
4060Gemini 2.5 Flash Preview 09251131±102.1K2.7%1.2%5 tps0.9s1M$0.13$0.97
4126Grok 4.1 Fast Non-Reasoning1130±152K4.3%0.9%101 tps0.5s2M$0.20$0.50
4233Qwen3 30B A3B Instruct 25071127±92.5K2.9%1.2%55 tps1.3s131K$0.13$0.72
4381OpenAI o3-pro1126±102.3K3.1%5.2%22 tps70.8s200K$20.00$80.00
4440Qwen3 235B A22B Instruct 25071118±112.5K2.1%6.8%13 tps1.9s262K$0.13$0.52
4568Qwen Plus (Aug'24)1115±81.9K2.6%1.4%53 tps1.3s30K$0.40$1.20
4626Claude Haiku 4.5 (Extended Thinking)1115±122.2K2.7%1.4%115 tps0.7s200K$1.00$5.00
4729Qwen3 VL 235B A22B Instruct1114±81.3K2.5%3.1%75 tps1.9s129K$0.37$1.81
4821Claude Opus 4 (Thinking)1114±87703.1%<0.1%28 tps1.3s200K$15.00$75.00
4971Qwen3.5 397B A17B1112±245802.5%4.3%57 tps1.4s256K$0.52$3.00
5026GPT-5 (High)1110±74.3K3.1%4.5%81 tps35.9s400K$1.25$10.00
5137Kimi K2.5 Instant1101±284951.0%2.9%32 tps3.0s262K$0.50$3.00
5295Gemini 2.5 Flash1098±94.8K2.7%1.3%2 tps3.7s1M$0.30$2.50
5368Grok 41093±55.7K4.0%3.9%29 tps11.1s256K$3.00$15.00
5448Grok 4 Fast Reasoning1090±112.1K3.1%2.1%102 tps3.1s2M$0.30$0.75
5521Claude Opus 41087±93.2K2.9%<0.1%25 tps1.5s200K$15.00$75.00
5656Gemini 3.1 Flash Lite Preview Thinking1083±324853.0%1.7%75 tps4.7s1M$0.25$1.50
5733Kimi K2.51083±161.7K3.2%6.5%33 tps1.7s262K$0.34$2.57
5848gpt-oss-120b1083±73.5K3.0%0.7%213 tps0.5s131K$0.11$0.50
5956Claude Opus 4.1 (Thinking)1083±62K4.1%<0.1%20 tps3.9s200K$15.00$75.00
6044Grok 4.1 Fast Reasoning1076±102.6K4.2%1.5%58 tps7.3s2M$0.20$0.50
6171GPT-5 Mini1075±92.1K4.3%2.6%66 tps14.2s400K$0.25$2.00
6262GPT-5.1 Instant1075±92.2K2.6%1.3%50 tps1.9s400K$1.25$10.00
6393DeepSeek V3 0324 Turbo1074±142.1K1.9%6.3%12 tps2.4s164K$0.73$1.79
6471Gemini 2.5 Flash Lite Preview 09251066±112.2K2.8%1.2%209 tps0.7s1M$0.25$0.35
6586Claude Sonnet 41066±85.3K2.5%1.8%49 tps1.3s200K$3.00$15.00
6642Qwen3 Max Instruct Preview1063±72.7K1.5%1.1%31 tps1.7s256K$1.43$6.61
6740DeepSeek V3.21063±161.1K2.5%1.4%83 tps5.1s131K$0.43$1.09
6852Claude Haiku 4.51057±63.4K3.4%1.1%100 tps0.9s200K$1.00$5.00
6952Grok 4 Fast Non-Reasoning1054±81.6K2.5%1.5%93 tps0.6s2M$0.27$0.67
7081GPT-4o1046±151.4K2.5%1.0%49 tps2.4s128K$3.71$12.57
7148OpenAI o1-mini1045±81.8K3.5%<0.1%118 tpsN/A128K$1.13$4.51
7260MiniMax M2.11044±121.7K2.8%2.1%66 tps2.6s205K$0.30$1.20
7395Gemini 2.5 Flash Lite Thinking Preview 09251044±91.7K3.5%1.5%152 tps3.0s1M$0.10$0.40
7462MiniMax M21043±91.8K4.2%2.2%39 tps2.3s205K$0.21$0.85
7565GLM 4.61041±111.6K2.9%5.4%39 tps1.5s200K$0.42$1.66
7644DeepSeek V3.1 Terminus Chat1037±91.3K2.2%3.4%27 tps1.5s131K$0.86$1.80
7756DeepSeek V3.2 Thinking1033±151.7K2.0%9.0%30 tps2.6s131K$0.28$0.42
7877Claude Opus 4.11032±112K2.7%3.0%17 tps3.7s200K$15.00$75.00
79129Qwen3 Max Thinking1029±316002.4%13.5%32 tps2.3s256K$1.20$6.00
8065Mistral Large 31026±221.1K4.1%2.1%51 tps1.0s256K$0.50$1.50
8195DeepSeek-R1 Turbo1021±134853.0%2.6%29 tps1.8s64K$2.85$4.75
8293Qwen Max1021±141.8K2.7%1.5%49 tps1.5s33K$1.60$6.40
8379Qwen3 Max Thinking Preview1020±101.2K2.4%3.1%40 tps2.1s256K$1.20$6.00
8486DeepSeek V3.1 Chat1018±121.1K3.1%2.8%21 tps1.6s131K$0.38$1.00
85133Kimi K2 09051014±138102.4%4.0%30 tps1.4s262K$0.63$2.39
86101Gemini 2.5 Flash Lite1014±95.3K3.9%1.3%210 tps0.7s1M$0.10$0.40
8786Amazon Nova 2 Lite1013±188154.7%1.0%137 tps0.6s300K$0.35$2.95
88108GPT-5 Mini Low1005±167353.9%<0.1%69 tps3.2s400K$0.25$2.00
89106Grok 31003±92K2.6%1.5%53 tps0.6s1M$3.67$18.33
90170Kimi K2 07111002±157203.4%1.6%29 tps1.3s131K$0.72$2.60
9195DeepSeek V3.2 Exp Thinking999±227751.9%7.2%26 tps3.0s131K$0.28$0.42
92113Gemini 2.5 Flash Lite Thinking996±112.5K3.7%1.0%118 tps4.4s1M$0.03$0.13
93124Kimi K2 0905 Turbo991±121.6K1.8%0.7%373 tps0.5s262K$1.70$6.50
94106DeepSeek V3 0324990±112.1K3.0%5.8%12 tps2.7s164K$0.38$0.93
95113Kimi K2 Fast989±107.4K2.2%0.8%365 tps0.5s131K$1.00$3.00
9686Qwen3 235B A22B989±197403.9%5.3%71 tps0.9s41K$0.23$0.63
97113Mistral Medium989±141.1K2.7%1.8%48 tps0.6s33K$1.48$4.55
9844Kimi K2 Thinking Turbo986±141.1K3.2%2.0%75 tps1.4s262K$1.15$8.00
9995Kimi K2 Thinking985±216203.1%4.2%61 tps5.9s262K$0.24$1.03
100106DeepSeek V3.1 Terminus Thinking979±136603.6%5.9%27 tps1.8s131K$0.56$1.68
101118GPT-4.1 mini976±132.7K1.8%1.1%67 tps0.9s1M$0.34$1.60
102113GLM 4.5969±191.3K3.5%3.7%46 tps1.4s131K$0.43$1.63
10384GPT-5 Mini Minimal968±177953.6%1.2%63 tps1.4s400K$0.25$2.00
104148OpenAI o3960±166002.4%0.9%85 tps6.8s128K$7.33$29.33
105129DeepSeek V3.1 Thinking958±141K2.4%7.1%18 tps1.8s131K$0.23$0.75
10656DeepSeek V3.1 Turbo957±148204.1%0.9%173 tps1.3s164K$2.00$3.75
107126DeepSeek V3956±121.7K2.5%0.9%69 tps1.1s64K$0.59$1.49
108148OpenAI o4-mini-high950±121.5K3.8%1.9%117 tps15.9s200K$1.10$4.40
10968GLM 4.7949±131.6K1.8%5.8%40 tps1.5s200K$0.77$1.73
11071Seed 1.8 251228949±181.2K2.7%3.7%41 tps2.1s256K$0.25$2.00
111129Command A948±121.9K3.1%2.2%42 tps0.8s256K$2.00$7.33
112139OpenAI o4-mini947±111.2K2.5%1.4%97 tps7.0s128K$1.10$4.40
113133Qwen3 14B943±168254.1%1.7%109 tps0.8s41K$0.04$0.15
114126Qwen3 VL 235B A22B Thinking939±159653.5%4.3%47 tps3.0s127K$0.47$3.31
115133Solar Pro 2 250710938±101.7K3.9%<0.1%9 tpsN/A66K$0.50$0.50
116148DeepSeek-R1936±217054.1%0.8%133 tps0.6s64K$0.91$3.07
117147GLM 4.5 Air932±161.6K3.5%<0.1%22 tps1.4s131K$0.10$0.38
118153OpenAI o1926±159152.1%4.2%92 tps5.5s200K$15.00$60.00
119101gpt-oss-20b912±121.5K4.1%0.5%216 tps0.5s131K$0.06$0.26
12065DeepSeek V3.2 Exp Chat909±111.3K2.6%2.6%29 tps1.5s131K$0.27$0.39
121124Qwen3 235B A22B Thinking 2507905±166952.1%2.5%53 tps1.6s131K$0.59$5.70
122133DeepSeek-R1 0528904±196404.5%1.3%93 tps0.5s64K$1.60$3.67
123143Gemini 2.0 Flash Lite902±149907.9%<0.1%42 tps0.5s1M$0.08$0.30
124157GPT-5 Nano901±91.2K5.0%3.2%113 tps20.9s400K$0.05$0.40
125133GPT-4.1 nano896±111.8K3.2%0.6%175 tps0.5s1M$0.10$0.40
126241GPT-5 Mini High891±168104.1%<0.1%33 tps3.9s400K$0.25$2.00
127157Qwen3 Next 80B A3B Thinking890±161.1K3.4%0.6%175 tps1.3s256K$0.21$2.26
128143Gemini 2.0 Flash885±168705.9%<0.1%76 tps0.5s1M$0.14$0.56
129119ERNIE 4.5 300B A47B873±121.1K3.4%4.7%23 tps2.3s123K$0.28$1.10
130121Qwen3 32B Fast863±132K4.5%11.6%30 tps3.1s41K$0.10$0.25
131159Qwen Turbo847±151.1K4.3%<0.1%53 tps1.1s1M$0.05$0.20
132133DeepSeek V3.2 Speciale842±384854.9%6.0%43 tps1.4s131K$0.84$1.52
133161Llama 4 Maverick838±102.4K4.3%1.2%88 tps2.4s1M$0.23$0.83
134139GLM 4.6V837±236405.2%6.4%21 tps1.8s128K$0.38$0.90
135126Qwen3 30B A3B832±209504.0%5.1%163 tps1.0s41K$0.06$0.21
136121QwQ 32B825±161.3K5.5%5.4%41 tps2.1s16K$0.43$0.56
137165Qwen3 4B818±208704.9%1.9%94 tps1.5s128K$0.01$0.01
138161Qwen3 8B813±236105.4%2.4%61 tps1.4s41K$0.02$0.07
139314MAI-DS-R1810±205655.8%<0.1%73 tps3.2s64K$0.10$0.40
140213Claude Haiku 3.5801±151.2K5.9%0.8%40 tps2.8s200K$0.80$4.00
141186Grok 3 Mini799±231.9K2.6%1.2%43 tps0.5s131K$0.30$0.50
142302YouTube797±201.1K4.0%<0.1%34 tps2.7s32K$0.99$0.99
143165Pixtral Large796±266107.6%2.5%57 tps1.3s128K$1.50$4.50
144170Mistral Small 3.2 24B793±284755.9%2.8%141 tps0.7s33K$0.02$0.08
145148Qwen3 30B A3B Thinking 2507787±146554.4%0.5%124 tps1.2s131K$0.16$1.70
146160Llama 4 Scout782±151.6K4.1%0.6%88 tps5.1s131K$0.18$0.46
147214OpenAI o3-mini-high778±176303.1%2.4%231 tps10.5s200K$1.10$4.40
148177OpenAI o3-mini777±122K3.6%0.8%143 tps3.3s200K$1.10$4.40
149229Magistral Medium 2509765±186103.9%4.0%58 tps0.9s131K$2.00$5.00
150175OpenAI o3-mini-low752±171.4K4.3%0.7%139 tps1.5s200K$1.10$4.40
151186Grok 3 Mini Fast724±151.3K4.3%1.6%44 tps0.5s131K$0.60$4.00
152194Llama 3.3 70B719±185503.5%0.3%500 tps0.5s8K$0.48$0.66
153277Wikipedia703±167104.1%<0.1%47 tps2.1s32K$0$0
154274Pixtral 12B674±337205.9%2.2%101 tps1.2s131K$0.08$0.08
155265Qwen 2.5 VL 72B Instruct635±345055.6%5.3%25 tps3.7s128K$1.01$2.79
156179Inception Mercury613±255006.5%0.4%257 tps1.1s32K$0.25$1.00
157182Fauna Fox518±296256.0%<0.1%194 tps0.3s128K$0.04$0.15
158288Qwen 2.5 VL 3B Instruct373±469557.7%3.0%44 tps2.5s128K$0.21$0.63
159292AFM 4.5B95±546059.7%<0.1%81 tps0.3s66K$0.05$0.20
Show Less