
Last updated about 1 month ago

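Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output)
1 | 1 | Claude Opus 4.6 (Thinking) | 1485 | ±11 | 2.3K | 1.3% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00
2 | 2 | Claude Opus 4.6 | 1483 | ±11 | 3.2K | 0.9% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00
3 | 6 | Gemini 3.1 Pro | 1344 | ±16 | 2.8K | 2.4% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00
4 | 4 | Claude Sonnet 4.6 | 1295 | ±16 | 1.8K | 0.8% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00
5 | 10 | GPT-5.2 Instant | 1274 | ±8 | 3.9K | 2.2% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00
6 | 8 | GPT-5.1 (High) | 1274 | ±6 | 4.2K | 2.5% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00
7 | 14 | Gemini 3 Pro (Low) | 1270 | ±12 | 3.5K | 2.4% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00
8 | 8 | GPT-5.1 | 1268 | ±6 | 3.3K | 1.5% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33
9 | 10 | Gemini 3 Pro | 1265 | ±5 | 12K | 1.3% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00
10 | 10 | Claude Sonnet 4.5 (Thinking) | 1260 | ±4 | 8.3K | 1.8% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00
11 | 7 | Claude Opus 4.5 (Thinking) | 1248 | ±6 | 9.3K | 2.3% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00
12 | 5 | Claude Sonnet 4.6 (Thinking) | 1235 | ±18 | 1.7K | 2.0% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00
13 | 17 | Claude Opus 4.5 | 1228 | ±9 | 2.9K | 1.8% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00
14 | 22 | GPT-5 Chat | 1220 | ±6 | 10.4K | 2.1% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00
15 | 16 | GPT-5.2 | 1213 | ±12 | 2.7K | 2.2% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00
16 | 14 | Gemini 3 Flash Preview Thinking | 1212 | ±8 | 4.2K | 1.8% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00
17 | 17 | Gemini 3 Flash Preview | 1198 | ±14 | 2K | 2.0% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00
18 | 32 | Gemini 2.5 Pro High | 1189 | ±6 | 4.5K | 2.5% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00
19 | 22 | GLM 5 | 1184 | ±22 | 795 | 2.5% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55
20 | 17 | GPT-5.2 (High) | 1175 | ±10 | 6.4K | 1.7% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00
21 | 42 | GPT-5.2 (Extra High) | 1168 | ±12 | 2.4K | 1.8% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00
22 | 13 | GPT-5.3 Instant | 1160 | ±19 | 1.5K | 1.9% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00
23 | 44 | Gemini 2.5 Pro | 1158 | ±5 | 7.8K | 3.3% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00
24 | 52 | GPT-5 | 1157 | ±7 | 5.3K | 2.9% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67
25 | 37 | Claude Sonnet 4.5 | 1156 | ±5 | 5.1K | 2.8% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00
26 | 71 | Gemini 2.5 Flash Thinking | 1148 | ±7 | 2.6K | 4.2% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50
27 | 48 | Claude Sonnet 4 (Thinking) | 1144 | ±6 | 4.2K | 3.5% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67
28 | 33 | Qwen3 Next 80B A3B Instruct | 1142 | ±9 | 1.8K | 2.5% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42
29 | 29 | Nova Experimental Chat 12-10 | 1135 | ±19 | 720 | 2.0% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0
30 | 60 | Gemini 2.5 Flash Preview 0925 | 1131 | ±10 | 2.1K | 2.7% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97
31 | 26 | Grok 4.1 Fast Non-Reasoning | 1130 | ±15 | 2K | 4.3% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50
32 | 33 | Qwen3 30B A3B Instruct 2507 | 1127 | ±9 | 2.5K | 2.9% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72
33 | 81 | OpenAI o3-pro | 1126 | ±10 | 2.3K | 3.1% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00
34 | 40 | Qwen3 235B A22B Instruct 2507 | 1118 | ±11 | 2.5K | 2.1% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52
35 | 68 | Qwen Plus (Aug'24) | 1115 | ±8 | 1.9K | 2.6% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20
36 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1115 | ±12 | 2.2K | 2.7% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00
37 | 29 | Qwen3 VL 235B A22B Instruct | 1114 | ±8 | 1.3K | 2.5% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81
38 | 71 | Qwen3.5 397B A17B | 1112 | ±24 | 580 | 2.5% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00
39 | 26 | GPT-5 (High) | 1110 | ±7 | 4.3K | 3.1% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00
40 | 37 | Kimi K2.5 Instant | 1101 | ±28 | 495 | 1.0% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00
41 | 95 | Gemini 2.5 Flash | 1098 | ±9 | 4.8K | 2.7% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50
42 | 68 | Grok 4 | 1093 | ±5 | 5.7K | 4.0% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00
43 | 48 | Grok 4 Fast Reasoning | 1090 | ±11 | 2.1K | 3.1% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75
44 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1083 | ±32 | 485 | 3.0% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50
45 | 33 | Kimi K2.5 | 1083 | ±16 | 1.7K | 3.2% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57
46 | 48 | gpt-oss-120b | 1083 | ±7 | 3.5K | 3.0% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50
47 | 44 | Grok 4.1 Fast Reasoning | 1076 | ±10 | 2.6K | 4.2% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50
48 | 71 | GPT-5 Mini | 1075 | ±9 | 2.1K | 4.3% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00
49 | 62 | GPT-5.1 Instant | 1075 | ±9 | 2.2K | 2.6% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00
50 | 93 | DeepSeek V3 0324 Turbo | 1074 | ±14 | 2.1K | 1.9% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79
51 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1066 | ±11 | 2.2K | 2.8% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35
52 | 86 | Claude Sonnet 4 | 1066 | ±8 | 5.3K | 2.5% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00
53 | 42 | Qwen3 Max Instruct Preview | 1063 | ±7 | 2.7K | 1.5% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61
54 | 40 | DeepSeek V3.2 | 1063 | ±16 | 1.1K | 2.5% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09
55 | 52 | Claude Haiku 4.5 | 1057 | ±6 | 3.4K | 3.4% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00
56 | 52 | Grok 4 Fast Non-Reasoning | 1054 | ±8 | 1.6K | 2.5% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67
57 | 81 | GPT-4o | 1046 | ±15 | 1.4K | 2.5% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57
58 | 60 | MiniMax M2.1 | 1044 | ±12 | 1.7K | 2.8% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20
59 | 95 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1044 | ±9 | 1.7K | 3.5% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40
60 | 62 | MiniMax M2 | 1043 | ±9 | 1.8K | 4.2% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85
61 | 65 | GLM 4.6 | 1041 | ±11 | 1.6K | 2.9% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66
62 | 44 | DeepSeek V3.1 Terminus Chat | 1037 | ±9 | 1.3K | 2.2% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80
63 | 56 | DeepSeek V3.2 Thinking | 1033 | ±15 | 1.7K | 2.0% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42
64 | 129 | Qwen3 Max Thinking | 1029 | ±31 | 600 | 2.4% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00
65 | 65 | Mistral Large 3 | 1026 | ±22 | 1.1K | 4.1% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50
66 | 95 | DeepSeek-R1 Turbo | 1021 | ±13 | 485 | 3.0% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75
67 | 93 | Qwen Max | 1021 | ±14 | 1.8K | 2.7% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40
68 | 79 | Qwen3 Max Thinking Preview | 1020 | ±10 | 1.2K | 2.4% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00
69 | 86 | DeepSeek V3.1 Chat | 1018 | ±12 | 1.1K | 3.1% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00
70 | 133 | Kimi K2 0905 | 1014 | ±13 | 810 | 2.4% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39
71 | 101 | Gemini 2.5 Flash Lite | 1014 | ±9 | 5.3K | 3.9% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40
72 | 86 | Amazon Nova 2 Lite | 1013 | ±18 | 815 | 4.7% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95
73 | 106 | Grok 3 | 1003 | ±9 | 2K | 2.6% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33
74 | 170 | Kimi K2 0711 | 1002 | ±15 | 720 | 3.4% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60
75 | 95 | DeepSeek V3.2 Exp Thinking | 999 | ±22 | 775 | 1.9% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42
76 | 113 | Gemini 2.5 Flash Lite Thinking | 996 | ±11 | 2.5K | 3.7% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13
77 | 124 | Kimi K2 0905 Turbo | 991 | ±12 | 1.6K | 1.8% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50
78 | 106 | DeepSeek V3 0324 | 990 | ±11 | 2.1K | 3.0% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93
79 | 113 | Kimi K2 Fast | 989 | ±10 | 7.4K | 2.2% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00
80 | 86 | Qwen3 235B A22B | 989 | ±19 | 740 | 3.9% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63
81 | 113 | Mistral Medium | 989 | ±14 | 1.1K | 2.7% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55
82 | 44 | Kimi K2 Thinking Turbo | 986 | ±14 | 1.1K | 3.2% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00
83 | 95 | Kimi K2 Thinking | 985 | ±21 | 620 | 3.1% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03
84 | 106 | DeepSeek V3.1 Terminus Thinking | 979 | ±13 | 660 | 3.6% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68
85 | 118 | GPT-4.1 mini | 976 | ±13 | 2.7K | 1.8% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60
86 | 113 | GLM 4.5 | 969 | ±19 | 1.3K | 3.5% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63
87 | 84 | GPT-5 Mini Minimal | 968 | ±17 | 795 | 3.6% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00
88 | 148 | OpenAI o3 | 960 | ±16 | 600 | 2.4% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33
89 | 129 | DeepSeek V3.1 Thinking | 958 | ±14 | 1K | 2.4% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75
90 | 56 | DeepSeek V3.1 Turbo | 957 | ±14 | 820 | 4.1% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75
91 | 126 | DeepSeek V3 | 956 | ±12 | 1.7K | 2.5% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49
92 | 148 | OpenAI o4-mini-high | 950 | ±12 | 1.5K | 3.8% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40
93 | 68 | GLM 4.7 | 949 | ±13 | 1.6K | 1.8% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73
94 | 71 | Seed 1.8 251228 | 949 | ±18 | 1.2K | 2.7% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00
95 | 129 | Command A | 948 | ±12 | 1.9K | 3.1% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33
96 | 139 | OpenAI o4-mini | 947 | ±11 | 1.2K | 2.5% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40
97 | 133 | Qwen3 14B | 943 | ±16 | 825 | 4.1% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15
98 | 126 | Qwen3 VL 235B A22B Thinking | 939 | ±15 | 965 | 3.5% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31
99 | 148 | DeepSeek-R1 | 936 | ±21 | 705 | 4.1% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07
100 | 153 | OpenAI o1 | 926 | ±15 | 915 | 2.1% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00
101 | 101 | gpt-oss-20b | 912 | ±12 | 1.5K | 4.1% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26
102 | 65 | DeepSeek V3.2 Exp Chat | 909 | ±11 | 1.3K | 2.6% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39
103 | 124 | Qwen3 235B A22B Thinking 2507 | 905 | ±16 | 695 | 2.1% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70
104 | 133 | DeepSeek-R1 0528 | 904 | ±19 | 640 | 4.5% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67
105 | 143 | Gemini 2.0 Flash Lite | 902 | ±14 | 990 | 7.9% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30
106 | 157 | GPT-5 Nano | 901 | ±9 | 1.2K | 5.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40
107 | 133 | GPT-4.1 nano | 896 | ±11 | 1.8K | 3.2% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40
108 | 157 | Qwen3 Next 80B A3B Thinking | 890 | ±16 | 1.1K | 3.4% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26
109 | 143 | Gemini 2.0 Flash | 885 | ±16 | 870 | 5.9% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56
110 | 119 | ERNIE 4.5 300B A47B | 873 | ±12 | 1.1K | 3.4% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10
111 | 121 | Qwen3 32B Fast | 863 | ±13 | 2K | 4.5% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25
112 | 133 | DeepSeek V3.2 Speciale | 842 | ±38 | 485 | 4.9% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52
113 | 161 | Llama 4 Maverick | 838 | ±10 | 2.4K | 4.3% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83
114 | 139 | GLM 4.6V | 837 | ±23 | 640 | 5.2% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90
115 | 126 | Qwen3 30B A3B | 832 | ±20 | 950 | 4.0% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21
116 | 121 | QwQ 32B | 825 | ±16 | 1.3K | 5.5% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56
117 | 165 | Qwen3 4B | 818 | ±20 | 870 | 4.9% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01
118 | 161 | Qwen3 8B | 813 | ±23 | 610 | 5.4% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07
119 | 186 | Grok 3 Mini | 799 | ±23 | 1.9K | 2.6% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50
120 | 165 | Pixtral Large | 796 | ±26 | 610 | 7.6% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50
121 | 170 | Mistral Small 3.2 24B | 793 | ±28 | 475 | 5.9% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08
122 | 148 | Qwen3 30B A3B Thinking 2507 | 787 | ±14 | 655 | 4.4% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70
123 | 160 | Llama 4 Scout | 782 | ±15 | 1.6K | 4.1% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46
124 | 214 | OpenAI o3-mini-high | 778 | ±17 | 630 | 3.1% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40
125 | 177 | OpenAI o3-mini | 777 | ±12 | 2K | 3.6% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40
126 | 229 | Magistral Medium 2509 | 765 | ±18 | 610 | 3.9% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00
127 | 175 | OpenAI o3-mini-low | 752 | ±17 | 1.4K | 4.3% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40
128 | 186 | Grok 3 Mini Fast | 724 | ±15 | 1.3K | 4.3% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00
129 | 194 | Llama 3.3 70B | 719 | ±18 | 550 | 3.5% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66
130 | 274 | Pixtral 12B | 674 | ±33 | 720 | 5.9% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08
131 | 265 | Qwen 2.5 VL 72B Instruct | 635 | ±34 | 505 | 5.6% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79
132 | 179 | Inception Mercury | 613 | ±25 | 500 | 6.5% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00
133 | 288 | Qwen 2.5 VL 3B Instruct | 373 | ±46 | 955 | 7.7% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63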
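
When reading the score and confidence-interval columns together, one rough way to judge whether two adjacent ranks are meaningfully separated is to check whether their intervals overlap. The Python sketch below illustrates that check. It assumes the "±" value is the half-width of each model's interval, and it uses three scores copied from the top of the leaderboard. The `Entry` class and `intervals_overlap` function are hypothetical names introduced for illustration. The page does not state the confidence level or how the interval is computed, so treat overlap as a heuristic rather than a significance test.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row, reduced to the fields needed here (hypothetical helper)."""
    name: str
    score: float
    half_width: float  # assumed meaning of the "±" value in the table

    @property
    def low(self) -> float:
        return self.score - self.half_width

    @property
    def high(self) -> float:
        return self.score + self.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True when the two score intervals share at least one point."""
    return a.low <= b.high and b.low <= a.high


# Scores and "±" values as read from the first three leaderboard rows.
opus_thinking = Entry("Claude Opus 4.6 (Thinking)", 1485, 11)
opus = Entry("Claude Opus 4.6", 1483, 11)
gemini = Entry("Gemini 3.1 Pro", 1344, 16)

print(intervals_overlap(opus_thinking, opus))  # True: [1474, 1496] vs [1472, 1494]
print(intervals_overlap(opus, gemini))         # False: [1472, 1494] vs [1328, 1360]
```

Under these assumptions, the top two entries are not clearly separated by score alone, while the gap between rank 2 and rank 3 is well outside both intervals.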