Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1485
Claude Opus 4.6 (Thinking)
1483
Claude Opus 4.6
1344
Gemini 3.1 Pro
1295
Claude Sonnet 4.6
1274
GPT-5.2 Instant
1274
GPT-5.1 (High)
1270
Gemini 3 Pro (Low)
1268
GPT-5.1
1265
Gemini 3 Pro
1260
Claude Sonnet 4.5 (Thinking)
1248
Claude Opus 4.5 (Thinking)
1235
Claude Sonnet 4.6 (Thinking)
1228
Claude Opus 4.5
1220
GPT-5 Chat
1213
GPT-5.2

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1485±112.3K1.3%2.5%56 tps1.6s200K$5.00$25.00
22Claude Opus 4.61483±113.2K0.9%2.1%48 tps1.7s200K$5.00$25.00
36Gemini 3.1 Pro1344±162.8K2.4%3.5%35 tps4.1s1M$2.00$12.00
44Claude Sonnet 4.61295±161.8K0.8%1.6%47 tps1.2s200K$3.00$15.00
510GPT-5.2 Instant1274±83.9K2.2%1.7%52 tps2.0s400K$1.75$14.00
68GPT-5.1 (High)1274±64.2K2.5%3.2%76 tps6.9s400K$1.25$10.00
714Gemini 3 Pro (Low)1270±123.5K2.4%2.4%51 tps3.5s1M$2.00$12.00
88GPT-5.11268±63.3K1.5%2.3%71 tps1.4s400K$1.42$11.33
910Gemini 3 Pro1265±512K1.3%2.1%50 tps3.6s1M$2.00$12.00
1010Claude Sonnet 4.5 (Thinking)1260±48.3K1.8%1.9%44 tps1.1s200K$3.00$15.00
117Claude Opus 4.5 (Thinking)1248±69.3K2.3%1.8%49 tps1.4s200K$5.00$25.00
125Claude Sonnet 4.6 (Thinking)1235±181.7K2.0%4.7%57 tps1.1s200K$3.00$15.00
1317Claude Opus 4.51228±92.9K1.8%1.5%45 tps1.5s200K$5.00$25.00
1422GPT-5 Chat1220±610.4K2.1%1.3%95 tps0.9s400K$1.25$10.00
1516GPT-5.21213±122.7K2.2%4.1%18 tps2.7s400K$1.75$14.00
1614Gemini 3 Flash Preview Thinking1212±84.2K1.8%1.6%3 tps6.2s1M$0.50$3.00
1717Gemini 3 Flash Preview1198±142K2.0%1.3%138 tps1.4s1M$0.50$3.00
1832Gemini 2.5 Pro High1189±64.5K2.5%1.5%48 tps2.3s1M$1.25$10.00
1922GLM 51184±227952.5%3.4%36 tps2.7s200K$0.72$2.55
2017GPT-5.2 (High)1175±106.4K1.7%6.7%18 tps16.3s400K$1.75$14.00
2142GPT-5.2 (Extra High) 1168±122.4K1.8%13.2%17 tps20.5s400K$1.75$14.00
2213GPT-5.3 Instant1160±191.5K1.9%0.9%63 tps0.8s400K$1.75$14.00
2344Gemini 2.5 Pro1158±57.8K3.3%2.3%45 tps2.6s1M$1.25$10.00
2452GPT-51157±75.3K2.9%3.1%78 tps23.1s400K$1.25$9.67
2537Claude Sonnet 4.51156±55.1K2.8%1.4%41 tps1.3s200K$1.80$9.00
2671Gemini 2.5 Flash Thinking1148±72.6K4.2%2.2%88 tps6.4s1M$0.30$2.50
2748Claude Sonnet 4 (Thinking)1144±64.2K3.5%1.5%52 tps1.5s200K$3.00$13.67
2829Nova Experimental Chat 12-101135±197202.0%2.4%84 tps12.9s98K$0$0
2960Gemini 2.5 Flash Preview 09251131±102.1K2.7%1.2%5 tps0.9s1M$0.13$0.97
3026Grok 4.1 Fast Non-Reasoning1130±152K4.3%0.9%101 tps0.5s2M$0.20$0.50
3133Qwen3 30B A3B Instruct 25071127±92.5K2.9%1.2%55 tps1.3s131K$0.13$0.72
3281OpenAI o3-pro1126±102.3K3.1%5.2%22 tps70.8s200K$20.00$80.00
3340Qwen3 235B A22B Instruct 25071118±112.5K2.1%6.8%13 tps1.9s262K$0.13$0.52
3468Qwen Plus (Aug'24)1115±81.9K2.6%1.4%53 tps1.3s30K$0.40$1.20
3526Claude Haiku 4.5 (Extended Thinking)1115±122.2K2.7%1.4%115 tps0.7s200K$1.00$5.00
3629Qwen3 VL 235B A22B Instruct1114±81.3K2.5%3.1%75 tps1.9s129K$0.37$1.81
3771Qwen3.5 397B A17B1112±245802.5%4.3%57 tps1.4s256K$0.52$3.00
3826GPT-5 (High)1110±74.3K3.1%4.5%81 tps35.9s400K$1.25$10.00
3995Gemini 2.5 Flash1098±94.8K2.7%1.3%2 tps3.7s1M$0.30$2.50
4068Grok 41093±55.7K4.0%3.9%29 tps11.1s256K$3.00$15.00
4148Grok 4 Fast Reasoning1090±112.1K3.1%2.1%102 tps3.1s2M$0.30$0.75
4256Gemini 3.1 Flash Lite Preview Thinking1083±324853.0%1.7%75 tps4.7s1M$0.25$1.50
4344Grok 4.1 Fast Reasoning1076±102.6K4.2%1.5%58 tps7.3s2M$0.20$0.50
4471GPT-5 Mini1075±92.1K4.3%2.6%66 tps14.2s400K$0.25$2.00
4562GPT-5.1 Instant1075±92.2K2.6%1.3%50 tps1.9s400K$1.25$10.00
4693DeepSeek V3 0324 Turbo1074±142.1K1.9%6.3%12 tps2.4s164K$0.73$1.79
4771Gemini 2.5 Flash Lite Preview 09251066±112.2K2.8%1.2%209 tps0.7s1M$0.25$0.35
4886Claude Sonnet 41066±85.3K2.5%1.8%49 tps1.3s200K$3.00$15.00
4942Qwen3 Max Instruct Preview1063±72.7K1.5%1.1%31 tps1.7s256K$1.43$6.61
5040DeepSeek V3.21063±161.1K2.5%1.4%83 tps5.1s131K$0.43$1.09
5152Claude Haiku 4.51057±63.4K3.4%1.1%100 tps0.9s200K$1.00$5.00
5252Grok 4 Fast Non-Reasoning1054±81.6K2.5%1.5%93 tps0.6s2M$0.27$0.67
5381GPT-4o1046±151.4K2.5%1.0%49 tps2.4s128K$3.71$12.57
5460MiniMax M2.11044±121.7K2.8%2.1%66 tps2.6s205K$0.30$1.20
5595Gemini 2.5 Flash Lite Thinking Preview 09251044±91.7K3.5%1.5%152 tps3.0s1M$0.10$0.40
5662MiniMax M21043±91.8K4.2%2.2%39 tps2.3s205K$0.21$0.85
5765GLM 4.61041±111.6K2.9%5.4%39 tps1.5s200K$0.42$1.66
5844DeepSeek V3.1 Terminus Chat1037±91.3K2.2%3.4%27 tps1.5s131K$0.86$1.80
59129Qwen3 Max Thinking1029±316002.4%13.5%32 tps2.3s256K$1.20$6.00
6093Qwen Max1021±141.8K2.7%1.5%49 tps1.5s33K$1.60$6.40
6179Qwen3 Max Thinking Preview1020±101.2K2.4%3.1%40 tps2.1s256K$1.20$6.00
6286DeepSeek V3.1 Chat1018±121.1K3.1%2.8%21 tps1.6s131K$0.38$1.00
63133Kimi K2 09051014±138102.4%4.0%30 tps1.4s262K$0.63$2.39
64101Gemini 2.5 Flash Lite1014±95.3K3.9%1.3%210 tps0.7s1M$0.10$0.40
6586Amazon Nova 2 Lite1013±188154.7%1.0%137 tps0.6s300K$0.35$2.95
66106Grok 31003±92K2.6%1.5%53 tps0.6s1M$3.67$18.33
67170Kimi K2 07111002±157203.4%1.6%29 tps1.3s131K$0.72$2.60
68113Gemini 2.5 Flash Lite Thinking996±112.5K3.7%1.0%118 tps4.4s1M$0.03$0.13
69124Kimi K2 0905 Turbo991±121.6K1.8%0.7%373 tps0.5s262K$1.70$6.50
70106DeepSeek V3 0324990±112.1K3.0%5.8%12 tps2.7s164K$0.38$0.93
71113Mistral Medium989±141.1K2.7%1.8%48 tps0.6s33K$1.48$4.55
7295Kimi K2 Thinking985±216203.1%4.2%61 tps5.9s262K$0.24$1.03
73118GPT-4.1 mini976±132.7K1.8%1.1%67 tps0.9s1M$0.34$1.60
74113GLM 4.5969±191.3K3.5%3.7%46 tps1.4s131K$0.43$1.63
7584GPT-5 Mini Minimal968±177953.6%1.2%63 tps1.4s400K$0.25$2.00
76148OpenAI o3960±166002.4%0.9%85 tps6.8s128K$7.33$29.33
77129DeepSeek V3.1 Thinking958±141K2.4%7.1%18 tps1.8s131K$0.23$0.75
7856DeepSeek V3.1 Turbo957±148204.1%0.9%173 tps1.3s164K$2.00$3.75
79148OpenAI o4-mini-high950±121.5K3.8%1.9%117 tps15.9s200K$1.10$4.40
8068GLM 4.7949±131.6K1.8%5.8%40 tps1.5s200K$0.77$1.73
8171Seed 1.8 251228949±181.2K2.7%3.7%41 tps2.1s256K$0.25$2.00
82139OpenAI o4-mini947±111.2K2.5%1.4%97 tps7.0s128K$1.10$4.40
83126Qwen3 VL 235B A22B Thinking939±159653.5%4.3%47 tps3.0s127K$0.47$3.31
84153OpenAI o1926±159152.1%4.2%92 tps5.5s200K$15.00$60.00
85124Qwen3 235B A22B Thinking 2507905±166952.1%2.5%53 tps1.6s131K$0.59$5.70
86143Gemini 2.0 Flash Lite902±149907.9%<0.1%42 tps0.5s1M$0.08$0.30
87157GPT-5 Nano901±91.2K5.0%3.2%113 tps20.9s400K$0.05$0.40
88133GPT-4.1 nano896±111.8K3.2%0.6%175 tps0.5s1M$0.10$0.40
89157Qwen3 Next 80B A3B Thinking890±161.1K3.4%0.6%175 tps1.3s256K$0.21$2.26
90143Gemini 2.0 Flash885±168705.9%<0.1%76 tps0.5s1M$0.14$0.56
91119ERNIE 4.5 300B A47B873±121.1K3.4%4.7%23 tps2.3s123K$0.28$1.10
92133DeepSeek V3.2 Speciale842±384854.9%6.0%43 tps1.4s131K$0.84$1.52
93139GLM 4.6V837±236405.2%6.4%21 tps1.8s128K$0.38$0.90
94165Qwen3 4B818±208704.9%1.9%94 tps1.5s128K$0.01$0.01
95161Qwen3 8B813±236105.4%2.4%61 tps1.4s41K$0.02$0.07
96186Grok 3 Mini799±231.9K2.6%1.2%43 tps0.5s131K$0.30$0.50
97170Mistral Small 3.2 24B793±284755.9%2.8%141 tps0.7s33K$0.02$0.08
98148Qwen3 30B A3B Thinking 2507787±146554.4%0.5%124 tps1.2s131K$0.16$1.70
99160Llama 4 Scout782±151.6K4.1%0.6%88 tps5.1s131K$0.18$0.46
100214OpenAI o3-mini-high778±176303.1%2.4%231 tps10.5s200K$1.10$4.40
101177OpenAI o3-mini777±122K3.6%0.8%143 tps3.3s200K$1.10$4.40
102229Magistral Medium 2509765±186103.9%4.0%58 tps0.9s131K$2.00$5.00
103175OpenAI o3-mini-low752±171.4K4.3%0.7%139 tps1.5s200K$1.10$4.40
104186Grok 3 Mini Fast724±151.3K4.3%1.6%44 tps0.5s131K$0.60$4.00
105194Llama 3.3 70B719±185503.5%0.3%500 tps0.5s8K$0.48$0.66
106265Qwen 2.5 VL 72B Instruct635±345055.6%5.3%25 tps3.7s128K$1.01$2.79
107179Inception Mercury613±255006.5%0.4%257 tps1.1s32K$0.25$1.00
Show Less