Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1524
Claude Opus 4.6 (Thinking)
1424
Claude Opus 4.6
1280
Claude Opus 4.5 (Thinking)
1266
Claude Sonnet 4.6
1256
GPT-5.2 Instant
1248
Gemini 3 Pro
1244
Gemini 3.1 Pro
1240
Gemini 3 Pro (Low)
1231
GPT-5.1 (High)
1230
GPT-5.1
1222
Claude Sonnet 4.6 (Thinking)
1167
Gemini 3 Flash Preview Thinking
1165
Gemini 3 Flash Preview
1164
GPT-5 Chat
1162
GPT-5.2

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1524±169801.0%2.5%56 tps1.6s200K$5.00$25.00
22Claude Opus 4.61424±169501.0%2.1%48 tps1.7s200K$5.00$25.00
37Claude Opus 4.5 (Thinking)1280±142.8K1.4%1.8%49 tps1.4s200K$5.00$25.00
44Claude Sonnet 4.61266±276501.5%1.6%47 tps1.2s200K$3.00$15.00
510GPT-5.2 Instant1256±151.3K1.8%1.7%52 tps2.0s400K$1.75$14.00
610Gemini 3 Pro1248±163.5K1.5%2.1%50 tps3.6s1M$2.00$12.00
76Gemini 3.1 Pro1244±231.4K1.7%3.5%35 tps4.1s1M$2.00$12.00
814Gemini 3 Pro (Low)1240±191.2K0.8%2.4%51 tps3.5s1M$2.00$12.00
98GPT-5.1 (High)1231±151.8K1.7%3.2%76 tps6.9s400K$1.25$10.00
108GPT-5.11230±131.3K1.9%2.3%71 tps1.4s400K$1.42$11.33
115Claude Sonnet 4.6 (Thinking)1222±236301.6%4.7%57 tps1.1s200K$3.00$15.00
1214Gemini 3 Flash Preview Thinking1167±171.4K1.7%1.6%3 tps6.2s1M$0.50$3.00
1317Gemini 3 Flash Preview1165±216751.5%1.3%138 tps1.4s1M$0.50$3.00
1422GPT-5 Chat1164±123.5K1.6%1.3%95 tps0.9s400K$1.25$10.00
1516GPT-5.21162±187851.9%4.1%18 tps2.7s400K$1.75$14.00
1617GPT-5.2 (High)1145±152.2K1.6%6.7%18 tps16.3s400K$1.75$14.00
1717Claude Opus 4.51135±211.1K1.4%1.5%45 tps1.5s200K$5.00$25.00
1844Gemini 2.5 Pro1125±63.1K3.7%2.3%45 tps2.6s1M$1.25$10.00
1926Claude Haiku 4.5 (Extended Thinking)1123±191.1K1.9%1.4%115 tps0.7s200K$1.00$5.00
2032Gemini 2.5 Pro High1119±102.5K2.4%1.5%48 tps2.3s1M$1.25$10.00
2113GPT-5.3 Instant1110±335151.0%0.9%63 tps0.8s400K$1.75$14.00
2233Kimi K2.51110±267202.0%6.5%33 tps1.7s262K$0.34$2.57
2342GPT-5.2 (Extra High) 1107±248902.7%13.2%17 tps20.5s400K$1.75$14.00
2410Claude Sonnet 4.5 (Thinking)1102±133.2K3.6%1.9%44 tps1.1s200K$3.00$15.00
2529Qwen3 VL 235B A22B Instruct1094±156752.2%3.1%75 tps1.9s129K$0.37$1.81
2648Claude Sonnet 4 (Thinking)1093±141.6K2.4%1.5%52 tps1.5s200K$3.00$13.67
2742Qwen3 Max Instruct Preview1083±171.1K1.7%1.1%31 tps1.7s256K$1.43$6.61
2844DeepSeek V3.1 Terminus Chat1078±125801.7%3.4%27 tps1.5s131K$0.86$1.80
2926GPT-5 (High)1061±92.5K2.7%4.5%81 tps35.9s400K$1.25$10.00
3052Claude Haiku 4.51060±131.6K3.1%1.1%100 tps0.9s200K$1.00$5.00
3165GLM 4.61059±256402.3%5.4%39 tps1.5s200K$0.42$1.66
3233Qwen3 30B A3B Instruct 25071056±188102.4%1.2%55 tps1.3s131K$0.13$0.72
3340Qwen3 235B A22B Instruct 25071053±196800.7%6.8%13 tps1.9s262K$0.13$0.52
3495Gemini 2.5 Flash1049±182.1K1.9%1.3%2 tps3.7s1M$0.30$2.50
3568Qwen Plus (Aug'24)1048±227302.0%1.4%53 tps1.3s30K$0.40$1.20
3662GPT-5.1 Instant1040±139152.7%1.3%50 tps1.9s400K$1.25$10.00
3737Claude Sonnet 4.51040±82K3.2%1.4%41 tps1.3s200K$1.80$9.00
3833Qwen3 Next 80B A3B Instruct1038±159202.6%0.6%84 tps1.1s256K$0.20$1.42
3960MiniMax M2.11036±226951.4%2.1%66 tps2.6s205K$0.30$1.20
4060Gemini 2.5 Flash Preview 09251025±131.2K2.0%1.2%5 tps0.9s1M$0.13$0.97
4126Grok 4.1 Fast Non-Reasoning1023±218201.8%0.9%101 tps0.5s2M$0.20$0.50
4252Grok 4 Fast Non-Reasoning1023±168701.7%1.5%93 tps0.6s2M$0.27$0.67
4368Grok 41022±102.1K2.5%3.9%29 tps11.1s256K$3.00$15.00
4448Grok 4 Fast Reasoning1022±141.2K2.0%2.1%102 tps3.1s2M$0.30$0.75
4544Grok 4.1 Fast Reasoning1016±181.4K2.0%1.5%58 tps7.3s2M$0.20$0.50
4686Claude Sonnet 41011±191.8K1.4%1.8%49 tps1.3s200K$3.00$15.00
4771Gemini 2.5 Flash Thinking1000±181K1.9%2.2%88 tps6.4s1M$0.30$2.50
4848gpt-oss-120b1000±151.1K1.3%0.7%213 tps0.5s131K$0.11$0.50
4968GLM 4.7992±336352.3%5.8%40 tps1.5s200K$0.77$1.73
5093Qwen Max979±196952.1%1.5%49 tps1.5s33K$1.60$6.40
5156DeepSeek V3.1 Turbo969±376651.5%0.9%173 tps1.3s164K$2.00$3.75
5252GPT-5957±201.6K2.9%3.1%78 tps23.1s400K$1.25$9.67
5384GPT-5 Mini Minimal953±165953.3%1.2%63 tps1.4s400K$0.25$2.00
54101Gemini 2.5 Flash Lite948±161.6K2.7%1.3%210 tps0.7s1M$0.10$0.40
5571Gemini 2.5 Flash Lite Preview 0925948±161.1K2.2%1.2%209 tps0.7s1M$0.25$0.35
5681GPT-4o945±315053.8%1.0%49 tps2.4s128K$3.71$12.57
5756DeepSeek V3.2 Thinking942±267052.8%9.0%30 tps2.6s131K$0.28$0.42
5879Qwen3 Max Thinking Preview925±265301.9%3.1%40 tps2.1s256K$1.20$6.00
59126Qwen3 VL 235B A22B Thinking922±196453.0%4.3%47 tps3.0s127K$0.47$3.31
60106DeepSeek V3 0324920±255700.9%5.8%12 tps2.7s164K$0.38$0.93
6144Kimi K2 Thinking Turbo917±275301.9%2.0%75 tps1.4s262K$1.15$8.00
62126DeepSeek V3910±385651.7%0.9%69 tps1.1s64K$0.59$1.49
63118GPT-4.1 mini900±169501.0%1.1%67 tps0.9s1M$0.34$1.60
6462MiniMax M2900±247202.7%2.2%39 tps2.3s205K$0.21$0.85
65113Kimi K2 Fast887±141.6K1.0%0.8%365 tps0.5s131K$1.00$3.00
66129DeepSeek V3.1 Thinking886±165102.9%7.1%18 tps1.8s131K$0.23$0.75
6765Mistral Large 3881±274953.9%2.1%51 tps1.0s256K$0.50$1.50
68124Kimi K2 0905 Turbo881±177102.1%0.7%373 tps0.5s262K$1.70$6.50
69106Grok 3872±267451.3%1.5%53 tps0.6s1M$3.67$18.33
7071GPT-5 Mini870±159404.1%2.6%66 tps14.2s400K$0.25$2.00
7195Gemini 2.5 Flash Lite Thinking Preview 0925859±161.2K2.5%1.5%152 tps3.0s1M$0.10$0.40
72143Gemini 2.0 Flash Lite856±245853.3%<0.1%42 tps0.5s1M$0.08$0.30
73148OpenAI o4-mini-high854±225601.8%1.9%117 tps15.9s200K$1.10$4.40
7493DeepSeek V3 0324 Turbo843±216350.8%6.3%12 tps2.4s164K$0.73$1.79
75157GPT-5 Nano836±346854.2%3.2%113 tps20.9s400K$0.05$0.40
76129Command A836±158551.2%2.2%42 tps0.8s256K$2.00$7.33
77133GPT-4.1 nano824±227352.0%0.6%175 tps0.5s1M$0.10$0.40
78101gpt-oss-20b816±205551.8%0.5%216 tps0.5s131K$0.06$0.26
79113Gemini 2.5 Flash Lite Thinking804±187753.1%1.0%118 tps4.4s1M$0.03$0.13
80139OpenAI o4-mini768±325450.9%1.4%97 tps7.0s128K$1.10$4.40
81157Qwen3 Next 80B A3B Thinking767±238102.4%0.6%175 tps1.3s256K$0.21$2.26
82160Llama 4 Scout722±337002.1%0.6%88 tps5.1s131K$0.18$0.46
83161Llama 4 Maverick719±271K2.9%1.2%88 tps2.4s1M$0.23$0.83
84177OpenAI o3-mini676±276901.4%0.8%143 tps3.3s200K$1.10$4.40
85175OpenAI o3-mini-low659±215052.9%0.7%139 tps1.5s200K$1.10$4.40
Show Less