Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1524
Claude Opus 4.6 (Thinking)
1424
Claude Opus 4.6
1280
Claude Opus 4.5 (Thinking)
1266
Claude Sonnet 4.6
1256
GPT-5.2 Instant
1248
Gemini 3 Pro
1244
Gemini 3.1 Pro
1240
Gemini 3 Pro (Low)
1231
GPT-5.1 (High)
1230
GPT-5.1
1222
Claude Sonnet 4.6 (Thinking)
1167
Gemini 3 Flash Preview Thinking
1165
Gemini 3 Flash Preview
1164
GPT-5 Chat
1162
GPT-5.2

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1524±169801.0%2.5%56 tps1.6s200K$5.00$25.00
22Claude Opus 4.61424±169501.0%2.1%48 tps1.7s200K$5.00$25.00
37Claude Opus 4.5 (Thinking)1280±142.8K1.4%1.8%49 tps1.4s200K$5.00$25.00
44Claude Sonnet 4.61266±276501.5%1.6%47 tps1.2s200K$3.00$15.00
510GPT-5.2 Instant1256±151.3K1.8%1.7%52 tps2.0s400K$1.75$14.00
610Gemini 3 Pro1248±163.5K1.5%2.1%50 tps3.6s1M$2.00$12.00
76Gemini 3.1 Pro1244±231.4K1.7%3.5%35 tps4.1s1M$2.00$12.00
814Gemini 3 Pro (Low)1240±191.2K0.8%2.4%51 tps3.5s1M$2.00$12.00
98GPT-5.1 (High)1231±151.8K1.7%3.2%76 tps6.9s400K$1.25$10.00
108GPT-5.11230±131.3K1.9%2.3%71 tps1.4s400K$1.42$11.33
115Claude Sonnet 4.6 (Thinking)1222±236301.6%4.7%57 tps1.1s200K$3.00$15.00
1214Gemini 3 Flash Preview Thinking1167±171.4K1.7%1.6%3 tps6.2s1M$0.50$3.00
1317Gemini 3 Flash Preview1165±216751.5%1.3%138 tps1.4s1M$0.50$3.00
1422GPT-5 Chat1164±123.5K1.6%1.3%95 tps0.9s400K$1.25$10.00
1516GPT-5.21162±187851.9%4.1%18 tps2.7s400K$1.75$14.00
1617GPT-5.2 (High)1145±152.2K1.6%6.7%18 tps16.3s400K$1.75$14.00
1717Claude Opus 4.51135±211.1K1.4%1.5%45 tps1.5s200K$5.00$25.00
1844Gemini 2.5 Pro1125±63.1K3.7%2.3%45 tps2.6s1M$1.25$10.00
1926Claude Haiku 4.5 (Extended Thinking)1123±191.1K1.9%1.4%115 tps0.7s200K$1.00$5.00
2032Gemini 2.5 Pro High1119±102.5K2.4%1.5%48 tps2.3s1M$1.25$10.00
2113GPT-5.3 Instant1110±335151.0%0.9%63 tps0.8s400K$1.75$14.00
2242GPT-5.2 (Extra High) 1107±248902.7%13.2%17 tps20.5s400K$1.75$14.00
2310Claude Sonnet 4.5 (Thinking)1102±133.2K3.6%1.9%44 tps1.1s200K$3.00$15.00
2429Qwen3 VL 235B A22B Instruct1094±156752.2%3.1%75 tps1.9s129K$0.37$1.81
2548Claude Sonnet 4 (Thinking)1093±141.6K2.4%1.5%52 tps1.5s200K$3.00$13.67
2642Qwen3 Max Instruct Preview1083±171.1K1.7%1.1%31 tps1.7s256K$1.43$6.61
2744DeepSeek V3.1 Terminus Chat1078±125801.7%3.4%27 tps1.5s131K$0.86$1.80
2826GPT-5 (High)1061±92.5K2.7%4.5%81 tps35.9s400K$1.25$10.00
2952Claude Haiku 4.51060±131.6K3.1%1.1%100 tps0.9s200K$1.00$5.00
3065GLM 4.61059±256402.3%5.4%39 tps1.5s200K$0.42$1.66
3133Qwen3 30B A3B Instruct 25071056±188102.4%1.2%55 tps1.3s131K$0.13$0.72
3240Qwen3 235B A22B Instruct 25071053±196800.7%6.8%13 tps1.9s262K$0.13$0.52
3395Gemini 2.5 Flash1049±182.1K1.9%1.3%2 tps3.7s1M$0.30$2.50
3468Qwen Plus (Aug'24)1048±227302.0%1.4%53 tps1.3s30K$0.40$1.20
3562GPT-5.1 Instant1040±139152.7%1.3%50 tps1.9s400K$1.25$10.00
3637Claude Sonnet 4.51040±82K3.2%1.4%41 tps1.3s200K$1.80$9.00
3760MiniMax M2.11036±226951.4%2.1%66 tps2.6s205K$0.30$1.20
3860Gemini 2.5 Flash Preview 09251025±131.2K2.0%1.2%5 tps0.9s1M$0.13$0.97
3926Grok 4.1 Fast Non-Reasoning1023±218201.8%0.9%101 tps0.5s2M$0.20$0.50
4052Grok 4 Fast Non-Reasoning1023±168701.7%1.5%93 tps0.6s2M$0.27$0.67
4168Grok 41022±102.1K2.5%3.9%29 tps11.1s256K$3.00$15.00
4248Grok 4 Fast Reasoning1022±141.2K2.0%2.1%102 tps3.1s2M$0.30$0.75
4344Grok 4.1 Fast Reasoning1016±181.4K2.0%1.5%58 tps7.3s2M$0.20$0.50
4486Claude Sonnet 41011±191.8K1.4%1.8%49 tps1.3s200K$3.00$15.00
4571Gemini 2.5 Flash Thinking1000±181K1.9%2.2%88 tps6.4s1M$0.30$2.50
4668GLM 4.7992±336352.3%5.8%40 tps1.5s200K$0.77$1.73
4793Qwen Max979±196952.1%1.5%49 tps1.5s33K$1.60$6.40
4856DeepSeek V3.1 Turbo969±376651.5%0.9%173 tps1.3s164K$2.00$3.75
4952GPT-5957±201.6K2.9%3.1%78 tps23.1s400K$1.25$9.67
5084GPT-5 Mini Minimal953±165953.3%1.2%63 tps1.4s400K$0.25$2.00
51101Gemini 2.5 Flash Lite948±161.6K2.7%1.3%210 tps0.7s1M$0.10$0.40
5271Gemini 2.5 Flash Lite Preview 0925948±161.1K2.2%1.2%209 tps0.7s1M$0.25$0.35
5381GPT-4o945±315053.8%1.0%49 tps2.4s128K$3.71$12.57
5479Qwen3 Max Thinking Preview925±265301.9%3.1%40 tps2.1s256K$1.20$6.00
55126Qwen3 VL 235B A22B Thinking922±196453.0%4.3%47 tps3.0s127K$0.47$3.31
56106DeepSeek V3 0324920±255700.9%5.8%12 tps2.7s164K$0.38$0.93
57118GPT-4.1 mini900±169501.0%1.1%67 tps0.9s1M$0.34$1.60
5862MiniMax M2900±247202.7%2.2%39 tps2.3s205K$0.21$0.85
59129DeepSeek V3.1 Thinking886±165102.9%7.1%18 tps1.8s131K$0.23$0.75
60124Kimi K2 0905 Turbo881±177102.1%0.7%373 tps0.5s262K$1.70$6.50
61106Grok 3872±267451.3%1.5%53 tps0.6s1M$3.67$18.33
6271GPT-5 Mini870±159404.1%2.6%66 tps14.2s400K$0.25$2.00
6395Gemini 2.5 Flash Lite Thinking Preview 0925859±161.2K2.5%1.5%152 tps3.0s1M$0.10$0.40
64143Gemini 2.0 Flash Lite856±245853.3%<0.1%42 tps0.5s1M$0.08$0.30
65148OpenAI o4-mini-high854±225601.8%1.9%117 tps15.9s200K$1.10$4.40
6693DeepSeek V3 0324 Turbo843±216350.8%6.3%12 tps2.4s164K$0.73$1.79
67157GPT-5 Nano836±346854.2%3.2%113 tps20.9s400K$0.05$0.40
68133GPT-4.1 nano824±227352.0%0.6%175 tps0.5s1M$0.10$0.40
69113Gemini 2.5 Flash Lite Thinking804±187753.1%1.0%118 tps4.4s1M$0.03$0.13
70139OpenAI o4-mini768±325450.9%1.4%97 tps7.0s128K$1.10$4.40
71157Qwen3 Next 80B A3B Thinking767±238102.4%0.6%175 tps1.3s256K$0.21$2.26
72160Llama 4 Scout722±337002.1%0.6%88 tps5.1s131K$0.18$0.46
73177OpenAI o3-mini676±276901.4%0.8%143 tps3.3s200K$1.10$4.40
74175OpenAI o3-mini-low659±215052.9%0.7%139 tps1.5s200K$1.10$4.40
Show Less