Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1524
Claude Opus 4.6 (Thinking)
1424
Claude Opus 4.6
1280
Claude Opus 4.5 (Thinking)
1266
Claude Sonnet 4.6
1256
GPT-5.2 Instant
1248
Gemini 3 Pro
1244
Gemini 3.1 Pro
1240
Gemini 3 Pro (Low)
1231
GPT-5.1 (High)
1230
GPT-5.1
1222
Claude Sonnet 4.6 (Thinking)
1178
Mistral Medium 3.1
1167
Gemini 3 Flash Preview Thinking
1165
Gemini 3 Flash Preview
1164
GPT-5 Chat

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
11Claude Opus 4.6 (Thinking)1524±169801.0%2.5%56 tps1.6s200K$5.00$25.00
22Claude Opus 4.61424±169501.0%2.1%48 tps1.7s200K$5.00$25.00
37Claude Opus 4.5 (Thinking)1280±142.8K1.4%1.8%49 tps1.4s200K$5.00$25.00
44Claude Sonnet 4.61266±276501.5%1.6%47 tps1.2s200K$3.00$15.00
510GPT-5.2 Instant1256±151.3K1.8%1.7%52 tps2.0s400K$1.75$14.00
610Gemini 3 Pro1248±163.5K1.5%2.1%50 tps3.6s1M$2.00$12.00
76Gemini 3.1 Pro1244±231.4K1.7%3.5%35 tps4.1s1M$2.00$12.00
814Gemini 3 Pro (Low)1240±191.2K0.8%2.4%51 tps3.5s1M$2.00$12.00
98GPT-5.1 (High)1231±151.8K1.7%3.2%76 tps6.9s400K$1.25$10.00
108GPT-5.11230±131.3K1.9%2.3%71 tps1.4s400K$1.42$11.33
115Claude Sonnet 4.6 (Thinking)1222±236301.6%4.7%57 tps1.1s200K$3.00$15.00
1219Mistral Medium 3.11178±101.5K1.6%<0.1%77 tps0.7s128K$0.40$2.00
1314Gemini 3 Flash Preview Thinking1167±171.4K1.7%1.6%3 tps6.2s1M$0.50$3.00
1417Gemini 3 Flash Preview1165±216751.5%1.3%138 tps1.4s1M$0.50$3.00
1522GPT-5 Chat1164±123.5K1.6%1.3%95 tps0.9s400K$1.25$10.00
1616GPT-5.21162±187851.9%4.1%18 tps2.7s400K$1.75$14.00
1717GPT-5.2 (High)1145±152.2K1.6%6.7%18 tps16.3s400K$1.75$14.00
1817Claude Opus 4.51135±211.1K1.4%1.5%45 tps1.5s200K$5.00$25.00
1944Gemini 2.5 Pro1125±63.1K3.7%2.3%45 tps2.6s1M$1.25$10.00
2026Claude Haiku 4.5 (Extended Thinking)1123±191.1K1.9%1.4%115 tps0.7s200K$1.00$5.00
2116Nova Experimental Chat 11-101120±205002.0%0.4%84 tps8.9s98K$0$0
2232Gemini 2.5 Pro High1119±102.5K2.4%1.5%48 tps2.3s1M$1.25$10.00
2313GPT-5.3 Instant1110±335151.0%0.9%63 tps0.8s400K$1.75$14.00
2433Kimi K2.51110±267202.0%6.5%33 tps1.7s262K$0.34$2.57
2542GPT-5.2 (Extra High) 1107±248902.7%13.2%17 tps20.5s400K$1.75$14.00
2610Claude Sonnet 4.5 (Thinking)1102±133.2K3.6%1.9%44 tps1.1s200K$3.00$15.00
2743Gemini 2.5 Flash Thinking Preview 09251097±101.3K1.6%<0.1%111 tps4.7s1M$0.30$2.50
2829Qwen3 VL 235B A22B Instruct1094±156752.2%3.1%75 tps1.9s129K$0.37$1.81
2948Claude Sonnet 4 (Thinking)1093±141.6K2.4%1.5%52 tps1.5s200K$3.00$13.67
3042Qwen3 Max Instruct Preview1083±171.1K1.7%1.1%31 tps1.7s256K$1.43$6.61
3144DeepSeek V3.1 Terminus Chat1078±125801.7%3.4%27 tps1.5s131K$0.86$1.80
3226GPT-5 (High)1061±92.5K2.7%4.5%81 tps35.9s400K$1.25$10.00
3352Claude Haiku 4.51060±131.6K3.1%1.1%100 tps0.9s200K$1.00$5.00
3465GLM 4.61059±256402.3%5.4%39 tps1.5s200K$0.42$1.66
3533Qwen3 30B A3B Instruct 25071056±188102.4%1.2%55 tps1.3s131K$0.13$0.72
3640Qwen3 235B A22B Instruct 25071053±196800.7%6.8%13 tps1.9s262K$0.13$0.52
3795Gemini 2.5 Flash1049±182.1K1.9%1.3%2 tps3.7s1M$0.30$2.50
3868Qwen Plus (Aug'24)1048±227302.0%1.4%53 tps1.3s30K$0.40$1.20
3956Gemini 2.5 Pro Low1044±161.3K2.3%<0.1%89 tps2.4s1M$1.25$10.00
4084Claude Sonnet 3.7 (Thinking)1041±225752.5%<0.1%41 tps2.6s200K$3.00$15.00
4162GPT-5.1 Instant1040±139152.7%1.3%50 tps1.9s400K$1.25$10.00
4237Claude Sonnet 4.51040±82K3.2%1.4%41 tps1.3s200K$1.80$9.00
43111Claude Sonnet 3.71039±198002.4%<0.1%39 tps1.6s200K$3.00$15.00
4433Qwen3 Next 80B A3B Instruct1038±159202.6%0.6%84 tps1.1s256K$0.20$1.42
4560MiniMax M2.11036±226951.4%2.1%66 tps2.6s205K$0.30$1.20
4660Gemini 2.5 Flash Preview 09251025±131.2K2.0%1.2%5 tps0.9s1M$0.13$0.97
4726Grok 4.1 Fast Non-Reasoning1023±218201.8%0.9%101 tps0.5s2M$0.20$0.50
4852Grok 4 Fast Non-Reasoning1023±168701.7%1.5%93 tps0.6s2M$0.27$0.67
4968Grok 41022±102.1K2.5%3.9%29 tps11.1s256K$3.00$15.00
5048Grok 4 Fast Reasoning1022±141.2K2.0%2.1%102 tps3.1s2M$0.30$0.75
5144Grok 4.1 Fast Reasoning1016±181.4K2.0%1.5%58 tps7.3s2M$0.20$0.50
5286Claude Sonnet 41011±191.8K1.4%1.8%49 tps1.3s200K$3.00$15.00
5371Gemini 2.5 Flash Thinking1000±181K1.9%2.2%88 tps6.4s1M$0.30$2.50
5448gpt-oss-120b1000±151.1K1.3%0.7%213 tps0.5s131K$0.11$0.50
5556Claude Opus 4.1 (Thinking)997±147403.9%<0.1%20 tps3.9s200K$15.00$75.00
5668GLM 4.7992±336352.3%5.8%40 tps1.5s200K$0.77$1.73
5793Qwen Max979±196952.1%1.5%49 tps1.5s33K$1.60$6.40
5856DeepSeek V3.1 Turbo969±376651.5%0.9%173 tps1.3s164K$2.00$3.75
59108GPT-5 Mini Low968±145903.3%<0.1%69 tps3.2s400K$0.25$2.00
6077Claude Opus 4.1964±226104.7%3.0%17 tps3.7s200K$15.00$75.00
6152GPT-5957±201.6K2.9%3.1%78 tps23.1s400K$1.25$9.67
6284GPT-5 Mini Minimal953±165953.3%1.2%63 tps1.4s400K$0.25$2.00
63101Gemini 2.5 Flash Lite948±161.6K2.7%1.3%210 tps0.7s1M$0.10$0.40
6471Gemini 2.5 Flash Lite Preview 0925948±161.1K2.2%1.2%209 tps0.7s1M$0.25$0.35
6581GPT-4o945±315053.8%1.0%49 tps2.4s128K$3.71$12.57
6656DeepSeek V3.2 Thinking942±267052.8%9.0%30 tps2.6s131K$0.28$0.42
6748OpenAI o1-mini937±275802.5%<0.1%118 tpsN/A128K$1.13$4.51
6879Qwen3 Max Thinking Preview925±265301.9%3.1%40 tps2.1s256K$1.20$6.00
69126Qwen3 VL 235B A22B Thinking922±196453.0%4.3%47 tps3.0s127K$0.47$3.31
7080GPT-5 (Minimal)922±139554.0%<0.1%67 tps1.4s400K$1.25$10.00
71106DeepSeek V3 0324920±255700.9%5.8%12 tps2.7s164K$0.38$0.93
7244Kimi K2 Thinking Turbo917±275301.9%2.0%75 tps1.4s262K$1.15$8.00
73126DeepSeek V3910±385651.7%0.9%69 tps1.1s64K$0.59$1.49
74118GPT-4.1 mini900±169501.0%1.1%67 tps0.9s1M$0.34$1.60
7562MiniMax M2900±247202.7%2.2%39 tps2.3s205K$0.21$0.85
76113Kimi K2 Fast887±141.6K1.0%0.8%365 tps0.5s131K$1.00$3.00
77129DeepSeek V3.1 Thinking886±165102.9%7.1%18 tps1.8s131K$0.23$0.75
7865Mistral Large 3881±274953.9%2.1%51 tps1.0s256K$0.50$1.50
79124Kimi K2 0905 Turbo881±177102.1%0.7%373 tps0.5s262K$1.70$6.50
80106Grok 3872±267451.3%1.5%53 tps0.6s1M$3.67$18.33
8171GPT-5 Mini870±159404.1%2.6%66 tps14.2s400K$0.25$2.00
8295Gemini 2.5 Flash Lite Thinking Preview 0925859±161.2K2.5%1.5%152 tps3.0s1M$0.10$0.40
83143Gemini 2.0 Flash Lite856±245853.3%<0.1%42 tps0.5s1M$0.08$0.30
84148OpenAI o4-mini-high854±225601.8%1.9%117 tps15.9s200K$1.10$4.40
8593DeepSeek V3 0324 Turbo843±216350.8%6.3%12 tps2.4s164K$0.73$1.79
86157GPT-5 Nano836±346854.2%3.2%113 tps20.9s400K$0.05$0.40
87129Command A836±158551.2%2.2%42 tps0.8s256K$2.00$7.33
88133GPT-4.1 nano824±227352.0%0.6%175 tps0.5s1M$0.10$0.40
89101gpt-oss-20b816±205551.8%0.5%216 tps0.5s131K$0.06$0.26
90113Gemini 2.5 Flash Lite Thinking804±187753.1%1.0%118 tps4.4s1M$0.03$0.13
91213Claude Haiku 3.5803±245456.0%0.8%40 tps2.8s200K$0.80$4.00
92302YouTube802±224854.0%<0.1%34 tps2.7s32K$0.99$0.99
93139OpenAI o4-mini768±325450.9%1.4%97 tps7.0s128K$1.10$4.40
94157Qwen3 Next 80B A3B Thinking767±238102.4%0.6%175 tps1.3s256K$0.21$2.26
95160Llama 4 Scout722±337002.1%0.6%88 tps5.1s131K$0.18$0.46
96161Llama 4 Maverick719±271K2.9%1.2%88 tps2.4s1M$0.23$0.83
97177OpenAI o3-mini676±276901.4%0.8%143 tps3.3s200K$1.10$4.40
98175OpenAI o3-mini-low659±215052.9%0.7%139 tps1.5s200K$1.10$4.40
Show Less