Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1016
Grok 4.1 Fast Reasoning
1022
Grok 4 Fast Reasoning
1022
Grok 4
1023
Grok 4 Fast Non-Reasoning
1023
Grok 4.1 Fast Non-Reasoning
1025
Gemini 2.5 Flash Preview 0925
1036
MiniMax M2.1
1038
Qwen3 Next 80B A3B Instruct
1040
Claude Sonnet 4.5
1040
GPT-5.1 Instant
1048
Qwen Plus (Aug'24)
1049
Gemini 2.5 Flash
1053
Qwen3 235B A22B Instruct 2507
1056
Qwen3 30B A3B Instruct 2507
1059
GLM 4.6

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
4144Grok 4.1 Fast Reasoning1016±181.4K2.0%1.5%58 tps7.3s2M$0.20$0.50
4248Grok 4 Fast Reasoning1022±141.2K2.0%2.1%102 tps3.1s2M$0.30$0.75
4368Grok 41022±102.1K2.5%3.9%29 tps11.1s256K$3.00$15.00
4452Grok 4 Fast Non-Reasoning1023±168701.7%1.5%93 tps0.6s2M$0.27$0.67
4526Grok 4.1 Fast Non-Reasoning1023±218201.8%0.9%101 tps0.5s2M$0.20$0.50
4660Gemini 2.5 Flash Preview 09251025±131.2K2.0%1.2%5 tps0.9s1M$0.13$0.97
4760MiniMax M2.11036±226951.4%2.1%66 tps2.6s205K$0.30$1.20
4833Qwen3 Next 80B A3B Instruct1038±159202.6%0.6%84 tps1.1s256K$0.20$1.42
4937Claude Sonnet 4.51040±82K3.2%1.4%41 tps1.3s200K$1.80$9.00
5062GPT-5.1 Instant1040±139152.7%1.3%50 tps1.9s400K$1.25$10.00
5168Qwen Plus (Aug'24)1048±227302.0%1.4%53 tps1.3s30K$0.40$1.20
5295Gemini 2.5 Flash1049±182.1K1.9%1.3%2 tps3.7s1M$0.30$2.50
5340Qwen3 235B A22B Instruct 25071053±196800.7%6.8%13 tps1.9s262K$0.13$0.52
5433Qwen3 30B A3B Instruct 25071056±188102.4%1.2%55 tps1.3s131K$0.13$0.72
5565GLM 4.61059±256402.3%5.4%39 tps1.5s200K$0.42$1.66
5652Claude Haiku 4.51060±131.6K3.1%1.1%100 tps0.9s200K$1.00$5.00
5726GPT-5 (High)1061±92.5K2.7%4.5%81 tps35.9s400K$1.25$10.00
5844DeepSeek V3.1 Terminus Chat1078±125801.7%3.4%27 tps1.5s131K$0.86$1.80
5942Qwen3 Max Instruct Preview1083±171.1K1.7%1.1%31 tps1.7s256K$1.43$6.61
6048Claude Sonnet 4 (Thinking)1093±141.6K2.4%1.5%52 tps1.5s200K$3.00$13.67
6129Qwen3 VL 235B A22B Instruct1094±156752.2%3.1%75 tps1.9s129K$0.37$1.81
6210Claude Sonnet 4.5 (Thinking)1102±133.2K3.6%1.9%44 tps1.1s200K$3.00$15.00
6342GPT-5.2 (Extra High) 1107±248902.7%13.2%17 tps20.5s400K$1.75$14.00
6433Kimi K2.51110±267202.0%6.5%33 tps1.7s262K$0.34$2.57
6513GPT-5.3 Instant1110±335151.0%0.9%63 tps0.8s400K$1.75$14.00
6632Gemini 2.5 Pro High1119±102.5K2.4%1.5%48 tps2.3s1M$1.25$10.00
6726Claude Haiku 4.5 (Extended Thinking)1123±191.1K1.9%1.4%115 tps0.7s200K$1.00$5.00
6844Gemini 2.5 Pro1125±63.1K3.7%2.3%45 tps2.6s1M$1.25$10.00
6917Claude Opus 4.51135±211.1K1.4%1.5%45 tps1.5s200K$5.00$25.00
7017GPT-5.2 (High)1145±152.2K1.6%6.7%18 tps16.3s400K$1.75$14.00
7116GPT-5.21162±187851.9%4.1%18 tps2.7s400K$1.75$14.00
7222GPT-5 Chat1164±123.5K1.6%1.3%95 tps0.9s400K$1.25$10.00
7317Gemini 3 Flash Preview1165±216751.5%1.3%138 tps1.4s1M$0.50$3.00
7414Gemini 3 Flash Preview Thinking1167±171.4K1.7%1.6%3 tps6.2s1M$0.50$3.00
755Claude Sonnet 4.6 (Thinking)1222±236301.6%4.7%57 tps1.1s200K$3.00$15.00
768GPT-5.11230±131.3K1.9%2.3%71 tps1.4s400K$1.42$11.33
778GPT-5.1 (High)1231±151.8K1.7%3.2%76 tps6.9s400K$1.25$10.00
7814Gemini 3 Pro (Low)1240±191.2K0.8%2.4%51 tps3.5s1M$2.00$12.00
796Gemini 3.1 Pro1244±231.4K1.7%3.5%35 tps4.1s1M$2.00$12.00
8010Gemini 3 Pro1248±163.5K1.5%2.1%50 tps3.6s1M$2.00$12.00
View All (85 models)