Last updated about 1 month ago

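| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 42 | Qwen3 Max Instruct Preview | 1063 | ±7 | 2.7K | 1.5% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |
| 82 | 86 | Claude Sonnet 4 | 1066 | ±8 | 5.3K | 2.5% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 83 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1066 | ±11 | 2.2K | 2.8% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 84 | 93 | DeepSeek V3 0324 Turbo | 1074 | ±14 | 2.1K | 1.9% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 85 | 62 | GPT-5.1 Instant | 1075 | ±9 | 2.2K | 2.6% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 86 | 71 | GPT-5 Mini | 1075 | ±9 | 2.1K | 4.3% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 87 | 44 | Grok 4.1 Fast Reasoning | 1076 | ±10 | 2.6K | 4.2% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 |
| 88 | 48 | gpt-oss-120b | 1083 | ±7 | 3.5K | 3.0% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 89 | 33 | Kimi K2.5 | 1083 | ±16 | 1.7K | 3.2% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 90 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1083 | ±32 | 485 | 3.0% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 91 | 48 | Grok 4 Fast Reasoning | 1090 | ±11 | 2.1K | 3.1% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 92 | 68 | Grok 4 | 1093 | ±5 | 5.7K | 4.0% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 93 | 95 | Gemini 2.5 Flash | 1098 | ±9 | 4.8K | 2.7% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 94 | 37 | Kimi K2.5 Instant | 1101 | ±28 | 495 | 1.0% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 95 | 26 | GPT-5 (High) | 1110 | ±7 | 4.3K | 3.1% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 96 | 71 | Qwen3.5 397B A17B | 1112 | ±24 | 580 | 2.5% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 97 | 29 | Qwen3 VL 235B A22B Instruct | 1114 | ±8 | 1.3K | 2.5% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 98 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1115 | ±12 | 2.2K | 2.7% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 99 | 68 | Qwen Plus (Aug'24) | 1115 | ±8 | 1.9K | 2.6% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 100 | 40 | Qwen3 235B A22B Instruct 2507 | 1118 | ±11 | 2.5K | 2.1% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 101 | 81 | OpenAI o3-pro | 1126 | ±10 | 2.3K | 3.1% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 102 | 33 | Qwen3 30B A3B Instruct 2507 | 1127 | ±9 | 2.5K | 2.9% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 103 | 26 | Grok 4.1 Fast Non-Reasoning | 1130 | ±15 | 2K | 4.3% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 104 | 60 | Gemini 2.5 Flash Preview 0925 | 1131 | ±10 | 2.1K | 2.7% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 105 | 29 | Nova Experimental Chat 12-10 | 1135 | ±19 | 720 | 2.0% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0 |
| 106 | 33 | Qwen3 Next 80B A3B Instruct | 1142 | ±9 | 1.8K | 2.5% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 107 | 48 | Claude Sonnet 4 (Thinking) | 1144 | ±6 | 4.2K | 3.5% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 108 | 71 | Gemini 2.5 Flash Thinking | 1148 | ±7 | 2.6K | 4.2% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 109 | 37 | Claude Sonnet 4.5 | 1156 | ±5 | 5.1K | 2.8% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 110 | 52 | GPT-5 | 1157 | ±7 | 5.3K | 2.9% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 111 | 44 | Gemini 2.5 Pro | 1158 | ±5 | 7.8K | 3.3% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 112 | 13 | GPT-5.3 Instant | 1160 | ±19 | 1.5K | 1.9% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 113 | 42 | GPT-5.2 (Extra High) | 1168 | ±12 | 2.4K | 1.8% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 114 | 17 | GPT-5.2 (High) | 1175 | ±10 | 6.4K | 1.7% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 115 | 22 | GLM 5 | 1184 | ±22 | 795 | 2.5% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 116 | 32 | Gemini 2.5 Pro High | 1189 | ±6 | 4.5K | 2.5% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 117 | 17 | Gemini 3 Flash Preview | 1198 | ±14 | 2K | 2.0% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 118 | 14 | Gemini 3 Flash Preview Thinking | 1212 | ±8 | 4.2K | 1.8% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 119 | 16 | GPT-5.2 | 1213 | ±12 | 2.7K | 2.2% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 120 | 22 | GPT-5 Chat | 1220 | ±6 | 10.4K | 2.1% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |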
Ranks 81 to 120 shown, out of 133 models in total.