
Last updated about 1 month ago

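| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41 | 26 | Grok 4.1 Fast Non-Reasoning | 1130 | ±15 | 2K | 4.3% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 42 | 33 | Qwen3 30B A3B Instruct 2507 | 1127 | ±9 | 2.5K | 2.9% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 43 | 81 | OpenAI o3-pro | 1126 | ±10 | 2.3K | 3.1% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 44 | 40 | Qwen3 235B A22B Instruct 2507 | 1118 | ±11 | 2.5K | 2.1% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 45 | 68 | Qwen Plus (Aug'24) | 1115 | ±8 | 1.9K | 2.6% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 46 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1115 | ±12 | 2.2K | 2.7% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 47 | 29 | Qwen3 VL 235B A22B Instruct | 1114 | ±8 | 1.3K | 2.5% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 48 | 21 | Claude Opus 4 (Thinking) | 1114 | ±8 | 770 | 3.1% | <0.1% | 28 tps | 1.3s | 200K | $15.00 | $75.00 |
| 49 | 71 | Qwen3.5 397B A17B | 1112 | ±24 | 580 | 2.5% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 50 | 26 | GPT-5 (High) | 1110 | ±7 | 4.3K | 3.1% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 51 | 37 | Kimi K2.5 Instant | 1101 | ±28 | 495 | 1.0% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 52 | 95 | Gemini 2.5 Flash | 1098 | ±9 | 4.8K | 2.7% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 53 | 68 | Grok 4 | 1093 | ±5 | 5.7K | 4.0% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 54 | 48 | Grok 4 Fast Reasoning | 1090 | ±11 | 2.1K | 3.1% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 55 | 21 | Claude Opus 4 | 1087 | ±9 | 3.2K | 2.9% | <0.1% | 25 tps | 1.5s | 200K | $15.00 | $75.00 |
| 56 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1083 | ±32 | 485 | 3.0% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 57 | 33 | Kimi K2.5 | 1083 | ±16 | 1.7K | 3.2% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 58 | 48 | gpt-oss-120b | 1083 | ±7 | 3.5K | 3.0% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 59 | 56 | Claude Opus 4.1 (Thinking) | 1083 | ±6 | 2K | 4.1% | <0.1% | 20 tps | 3.9s | 200K | $15.00 | $75.00 |
| 60 | 44 | Grok 4.1 Fast Reasoning | 1076 | ±10 | 2.6K | 4.2% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 |
| 61 | 71 | GPT-5 Mini | 1075 | ±9 | 2.1K | 4.3% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 62 | 62 | GPT-5.1 Instant | 1075 | ±9 | 2.2K | 2.6% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 63 | 93 | DeepSeek V3 0324 Turbo | 1074 | ±14 | 2.1K | 1.9% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 64 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1066 | ±11 | 2.2K | 2.8% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 65 | 86 | Claude Sonnet 4 | 1066 | ±8 | 5.3K | 2.5% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 66 | 42 | Qwen3 Max Instruct Preview | 1063 | ±7 | 2.7K | 1.5% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |
| 67 | 40 | DeepSeek V3.2 | 1063 | ±16 | 1.1K | 2.5% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09 |
| 68 | 52 | Claude Haiku 4.5 | 1057 | ±6 | 3.4K | 3.4% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 69 | 52 | Grok 4 Fast Non-Reasoning | 1054 | ±8 | 1.6K | 2.5% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 |
| 70 | 81 | GPT-4o | 1046 | ±15 | 1.4K | 2.5% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 71 | 48 | OpenAI o1-mini | 1045 | ±8 | 1.8K | 3.5% | <0.1% | 118 tps | N/A | 128K | $1.13 | $4.51 |
| 72 | 60 | MiniMax M2.1 | 1044 | ±12 | 1.7K | 2.8% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 73 | 95 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1044 | ±9 | 1.7K | 3.5% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40 |
| 74 | 62 | MiniMax M2 | 1043 | ±9 | 1.8K | 4.2% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 75 | 65 | GLM 4.6 | 1041 | ±11 | 1.6K | 2.9% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 76 | 44 | DeepSeek V3.1 Terminus Chat | 1037 | ±9 | 1.3K | 2.2% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 77 | 56 | DeepSeek V3.2 Thinking | 1033 | ±15 | 1.7K | 2.0% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 78 | 77 | Claude Opus 4.1 | 1032 | ±11 | 2K | 2.7% | 3.0% | 17 tps | 3.7s | 200K | $15.00 | $75.00 |
| 79 | 129 | Qwen3 Max Thinking | 1029 | ±31 | 600 | 2.4% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 80 | 65 | Mistral Large 3 | 1026 | ±22 | 1.1K | 4.1% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |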
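
Because each VIBE Score above comes with a confidence interval, a gap of a few points between neighboring models may not be meaningful. The sketch below shows one rough way to check this: treat the "±" value as a symmetric half-width around the score (an assumption, since the leaderboard does not say how the interval is computed) and test whether two intervals overlap. The `Entry` class and `intervals_overlap` function are hypothetical names introduced for illustration, and overlap is only a heuristic, not a formal significance test.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    """One leaderboard row, reduced to the fields needed here (hypothetical type)."""
    name: str
    score: float
    half_width: float  # the "±" value, ASSUMED to be a symmetric half-width

    @property
    def low(self) -> float:
        return self.score - self.half_width

    @property
    def high(self) -> float:
        return self.score + self.half_width


def intervals_overlap(a: Entry, b: Entry) -> bool:
    """Return True if the two score intervals share at least one point."""
    return a.low <= b.high and b.low <= a.high


# Values copied from the leaderboard rows above.
grok = Entry("Grok 4.1 Fast Non-Reasoning", 1130, 15)   # [1115, 1145]
gpt5 = Entry("GPT-5 (High)", 1110, 7)                   # [1103, 1117]
opus4 = Entry("Claude Opus 4", 1087, 9)                 # [1078, 1096]

print(intervals_overlap(grok, gpt5))   # True: 1115 <= 1117
print(intervals_overlap(grok, opus4))  # False: 1096 < 1115
```

Under these assumptions, the 20-point gap between Grok 4.1 Fast Non-Reasoning and GPT-5 (High) falls within overlapping intervals, while the 43-point gap to Claude Opus 4 does not.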
Showing ranks 41 to 80 of 159 models.