Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1083
GPT-5
1083
Claude Sonnet 4
1080
MiniMax M2.1
1079
Qwen Turbo
1077
GPT-5 (Minimal)
1077
Grok 4 Fast Reasoning
1077
Claude Opus 4
1076
NVIDIA Llama 3.3 Nemotron Super 49B v1.5
1076
Claude Haiku 4.5
1075
Grok 4 Fast Non-Reasoning
1075
DeepSeek-R1 Turbo
1071
Gemini 3.1 Flash Lite Preview Thinking
1070
Grok 4
1068
Gemini 2.5 Flash
1067
MiniMax M2.1 Lightning

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8152GPT-51083±57.6K2.2%3.1%78 tps23.1s400K$1.25$9.67
8286Claude Sonnet 41083±512K1.6%1.8%49 tps1.3s200K$3.00$15.00
8360MiniMax M2.11080±65.2K0.6%2.1%66 tps2.6s205K$0.30$1.20
84159Qwen Turbo1079±83.9K1.4%<0.1%53 tps1.1s1M$0.05$0.20
8580GPT-5 (Minimal)1077±53K3.6%<0.1%67 tps1.4s400K$1.25$10.00
8648Grok 4 Fast Reasoning1077±63.3K2.8%2.1%102 tps3.1s2M$0.30$0.75
8721Claude Opus 41077±139202.6%<0.1%25 tps1.5s200K$15.00$75.00
88121NVIDIA Llama 3.3 Nemotron Super 49B v1.51076±127551.9%2.0%50 tps0.6s131K$0.09$0.33
8952Claude Haiku 4.51076±84.2K2.2%1.1%100 tps0.9s200K$1.00$5.00
9052Grok 4 Fast Non-Reasoning1075±62.9K3.3%1.5%93 tps0.6s2M$0.27$0.67
9195DeepSeek-R1 Turbo1075±61.9K2.4%2.6%29 tps1.8s64K$2.85$4.75
9256Gemini 3.1 Flash Lite Preview Thinking1071±135601.8%1.7%75 tps4.7s1M$0.25$1.50
9368Grok 41070±413.8K1.6%3.9%29 tps11.1s256K$3.00$15.00
9495Gemini 2.5 Flash1068±411.2K1.2%1.3%2 tps3.7s1M$0.30$2.50
9556MiniMax M2.1 Lightning1067±128550.6%1.7%52 tps2.1s205K$0.30$2.40
96292AFM 4.5B1067±62.1K1.6%<0.1%81 tps0.3s66K$0.05$0.20
9779Qwen3 Max Thinking Preview1067±63.1K1.4%3.1%40 tps2.1s256K$1.20$6.00
98108GPT-5 Mini Low1067±117554.4%<0.1%69 tps3.2s400K$0.25$2.00
9971Gemini 2.5 Flash Lite Preview 09251066±63.3K2.8%1.2%209 tps0.7s1M$0.25$0.35
100124Qwen3 235B A22B Thinking 25071065±71.8K1.9%2.5%53 tps1.6s131K$0.59$5.70
10144Kimi K2 Thinking Turbo1065±63K1.9%2.0%75 tps1.4s262K$1.15$8.00
10265Mistral Large 31064±71.8K2.2%2.1%51 tps1.0s256K$0.50$1.50
103118GPT-4.1 mini1062±55.5K1.8%1.1%67 tps0.9s1M$0.34$1.60
10481OpenAI o3-pro1061±141.3K2.7%5.2%22 tps70.8s200K$20.00$80.00
105133Solar Pro 2 2507101060±54.8K1.5%<0.1%9 tpsN/A66K$0.50$0.50
106106Grok 31054±67.1K1.7%1.5%53 tps0.6s1M$3.67$18.33
10795Kimi K2 Thinking1054±91.9K3.8%4.2%61 tps5.9s262K$0.24$1.03
10871DeepSeek V3.11053±131.8K1.6%0.8%197 tps0.4s164K$0.55$1.60
10944DeepSeek V3.1 Terminus Chat1053±62.6K2.6%3.4%27 tps1.5s131K$0.86$1.80
110133Gemini 2.5 Pro Preview 06051051±12655<0.1%<0.1%0 tps3.7s1M$1.25$10.00
111126Qwen3 30B A3B1051±73.9K1.3%5.1%163 tps1.0s41K$0.06$0.21
11265DeepSeek V3.2 Exp Chat1047±92.2K3.1%2.6%29 tps1.5s131K$0.27$0.39
11362MiniMax M21046±63.8K1.9%2.2%39 tps2.3s205K$0.21$0.85
114119ERNIE 4.5 300B A47B1046±65.3K1.3%4.7%23 tps2.3s123K$0.28$1.10
115133GPT-4.1 nano1046±85.1K2.0%0.6%175 tps0.5s1M$0.10$0.40
116241OLMo 3 7B Think1045±127101.4%4.2%77 tps0.4s66K$0.12$0.20
11748Claude Sonnet 4 (Thinking)1044±58.4K2.3%1.5%52 tps1.5s200K$3.00$13.67
11871Gemini 2.5 Flash Thinking1042±56.5K1.5%2.2%88 tps6.4s1M$0.30$2.50
119182Gemini 2.5 Flash Preview Thinking1041±166201.6%<0.1%26 tps1.8s1M$0.15$1.76
12071Qwen3.5 397B A17B1040±101.4K1.4%4.3%57 tps1.4s256K$0.52$3.00
View All (260 models)