Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1068
Gemini 2.5 Flash
1070
Grok 4
1071
Gemini 3.1 Flash Lite Preview Thinking
1075
DeepSeek-R1 Turbo
1075
Grok 4 Fast Non-Reasoning
1076
Claude Haiku 4.5
1076
NVIDIA Llama 3.3 Nemotron Super 49B v1.5
1077
Grok 4 Fast Reasoning
1080
MiniMax M2.1
1083
Claude Sonnet 4
1083
GPT-5
1084
DeepSeek V3 0324
1085
GPT-5.1 Instant
1089
Nemotron 3 Nano (Thinking)
1090
Kimi K2.5

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
12195Gemini 2.5 Flash1068±411.2K1.2%1.3%2 tps3.7s1M$0.30$2.50
12268Grok 41070±413.8K1.6%3.9%29 tps11.1s256K$3.00$15.00
12356Gemini 3.1 Flash Lite Preview Thinking1071±135601.8%1.7%75 tps4.7s1M$0.25$1.50
12495DeepSeek-R1 Turbo1075±61.9K2.4%2.6%29 tps1.8s64K$2.85$4.75
12552Grok 4 Fast Non-Reasoning1075±62.9K3.3%1.5%93 tps0.6s2M$0.27$0.67
12652Claude Haiku 4.51076±84.2K2.2%1.1%100 tps0.9s200K$1.00$5.00
127121NVIDIA Llama 3.3 Nemotron Super 49B v1.51076±127551.9%2.0%50 tps0.6s131K$0.09$0.33
12848Grok 4 Fast Reasoning1077±63.3K2.8%2.1%102 tps3.1s2M$0.30$0.75
12960MiniMax M2.11080±65.2K0.6%2.1%66 tps2.6s205K$0.30$1.20
13086Claude Sonnet 41083±512K1.6%1.8%49 tps1.3s200K$3.00$15.00
13152GPT-51083±57.6K2.2%3.1%78 tps23.1s400K$1.25$9.67
132106DeepSeek V3 03241084±45.7K1.4%5.8%12 tps2.7s164K$0.38$0.93
13362GPT-5.1 Instant1085±63.7K1.1%1.3%50 tps1.9s400K$1.25$10.00
13486Nemotron 3 Nano (Thinking)1089±91.5K0.7%2.0%200 tps0.5s256K$0$0
13533Kimi K2.51090±64.5K0.7%6.5%33 tps1.7s262K$0.34$2.57
136121QwQ 32B1091±59.9K0.9%5.4%41 tps2.1s16K$0.43$0.56
137111LongCat Flash Chat1095±71.7K2.8%0.8%85 tps0.9s131K$0.14$0.68
13856DeepSeek V3.2 Thinking1096±63.8K0.9%9.0%30 tps2.6s131K$0.28$0.42
13986DeepSeek V3.1 Chat1097±71.9K2.3%2.8%21 tps1.6s131K$0.38$1.00
140121Qwen3 32B Fast1098±89K1.0%11.6%30 tps3.1s41K$0.10$0.25
14193DeepSeek V3 0324 Turbo1103±54.4K1.9%6.3%12 tps2.4s164K$0.73$1.79
14284GPT-5 Mini Minimal1107±109703.5%1.2%63 tps1.4s400K$0.25$2.00
14322GLM 51110±71.8K0.8%3.4%36 tps2.7s200K$0.72$2.55
14495Qwen3 32B1111±175151.9%3.9%30 tps3.1s41K$0.12$0.42
14593Qwen Max1111±67.6K1.4%1.5%49 tps1.5s33K$1.60$6.40
146101Gemini 2.5 Flash Lite1112±67.6K1.7%1.3%210 tps0.7s1M$0.10$0.40
14740DeepSeek V3.21113±53.6K0.8%1.4%83 tps5.1s131K$0.43$1.09
14856DeepSeek V3.1 Turbo1114±64K2.1%0.9%173 tps1.3s164K$2.00$3.75
14937Claude Sonnet 4.51116±65K3.1%1.4%41 tps1.3s200K$1.80$9.00
15044Grok 4.1 Fast Reasoning1119±65.4K1.5%1.5%58 tps7.3s2M$0.20$0.50
15144Gemini 2.5 Pro1126±416.2K1.5%2.3%45 tps2.6s1M$1.25$10.00
15242Qwen3 Max Instruct Preview1126±44.3K2.8%1.1%31 tps1.7s256K$1.43$6.61
15326Claude Haiku 4.5 (Extended Thinking)1129±53.6K1.6%1.4%115 tps0.7s200K$1.00$5.00
15442GPT-5.2 (Extra High) 1131±53.7K0.9%13.2%17 tps20.5s400K$1.75$14.00
15510Claude Sonnet 4.5 (Thinking)1136±56.8K2.7%1.9%44 tps1.1s200K$3.00$15.00
15629Nova Experimental Chat 12-101138±91.9K0.5%2.4%84 tps12.9s98K$0$0
15737Qwen3 Omni 30B A3B Thinking1139±71.6K1.2%3.7%67 tps1.2s66K$0.97$1.79
15848Step 3.5 Flash1140±159650.5%2.2%109 tps0.6s256K$0.05$0.15
15937Kimi K2.5 Instant1140±101.1K1.4%2.9%32 tps3.0s262K$0.50$3.00
16017Claude Opus 4.51144±82.4K2.1%1.5%45 tps1.5s200K$5.00$25.00
View All (193 models)