Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1038
DeepSeek V3 0324 Turbo
1038
GLM 4.5
1040
Qwen3 Omni 30B A3B Thinking
1042
Gemini 2.5 Flash Lite
1042
Amazon Nova 2 Lite
1043
Mistral Medium
1045
GPT-4.1 mini
1049
Grok 4 Fast Reasoning
1055
Claude Sonnet 3.5 v2
1056
DeepSeek V3.2
1056
Qwen3.5 27B
1056
DeepSeek V3.1 Terminus Chat
1058
Grok 4.1 Fast Non-Reasoning
1060
Gemini 3.1 Flash Lite Preview
1063
Qwen3.5 122B A17B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8193DeepSeek V3 0324 Turbo1038±92.2K1.8%6.3%12 tps2.4s164K$0.73$1.79
82113GLM 4.51038±129153.2%3.7%46 tps1.4s131K$0.43$1.63
8337Qwen3 Omni 30B A3B Thinking1040±207502.0%3.7%67 tps1.2s66K$0.97$1.79
84101Gemini 2.5 Flash Lite1042±67.8K4.3%1.3%210 tps0.7s1M$0.10$0.40
8586Amazon Nova 2 Lite1042±236902.1%1.0%137 tps0.6s300K$0.35$2.95
86113Mistral Medium1043±111.1K3.1%1.8%48 tps0.6s33K$1.48$4.55
87118GPT-4.1 mini1045±83.4K2.5%1.1%67 tps0.9s1M$0.34$1.60
8848Grok 4 Fast Reasoning1049±102.3K3.6%2.1%102 tps3.1s2M$0.30$0.75
89106Claude Sonnet 3.5 v21055±227703.8%<0.1%46 tps1.4s200K$3.00$15.00
9040DeepSeek V3.21056±151.4K1.4%1.4%83 tps5.1s131K$0.43$1.09
9181Qwen3.5 27B1056±176652.9%3.7%55 tps2.6s256K$0.30$2.40
9244DeepSeek V3.1 Terminus Chat1056±139552.1%3.4%27 tps1.5s131K$0.86$1.80
9326Grok 4.1 Fast Non-Reasoning1058±192K4.1%0.9%101 tps0.5s2M$0.20$0.50
9471Gemini 3.1 Flash Lite Preview1060±221.2K3.3%1.0%8 tps1.2s1M$0.25$1.50
9552Qwen3.5 122B A17B1063±149801.5%1.5%82 tps1.4s256K$0.40$3.20
9671Qwen3.5 397B A17B1067±151.3K2.2%4.3%57 tps1.4s256K$0.52$3.00
9748Step 3.5 Flash1067±206302.3%2.2%109 tps0.6s256K$0.05$0.15
9856DeepSeek V3.1 Turbo1070±121.3K2.6%0.9%173 tps1.3s164K$2.00$3.75
99113Gemini 2.5 Flash Lite Thinking1071±102.3K3.2%1.0%118 tps4.4s1M$0.03$0.13
10068GLM 4.71071±121.9K2.1%5.8%40 tps1.5s200K$0.77$1.73
10165DeepSeek V3.2 Exp Chat1072±127552.6%2.6%29 tps1.5s131K$0.27$0.39
10244Kimi K2 Thinking Turbo1072±171.3K2.2%2.0%75 tps1.4s262K$1.15$8.00
10348gpt-oss-120b1074±63K2.6%0.7%213 tps0.5s131K$0.11$0.50
10433Qwen3 Next 80B A3B Instruct1083±161.5K2.6%0.6%84 tps1.1s256K$0.20$1.42
10533Qwen3 30B A3B Instruct 25071084±82.3K3.2%1.2%55 tps1.3s131K$0.13$0.72
10671DeepSeek V3.11085±146903.5%0.8%197 tps0.4s164K$0.55$1.60
10795Gemini 2.5 Flash Lite Thinking Preview 09251086±83.4K2.7%1.5%152 tps3.0s1M$0.10$0.40
10833Kimi K2.51090±134.3K2.1%6.5%33 tps1.7s262K$0.34$2.57
10937Kimi K2.5 Instant1093±121.4K2.7%2.9%32 tps3.0s262K$0.50$3.00
11068Grok 41102±77.8K3.3%3.9%29 tps11.1s256K$3.00$15.00
11142Qwen3 Max Instruct Preview1103±132.2K2.0%1.1%31 tps1.7s256K$1.43$6.61
11295Gemini 2.5 Flash1104±79.8K2.7%1.3%2 tps3.7s1M$0.30$2.50
11329Nova Experimental Chat 12-101110±257101.4%2.4%84 tps12.9s98K$0$0
11481GPT-4o1113±82.2K3.6%1.0%49 tps2.4s128K$3.71$12.57
11552GPT-51115±85.4K3.7%3.1%78 tps23.1s400K$1.25$9.67
11626GPT-5 (High)1115±73.7K3.6%4.5%81 tps35.9s400K$1.25$10.00
11748Claude Sonnet 4 (Thinking)1116±75.3K4.2%1.5%52 tps1.5s200K$3.00$13.67
11860Gemini 2.5 Flash Preview 09251124±93.4K3.4%1.2%5 tps0.9s1M$0.13$0.97
11940Qwen3 235B A22B Instruct 25071126±82.1K2.5%6.8%13 tps1.9s262K$0.13$0.52
12022GLM 51132±121.7K1.4%3.4%36 tps2.7s200K$0.72$2.55
View All (154 models)