Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1079
Mistral Medium
1078
Claude Haiku 4.5
1076
Gemini 2.5 Flash Preview
1074
Qwen Turbo
1073
DeepSeek-R1 Turbo
1072
Kimi K2 Thinking
1071
MiniMax M2.1
1069
Kimi K2 Fast
1069
DeepSeek V3.1 Chat
1068
DeepSeek V3 0324 Turbo
1062
Grok 4
1061
Gemini 2.5 Flash
1061
Qwen3 32B Fast
1060
DeepSeek V3.1 Terminus Chat
1059
Nemotron 3 Nano (Thinking)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81113Mistral Medium1079±92.7K1.8%1.8%48 tps0.6s33K$1.48$4.55
8252Claude Haiku 4.51078±112.6K3.7%1.1%100 tps0.9s200K$1.00$5.00
83100Gemini 2.5 Flash Preview1076±146400.8%<0.1%138 tps6.9s1M$0.15$0.60
84159Qwen Turbo1074±72.9K1.2%<0.1%53 tps1.1s1M$0.05$0.20
8595DeepSeek-R1 Turbo1073±167803.7%2.6%29 tps1.8s64K$2.85$4.75
8695Kimi K2 Thinking1072±308808.8%4.2%61 tps5.9s262K$0.24$1.03
8760MiniMax M2.11071±112.6K1.1%2.1%66 tps2.6s205K$0.30$1.20
88113Kimi K2 Fast1069±78.5K1.0%0.8%365 tps0.5s131K$1.00$3.00
8986DeepSeek V3.1 Chat1069±111.3K3.0%2.8%21 tps1.6s131K$0.38$1.00
9093DeepSeek V3 0324 Turbo1068±74.2K0.8%6.3%12 tps2.4s164K$0.73$1.79
9168Grok 41062±610.5K1.3%3.9%29 tps11.1s256K$3.00$15.00
9295Gemini 2.5 Flash1061±710K1.0%1.3%2 tps3.7s1M$0.30$2.50
93121Qwen3 32B Fast1061±93K2.4%11.6%30 tps3.1s41K$0.10$0.25
9444DeepSeek V3.1 Terminus Chat1060±131.4K3.3%3.4%27 tps1.5s131K$0.86$1.80
9586Nemotron 3 Nano (Thinking)1059±188251.8%2.0%200 tps0.5s256K$0$0
9684GPT-5 Mini Minimal1057±117455.7%1.2%63 tps1.4s400K$0.25$2.00
9771Gemini 2.5 Flash Thinking1055±112.8K2.3%2.2%88 tps6.4s1M$0.30$2.50
9865DeepSeek V3.2 Exp Chat1054±141.2K3.7%2.6%29 tps1.5s131K$0.27$0.39
9952Qwen3.5 122B A17B1053±255803.3%1.5%82 tps1.4s256K$0.40$3.20
100133Qwen3 14B1053±131.7K2.9%1.7%109 tps0.8s41K$0.04$0.15
10195Gemini 2.5 Flash Lite Thinking Preview 09251051±151.5K4.2%1.5%152 tps3.0s1M$0.10$0.40
10256MiniMax M2.1 Lightning1050±236151.6%1.7%52 tps2.1s205K$0.30$2.40
103157Qwen3 Next 80B A3B Thinking1049±92K2.6%0.6%175 tps1.3s256K$0.21$2.26
104147GLM 4.5 Air1047±71.8K2.2%<0.1%22 tps1.4s131K$0.10$0.38
10568GLM 4.71047±152.4K1.2%5.8%40 tps1.5s200K$0.77$1.73
10671GPT-5 Mini1047±92.2K2.7%2.6%66 tps14.2s400K$0.25$2.00
10737Kimi K2.5 Instant1046±166202.4%2.9%32 tps3.0s262K$0.50$3.00
10895DeepSeek V3.2 Exp Thinking1046±187354.5%7.2%26 tps3.0s131K$0.28$0.42
109106DeepSeek V3 03241045±84.1K1.0%5.8%12 tps2.7s164K$0.38$0.93
110106Grok 31044±76K1.1%1.5%53 tps0.6s1M$3.67$18.33
111113Gemini 2.5 Flash Lite Thinking1041±92.2K1.8%1.0%118 tps4.4s1M$0.03$0.13
112133DeepSeek-R1 05281038±131.7K2.0%1.3%93 tps0.5s64K$1.60$3.67
11381OpenAI o3-pro1037±189502.6%5.2%22 tps70.8s200K$20.00$80.00
114165DeepSeek R1T2 Chimera1031±175753.4%3.0%28 tps1.8s164K$0.13$0.45
11548Claude Sonnet 4 (Thinking)1028±153.7K2.9%1.5%52 tps1.5s200K$3.00$13.67
116126Qwen3 VL 235B A22B Thinking1027±139354.1%4.3%47 tps3.0s127K$0.47$3.31
11762MiniMax M21027±92.5K5.2%2.2%39 tps2.3s205K$0.21$0.85
118129Qwen3 Max Thinking1022±121.3K1.1%13.5%32 tps2.3s256K$1.20$6.00
11971Qwen3.5 397B A17B1021±229101.1%4.3%57 tps1.4s256K$0.52$3.00
120124Kimi K2 0905 Turbo1017±122.1K2.3%0.7%373 tps0.5s262K$1.70$6.50
View All (223 models)