Last updated about 1 month ago

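| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 81 | GPT-4o | 1091 | ±2 | 23.5K | 0.7% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 82 | 71 | Seed 1.8 251228 | 1090 | ±3 | 14.9K | 1.5% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 83 | 52 | Claude Haiku 4.5 | 1089 | ±3 | 20.4K | 2.1% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 84 | 133 | Qwen3 14B | 1088 | ±4 | 8.2K | 2.3% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 85 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1087 | ±2 | 15.1K | 2.5% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 86 | 86 | DeepSeek V3.1 Chat | 1087 | ±3 | 10.7K | 1.8% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 87 | 71 | GPT-5 Mini | 1087 | ±3 | 11.3K | 2.1% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 88 | 95 | Qwen3 32B | 1085 | ±5 | 2.6K | 1.5% | 3.9% | 30 tps | 3.1s | 41K | $0.12 | $0.42 |
| 89 | 93 | Qwen Max | 1084 | ±2 | 54.8K | 0.9% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 90 | 95 | DeepSeek V3.2 Exp Thinking | 1084 | ±5 | 4.8K | 1.9% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 91 | 93 | DeepSeek V3 0324 Turbo | 1081 | ±3 | 50.9K | 1.4% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 92 | 101 | gpt-oss-20b | 1080 | ±2 | 14.2K | 1.7% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 93 | 133 | Nemotron 3 Nano | 1076 | ±8 | 1.6K | 1.9% | 1.3% | 216 tps | 0.8s | 256K | $0.05 | $4.94 |
| 94 | 106 | DeepSeek V3.1 Terminus Thinking | 1075 | ±4 | 6.7K | 1.9% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 95 | 161 | DeepSeek Prover v2 | 1075 | ±10 | 1.4K | 1.4% | 5.2% | 14 tps | 1.3s | 164K | $0.40 | $1.56 |
| 96 | 65 | GLM 4.6 | 1075 | ±3 | 11.7K | 2.8% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 97 | 121 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 1074 | ±6 | 3.5K | 2.2% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 98 | 71 | Gemini 3.1 Flash Lite Preview | 1073 | ±11 | 1.3K | 2.2% | 1.0% | 8 tps | 1.2s | 1M | $0.25 | $1.50 |
| 99 | 126 | Qwen3 30B A3B | 1073 | ±4 | 9.6K | 2.1% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 100 | 22 | MiniMax M2.7-highspeed | 1071 | ±11 | 675 | 2.2% | 2.3% | 50 tps | 2.1s | 205K | $0.60 | $2.40 |
| 101 | 111 | LongCat Flash Chat | 1071 | ±4 | 3.8K | 1.9% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |
| 102 | 133 | DeepSeek-R1 0528 | 1070 | ±6 | 4.4K | 1.6% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 103 | 106 | DeepSeek V3 0324 | 1067 | ±2 | 37.1K | 1.0% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93 |
| 104 | 113 | Kimi K2 Fast | 1067 | ±2 | 94.2K | 1.2% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 105 | 148 | DeepSeek-R1 | 1067 | ±5 | 4.9K | 1.4% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 106 | 113 | GLM 4.5 AirX | 1067 | ±5 | 3.5K | 1.8% | 3.3% | 75 tps | 1.2s | 131K | $1.10 | $4.50 |
| 107 | 101 | Gemini 2.5 Flash Lite | 1066 | ±2 | 36K | 1.7% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 108 | 81 | Qwen3.5 27B | 1065 | ±13 | 1.2K | 2.4% | 3.7% | 55 tps | 2.6s | 256K | $0.30 | $2.40 |
| 109 | 129 | DeepSeek V3.1 Thinking | 1063 | ±4 | 9.3K | 2.2% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 110 | 124 | Qwen3 235B A22B Thinking 2507 | 1061 | ±4 | 3.6K | 1.2% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 111 | 79 | MiniMax M2.5 Lightning | 1060 | ±5 | 4.1K | 1.5% | 1.5% | 51 tps | 2.0s | 205K | $0.60 | $2.40 |
| 112 | 95 | Gemini 2.5 Flash | 1060 | ±2 | 97.6K | 1.0% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 113 | 121 | Qwen3 32B Fast | 1060 | ±4 | 11.4K | 2.3% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 114 | 153 | Apriel 1.5 15B Thinker | 1059 | ±8 | 1.5K | 2.5% | 2.4% | 146 tps | 0.4s | 131K | $0 | $0 |
| 115 | 121 | QwQ 32B | 1058 | ±3 | 15.3K | 2.1% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 116 | 119 | GLM 4.7 FP8 | 1057 | ±6 | 2.2K | 1.6% | 6.9% | 40 tps | 1.3s | 200K | $0.30 | $1.20 |
| 117 | 71 | Gemini 2.5 Flash Thinking | 1054 | ±4 | 7.9K | 1.4% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 118 | 143 | Solar Pro 2 251215 | 1053 | ±9 | 755 | 1.3% | 1.8% | 107 tps | 1.5s | 66K | $0.15 | $0.60 |
| 119 | 95 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1052 | ±2 | 9.4K | 2.4% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40 |
| 120 | 113 | GLM 4.5 | 1050 | ±2 | 12.3K | 1.6% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |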
Showing ranks 81 to 120 of 283 models.