Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1110
Claude Opus 4.5
1110
MiniMax M2
1109
Gemini 2.5 Pro Preview 0605
1108
GPT-4.5 Preview
1107
Qwen3.5 397B A17B
1107
DeepSeek V3.1 Nex N1
1106
Qwen3 Max Thinking Preview
1105
DeepSeek V3 (Turbo)
1105
Gemini 3.1 Flash Lite Preview Thinking
1105
GLM 4.7
1104
DeepSeek-R1 Turbo
1100
GPT-5 Mini Low
1099
Amazon Nova 2 Lite
1098
Qwen Plus (Aug'24)
1097
GPT-5 (Low)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8117Claude Opus 4.51110±412.9K2.2%1.5%45 tps1.5s200K$5.00$25.00
8262MiniMax M21110±317.2K2.5%2.2%39 tps2.3s205K$0.21$0.85
83133Gemini 2.5 Pro Preview 06051109±135301.9%<0.1%0 tps3.7s1M$1.25$10.00
8477GPT-4.5 Preview1108±54.8K0.8%<0.1%36 tps3.0s200K$75.00$150.00
8571Qwen3.5 397B A17B1107±65.1K1.6%4.3%57 tps1.4s256K$0.52$3.00
8686DeepSeek V3.1 Nex N11107±81.5K1.3%3.4%24 tps7.2s131K$0.14$0.50
8779Qwen3 Max Thinking Preview1106±413.3K2.0%3.1%40 tps2.1s256K$1.20$6.00
88101DeepSeek V3 (Turbo)1105±53.7K1.5%1.5%32 tps1.5s64K$0.40$1.30
8956Gemini 3.1 Flash Lite Preview Thinking1105±82K1.7%1.7%75 tps4.7s1M$0.25$1.50
9068GLM 4.71105±321K1.0%5.8%40 tps1.5s200K$0.77$1.73
9195DeepSeek-R1 Turbo1104±54.8K1.5%2.6%29 tps1.8s64K$2.85$4.75
92108GPT-5 Mini Low1100±44.3K2.4%<0.1%69 tps3.2s400K$0.25$2.00
9386Amazon Nova 2 Lite1099±410.5K2.7%1.0%137 tps0.6s300K$0.35$2.95
9468Qwen Plus (Aug'24)1098±250.5K1.1%1.4%53 tps1.3s30K$0.40$1.20
95101GPT-5 (Low)1097±71.5K1.0%1.8%75 tps8.2s400K$1.25$10.00
9662GPT-5.1 Instant1096±314.9K1.5%1.3%50 tps1.9s400K$1.25$10.00
9784GPT-5 Mini Minimal1094±34.9K3.0%1.2%63 tps1.4s400K$0.25$2.00
98111Solar Pro 3 (Reasoning)1093±72.6K1.5%3.2%118 tps1.2s131K$0.15$0.60
9995Kimi K2 Thinking1092±45.4K2.0%4.2%61 tps5.9s262K$0.24$1.03
10037Claude Sonnet 4.51092±225.2K2.2%1.4%41 tps1.3s200K$1.80$9.00
10171MiniMax M2.5 FP81092±102.1K1.6%3.6%33 tps1.7s205K$0.45$1.75
10281GPT-4o1091±223.5K0.7%1.0%49 tps2.4s128K$3.71$12.57
10371Seed 1.8 2512281090±314.9K1.5%3.7%41 tps2.1s256K$0.25$2.00
10452Claude Haiku 4.51089±320.4K2.1%1.1%100 tps0.9s200K$1.00$5.00
105133Qwen3 14B1088±48.2K2.3%1.7%109 tps0.8s41K$0.04$0.15
10671Gemini 2.5 Flash Lite Preview 09251087±215.1K2.5%1.2%209 tps0.7s1M$0.25$0.35
10786DeepSeek V3.1 Chat1087±310.7K1.8%2.8%21 tps1.6s131K$0.38$1.00
10871GPT-5 Mini1087±311.3K2.1%2.6%66 tps14.2s400K$0.25$2.00
10951GPT-5.2 (Medium)1087±106852.1%<0.1%39 tps2.5s400K$1.75$14.00
11095Qwen3 32B1085±52.6K1.5%3.9%30 tps3.1s41K$0.12$0.42
11193Qwen Max1084±254.8K0.9%1.5%49 tps1.5s33K$1.60$6.40
11295DeepSeek V3.2 Exp Thinking1084±54.8K1.9%7.2%26 tps3.0s131K$0.28$0.42
113100Gemini 2.5 Flash Preview1082±58.8K0.6%<0.1%138 tps6.9s1M$0.15$0.60
114133Solar Pro 2 2507101081±221.6K1.4%<0.1%9 tpsN/A66K$0.50$0.50
11593DeepSeek V3 0324 Turbo1081±350.9K1.4%6.3%12 tps2.4s164K$0.73$1.79
116101gpt-oss-20b1080±214.2K1.7%0.5%216 tps0.5s131K$0.06$0.26
117147Grok 4 0709 EU1077±72.4K1.8%<0.1%33 tps8.2s128K$3.00$15.00
118133Nemotron 3 Nano1076±81.6K1.9%1.3%216 tps0.8s256K$0.05$4.94
119106DeepSeek V3.1 Terminus Thinking1075±46.7K1.9%5.9%27 tps1.8s131K$0.56$1.68
120161DeepSeek Prover v21075±101.4K1.4%5.2%14 tps1.3s164K$0.40$1.56
View All (410 models)