Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1053
Seed 2.0 Mini (Medium)
1051
Grok 3 Fast
1048
OpenAI o3-pro
1047
Qwen3 Omni 30B A3B Thinking
1046
GLM 4.7 FP8
1044
Qwen3 Max Thinking Preview
1042
Seed 1.6 250615
1041
Kimi K2 0905 Turbo
1038
Gemini 2.0 Flash
1038
Gemini 2.5 Flash Lite
1037
OpenAI o1
1035
GPT-5 Nano
1034
DeepSeek V3 (Turbo)
1032
ERNIE 4.5 300B A47B
1029
DeepSeek V3.1

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81139Seed 2.0 Mini (Medium)1053±216054.0%11.9%33 tps1.7s256K$0.15$0.60
82111Grok 3 Fast1051±171.1K2.6%1.7%52 tps2.4s131K$5.00$25.00
8381OpenAI o3-pro1048±82.2K3.5%5.2%22 tps70.8s200K$20.00$80.00
8437Qwen3 Omni 30B A3B Thinking1047±111.3K5.9%3.7%67 tps1.2s66K$0.97$1.79
85119GLM 4.7 FP81046±194903.0%6.9%40 tps1.3s200K$0.30$1.20
8679Qwen3 Max Thinking Preview1044±55.1K7.7%3.1%40 tps2.1s256K$1.20$6.00
87143Seed 1.6 2506151042±131.2K4.8%3.1%46 tps2.2s256K$0.25$2.00
88124Kimi K2 0905 Turbo1041±46.8K12.4%0.7%373 tps0.5s262K$1.70$6.50
89143Gemini 2.0 Flash1038±63.7K8.9%<0.1%76 tps0.5s1M$0.14$0.56
90101Gemini 2.5 Flash Lite1038±512.8K12.6%1.3%210 tps0.7s1M$0.10$0.40
91153OpenAI o11037±151.2K4.8%4.2%92 tps5.5s200K$15.00$60.00
92157GPT-5 Nano1035±63.8K10.6%3.2%113 tps20.9s400K$0.05$0.40
93101DeepSeek V3 (Turbo)1034±111K5.9%1.5%32 tps1.5s64K$0.40$1.30
94119ERNIE 4.5 300B A47B1032±66.1K8.7%4.7%23 tps2.3s123K$0.28$1.10
9571DeepSeek V3.11029±101.1K4.5%0.8%197 tps0.4s164K$0.55$1.60
9686Amazon Nova 2 Lite1027±102.6K7.9%1.0%137 tps0.6s300K$0.35$2.95
97113GLM 4.5 AirX1020±108059.0%3.3%75 tps1.2s131K$1.10$4.50
98133GPT-4.1 nano1020±48.8K9.7%0.6%175 tps0.5s1M$0.10$0.40
99124Qwen3 235B A22B Thinking 25071018±111.1K4.2%2.5%53 tps1.6s131K$0.59$5.70
100148OpenAI o31016±111.3K4.6%0.9%85 tps6.8s128K$7.33$29.33
101129DeepSeek V3.1 Thinking1014±73.9K14.0%7.1%18 tps1.8s131K$0.23$0.75
102143Gemini 2.0 Flash Lite1011±65.7K6.9%<0.1%42 tps0.5s1M$0.08$0.30
103113GLM 4.51002±53.7K14.3%3.7%46 tps1.4s131K$0.43$1.63
104121NVIDIA Llama 3.3 Nemotron Super 49B v1.51000±161K9.9%2.0%50 tps0.6s131K$0.09$0.33
105139OpenAI o4-mini1000±54.8K10.2%1.4%97 tps7.0s128K$1.10$4.40
106111LongCat Flash Chat996±149307.0%0.8%85 tps0.9s131K$0.14$0.68
107133Kimi K2 0905991±67.5K5.6%4.0%30 tps1.4s262K$0.63$2.39
108129Qwen3 Max Thinking990±131.7K2.3%13.5%32 tps2.3s256K$1.20$6.00
109157Cogito v2.1 671B984±177155.9%0.8%85 tps0.5s128K$1.25$1.25
110126Qwen3 VL 235B A22B Thinking979±63.5K11.5%4.3%47 tps3.0s127K$0.47$3.31
111165DeepSeek R1T2 Chimera978±101.1K11.0%3.0%28 tps1.8s164K$0.13$0.45
112214Qwen 2.5 VL 32B Instruct977±208507.6%6.3%43 tps3.2s128K$0.35$0.62
113209Seed 1.6 Flash 250715974±169806.2%2.5%108 tps1.6s256K$0.07$0.30
114177OpenAI o3-mini946±66.7K12.3%0.8%143 tps3.3s200K$1.10$4.40
115153Ministral 14B 3.0945±2849011.7%2.0%119 tps0.5s128K$0.20$0.20
116170Devstral Medium945±111.6K14.7%1.5%77 tps0.6s131K$0.40$2.00
117201GPT-4o mini939±91.4K9.2%2.1%71 tps1.7s128K$0.15$0.60
118139GLM 4.6V938±112.5K6.1%6.4%21 tps1.8s128K$0.38$0.90
119148OpenAI o4-mini-high937±66K14.9%1.9%117 tps15.9s200K$1.10$4.40
120175OpenAI o3-mini-low937±46.1K13.6%0.7%139 tps1.5s200K$1.10$4.40
View All (170 models)