Last updated about 1 month ago

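| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 121 | 111 | Grok 3 Fast | 1051 | ±17 | 1.1K | 2.6% | 1.7% | 52 tps | 2.4s | 131K | $5.00 | $25.00 |
| 122 | 147 | Grok 4 0709 EU | 1051 | ±12 | 920 | 10.7% | <0.1% | 33 tps | 8.2s | 128K | $3.00 | $15.00 |
| 123 | 81 | OpenAI o3-pro | 1048 | ±8 | 2.2K | 3.5% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 124 | 37 | Qwen3 Omni 30B A3B Thinking | 1047 | ±11 | 1.3K | 5.9% | 3.7% | 67 tps | 1.2s | 66K | $0.97 | $1.79 |
| 125 | 106 | DeepSeek V3.1 Terminus Thinking | 1047 | ±7 | 2.5K | 11.6% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 126 | 119 | GLM 4.7 FP8 | 1046 | ±19 | 490 | 3.0% | 6.9% | 40 tps | 1.3s | 200K | $0.30 | $1.20 |
| 127 | 100 | Qwen Plus 0728 (Thinking) | 1045 | ±10 | 890 | 11.0% | <0.1% | 56 tps | 1.1s | 1M | $0.40 | $4.00 |
| 128 | 79 | Qwen3 Max Thinking Preview | 1044 | ±5 | 5.1K | 7.7% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 129 | 143 | Seed 1.6 250615 | 1042 | ±13 | 1.2K | 4.8% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 130 | 165 | Pixtral Large | 1042 | ±8 | 2.5K | 5.1% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 131 | 133 | Solar Pro 2 250710 | 1041 | ±6 | 6.4K | 14.2% | <0.1% | 9 tps | N/A | 66K | $0.50 | $0.50 |
| 132 | 124 | Kimi K2 0905 Turbo | 1041 | ±4 | 6.8K | 12.4% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50 |
| 133 | 143 | Gemini 2.0 Flash | 1038 | ±6 | 3.7K | 8.9% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 134 | 101 | Gemini 2.5 Flash Lite | 1038 | ±5 | 12.8K | 12.6% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 135 | 153 | OpenAI o1 | 1037 | ±15 | 1.2K | 4.8% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 136 | 157 | GPT-5 Nano | 1035 | ±6 | 3.8K | 10.6% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 137 | 101 | DeepSeek V3 (Turbo) | 1034 | ±11 | 1K | 5.9% | 1.5% | 32 tps | 1.5s | 64K | $0.40 | $1.30 |
| 138 | 119 | ERNIE 4.5 300B A47B | 1032 | ±6 | 6.1K | 8.7% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 139 | 95 | DeepSeek-R1 Turbo | 1032 | ±10 | 1.4K | 5.5% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 140 | 71 | DeepSeek V3.1 | 1029 | ±10 | 1.1K | 4.5% | 0.8% | 197 tps | 0.4s | 164K | $0.55 | $1.60 |
| 141 | 129 | Command A | 1029 | ±5 | 11K | 8.4% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 142 | 126 | DeepSeek V3 | 1028 | ±7 | 5.9K | 5.7% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 143 | 86 | Amazon Nova 2 Lite | 1027 | ±10 | 2.6K | 7.9% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 144 | 200 | Claude Sonnet 3.5 | 1023 | ±9 | 1.6K | 8.4% | 1.0% | 40 tps | 2.7s | 200K | $3.00 | $15.00 |
| 145 | 241 | GPT-5 Mini High | 1021 | ±7 | 2.4K | 13.6% | <0.1% | 33 tps | 3.9s | 400K | $0.25 | $2.00 |
| 146 | 113 | GLM 4.5 AirX | 1020 | ±10 | 805 | 9.0% | 3.3% | 75 tps | 1.2s | 131K | $1.10 | $4.50 |
| 147 | 133 | GPT-4.1 nano | 1020 | ±4 | 8.8K | 9.7% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 148 | 124 | Qwen3 235B A22B Thinking 2507 | 1018 | ±11 | 1.1K | 4.2% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 149 | 200 | NVIDIA Llama 3.1 Nemotron 70B | 1018 | ±8 | 2.4K | 5.9% | <0.1% | 9 tps | 0.1s | 128K | $0.33 | $0.39 |
| 150 | 111 | Solar Pro 3 (Reasoning) | 1017 | ±15 | 620 | 1.6% | 3.2% | 118 tps | 1.2s | 131K | $0.15 | $0.60 |
| 151 | 148 | OpenAI o3 | 1016 | ±11 | 1.3K | 4.6% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 152 | 129 | DeepSeek V3.1 Thinking | 1014 | ±7 | 3.9K | 14.0% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 153 | 139 | Qwen3 VL 30B A3B Instruct | 1012 | ±17 | 1K | 6.5% | 1.8% | 80 tps | 2.6s | 129K | $0.18 | $0.67 |
| 154 | 153 | GLM 4.5 FP8 | 1012 | ±17 | 460 | 12.4% | <0.1% | 59 tps | 1.2s | 131K | $0.41 | $1.65 |
| 155 | 143 | Gemini 2.0 Flash Lite | 1011 | ±6 | 5.7K | 6.9% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 156 | 113 | Kimi K2 Fast | 1006 | ±4 | 26.2K | 13.8% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 157 | 219 | NVIDIA Llama 3.3 Nemotron Super 49B v1 | 1002 | ±13 | 1.2K | 9.7% | <0.1% | 13 tps | N/A | 131K | $0.07 | $0.20 |
| 158 | 113 | GLM 4.5 | 1002 | ±5 | 3.7K | 14.3% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 159 | 121 | NVIDIA Llama 3.3 Nemotron Super 49B v1.5 | 1000 | ±16 | 1K | 9.9% | 2.0% | 50 tps | 0.6s | 131K | $0.09 | $0.33 |
| 160 | 139 | OpenAI o4-mini | 1000 | ±5 | 4.8K | 10.2% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |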
View All (312 models)