
Last updated about 1 month ago

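| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 186 | Grok 3 Mini | 739 | ±20 | 1.4K | 2.5% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 2 | 194 | Llama 3.3 70B | 745 | ±30 | 525 | 4.5% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 3 | 186 | Gemma 3n E4B | 781 | ±27 | 535 | 4.5% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 |
| 4 | 265 | Magistral Small 2509 | 790 | ±29 | 530 | 6.2% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 5 | 229 | Magistral Medium 2509 | 797 | ±17 | 570 | 5.0% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 6 | 265 | Qwen 2.5 VL 72B Instruct | 804 | ±29 | 715 | 6.5% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 7 | 148 | Qwen3 30B A3B Thinking 2507 | 818 | ±18 | 795 | 3.0% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 8 | 201 | GPT-4o mini | 826 | ±18 | 645 | 6.5% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 9 | 161 | Qwen3 8B | 827 | ±36 | 600 | 4.0% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 10 | 133 | DeepSeek V3.2 Speciale | 830 | ±28 | 540 | 3.6% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 11 | 186 | Grok 3 Mini Fast | 832 | ±23 | 1K | 3.3% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 12 | 84 | GPT-5 Mini Minimal | 835 | ±13 | 1.1K | 6.6% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 13 | 175 | OpenAI o3-mini-low | 838 | ±21 | 1.7K | 2.6% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 14 | 157 | GPT-5 Nano | 843 | ±14 | 2K | 6.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 15 | 157 | Qwen3 Next 80B A3B Thinking | 846 | ±15 | 1.3K | 3.9% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 16 | 170 | Kimi K2 0711 | 858 | ±24 | 890 | 4.3% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 17 | 139 | GLM 4.6V | 865 | ±24 | 890 | 2.7% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 18 | 129 | Qwen3 Max Thinking | 866 | ±14 | 1.5K | 1.7% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 19 | 214 | OpenAI o3-mini-high | 868 | ±13 | 1.4K | 3.8% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 20 | 179 | GLM 4.7 Flash | 874 | ±24 | 855 | 2.8% | 5.8% | 61 tps | 2.8s | 128K | $0.07 | $0.39 |
| 21 | 160 | Llama 4 Scout | 875 | ±15 | 2.3K | 2.9% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 22 | 165 | Qwen3 4B | 878 | ±23 | 735 | 3.9% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 |
| 23 | 139 | Seed 2.0 Mini (Medium) | 900 | ±30 | 515 | 3.7% | 11.9% | 33 tps | 1.7s | 256K | $0.15 | $0.60 |
| 24 | 177 | OpenAI o3-mini | 901 | ±12 | 2.5K | 3.1% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 25 | 62 | MiniMax M2 | 905 | ±18 | 1.4K | 3.5% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 26 | 143 | Gemini 2.0 Flash Lite | 917 | ±11 | 2.5K | 6.7% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 27 | 119 | ERNIE 4.5 300B A47B | 918 | ±17 | 1.6K | 2.7% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 28 | 126 | Qwen3 VL 235B A22B Thinking | 922 | ±18 | 745 | 4.5% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 29 | 133 | Kimi K2 0905 | 922 | ±21 | 805 | 4.2% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 30 | 124 | Kimi K2 0905 Turbo | 925 | ±13 | 1.5K | 4.7% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50 |
| 31 | 95 | Kimi K2 Thinking | 926 | ±17 | 740 | 2.0% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 32 | 143 | Seed 1.6 250615 | 928 | ±21 | 635 | 5.2% | 3.1% | 46 tps | 2.2s | 256K | $0.25 | $2.00 |
| 33 | 101 | Qwen3.5 35B A3B | 949 | ±27 | 530 | 2.8% | 2.1% | 116 tps | 2.1s | 256K | $0.63 | $1.13 |
| 34 | 81 | OpenAI o3-pro | 951 | ±19 | 1.6K | 3.4% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 35 | 79 | Qwen3 Max Thinking Preview | 952 | ±20 | 1.1K | 2.2% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 36 | 129 | DeepSeek V3.1 Thinking | 955 | ±14 | 1.1K | 2.2% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 37 | 139 | OpenAI o4-mini | 956 | ±16 | 1.4K | 2.8% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 38 | 148 | OpenAI o4-mini-high | 958 | ±11 | 2.2K | 3.1% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 39 | 153 | OpenAI o1 | 960 | ±11 | 2.3K | 2.4% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 40 | 111 | LongCat Flash Chat | 963 | ±25 | 560 | 4.3% | 0.8% | 85 tps | 0.9s | 131K | $0.14 | $0.68 |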
Showing 40 of 121 models.