Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1010
Qwen3 Max Thinking
1008
Gemini 3.1 Flash Lite Preview
1008
Gemini 2.0 Flash Lite
1004
Seed 2.0 Mini (Medium)
1003
OpenAI o4-mini
999
Qwen3 VL 235B A22B Thinking
999
INTELLECT-3
998
Qwen3 Next 80B A3B Thinking
998
Jamba 1.7 Large
998
Qwen3 4B
997
Devstral Medium
997
Llama 4 Scout
996
Qwen3 8B
995
Mistral Small 3.2 24B Instruct
994
Mistral Small 3.2 24B

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121129Qwen3 Max Thinking1010±58.9K1.0%13.5%32 tps2.3s256K$1.20$6.00
12271Gemini 3.1 Flash Lite Preview1008±102.5K2.5%1.0%8 tps1.2s1M$0.25$1.50
123143Gemini 2.0 Flash Lite1008±263.2K3.2%<0.1%42 tps0.5s1M$0.08$0.30
124139Seed 2.0 Mini (Medium)1004±92.2K2.7%11.9%33 tps1.7s256K$0.15$0.60
125139OpenAI o4-mini1003±221.7K4.3%1.4%97 tps7.0s128K$1.10$4.40
126126Qwen3 VL 235B A22B Thinking999±39.9K6.2%4.3%47 tps3.0s127K$0.47$3.31
127194INTELLECT-3999±108952.7%1.5%114 tps0.6s131K$0.20$1.10
128157Qwen3 Next 80B A3B Thinking998±316.7K5.2%0.6%175 tps1.3s256K$0.21$2.26
129186Jamba 1.7 Large998±52.8K4.9%1.3%58 tps1.0s256K$1.33$5.33
130165Qwen3 4B998±312.9K6.5%1.9%94 tps1.5s128K$0.01$0.01
131170Devstral Medium997±311.7K2.9%1.5%77 tps0.6s131K$0.40$2.00
132160Llama 4 Scout997±266.9K2.4%0.6%88 tps5.1s131K$0.18$0.46
133161Qwen3 8B996±49.9K5.5%2.4%61 tps1.4s41K$0.02$0.07
134186Mistral Small 3.2 24B Instruct995±92.1K5.0%1.9%113 tps1.1s131K$0.02$0.08
135170Mistral Small 3.2 24B994±315.2K2.5%2.8%141 tps0.7s33K$0.02$0.08
136148OpenAI o4-mini-high988±233.5K4.5%1.9%117 tps15.9s200K$1.10$4.40
137133Kimi K2 0905988±316.2K3.9%4.0%30 tps1.4s262K$0.63$2.39
138148OpenAI o3987±312K2.6%0.9%85 tps6.8s128K$7.33$29.33
139179Baichuan-M2-32B983±71.9K5.9%<0.1%32 tps3.3s131K$0.07$0.07
140153OpenAI o1982±418.6K2.5%4.2%92 tps5.5s200K$15.00$60.00
141179Amazon Nova Pro 1.0982±224.5K1.6%0.9%96 tps0.7s300K$0.80$1.70
142179Inception Mercury979±228K1.8%0.4%257 tps1.1s32K$0.25$1.00
143186Jamba 1.6 Large977±215.8K1.3%2.0%59 tps1.2s256K$1.33$5.33
144194Llama 3.3 70B976±310.8K4.1%0.3%500 tps0.5s8K$0.48$0.66
145186Gemma 3n E4B976±225.5K1.8%2.0%30 tps0.5s8K$0.01$0.02
146157GPT-5 Nano974±310.1K6.0%3.2%113 tps20.9s400K$0.05$0.40
147179Qwen 2.5 72B972±45.6K2.1%1.2%96 tps1.2s131K$0.14$0.26
148175MiMo V2 Flash971±139004.3%7.2%24 tps1.9s262K$0.07$0.23
149165DeepSeek R1T2 Chimera967±45.9K3.3%3.0%28 tps1.8s164K$0.13$0.45
150175OpenAI o3-mini-low966±230.5K4.6%0.7%139 tps1.5s200K$1.10$4.40
151194Magistral Small 2506966±317.5K1.5%1.6%156 tps0.5s40K$0.37$1.10
152177OpenAI o3-mini962±233.6K4.2%0.8%143 tps3.3s200K$1.10$4.40
153201ERNIE 4.5 VL 424B A47B961±101.5K5.7%4.9%36 tps3.5s123K$0.42$1.25
154201Llama 3 8B960±213.1K1.8%6.0%85 tps0.7s8K$0.12$0.16
155209Seed 1.6 Flash 250715960±53.6K3.1%2.5%108 tps1.6s256K$0.07$0.30
156194GLM 4.5 Flash960±161.4K4.8%12.2%15 tps2.2s131K$0$0
157186Grok 3 Mini Fast958±226.4K4.4%1.6%44 tps0.5s131K$0.60$4.00
158214Qwen 2.5 VL 32B Instruct958±121.6K5.4%6.3%43 tps3.2s128K$0.35$0.62
159170Kimi K2 0711957±223.3K2.3%1.6%29 tps1.3s131K$0.72$2.60
160179Switchpoint Router957±48.5K2.0%1.7%71 tps4.9s131K$0.85$3.40
View All (208 models)