Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

793
Mistral Small 3.2 24B
787
Qwen3 30B A3B Thinking 2507
782
Llama 4 Scout
778
OpenAI o3-mini-high
777
OpenAI o3-mini
765
Magistral Medium 2509
752
OpenAI o3-mini-low
724
Grok 3 Mini Fast
719
Llama 3.3 70B
674
Pixtral 12B
635
Qwen 2.5 VL 72B Instruct
613
Inception Mercury
373
Qwen 2.5 VL 3B Instruct

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121170Mistral Small 3.2 24B793±284755.9%2.8%141 tps0.7s33K$0.02$0.08
122148Qwen3 30B A3B Thinking 2507787±146554.4%0.5%124 tps1.2s131K$0.16$1.70
123160Llama 4 Scout782±151.6K4.1%0.6%88 tps5.1s131K$0.18$0.46
124214OpenAI o3-mini-high778±176303.1%2.4%231 tps10.5s200K$1.10$4.40
125177OpenAI o3-mini777±122K3.6%0.8%143 tps3.3s200K$1.10$4.40
126229Magistral Medium 2509765±186103.9%4.0%58 tps0.9s131K$2.00$5.00
127175OpenAI o3-mini-low752±171.4K4.3%0.7%139 tps1.5s200K$1.10$4.40
128186Grok 3 Mini Fast724±151.3K4.3%1.6%44 tps0.5s131K$0.60$4.00
129194Llama 3.3 70B719±185503.5%0.3%500 tps0.5s8K$0.48$0.66
130274Pixtral 12B674±337205.9%2.2%101 tps1.2s131K$0.08$0.08
131265Qwen 2.5 VL 72B Instruct635±345055.6%5.3%25 tps3.7s128K$1.01$2.79
132179Inception Mercury613±255006.5%0.4%257 tps1.1s32K$0.25$1.00
133288Qwen 2.5 VL 3B Instruct373±469557.7%3.0%44 tps2.5s128K$0.21$0.63
View All (133 models)