Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

954
Devstral Small
954
Rnj-1 Instruct
956
GLM 4.7 Flash
956
Gemma 3 12B
957
Switchpoint Router
957
Kimi K2 0711
958
Mistral Small 24B Instruct
958
Qwen 2.5 VL 32B Instruct
958
Grok 3 Mini Fast
960
GLM 4.5 Flash
960
Seed 1.6 Flash 250715
960
Llama 3 8B
961
DeepSeek Prover v2
961
ERNIE 4.5 VL 424B A47B
962
OpenAI o3-mini

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
81201Devstral Small954±65.4K2.4%2.4%180 tps0.6s131K$0.10$0.30
82222Rnj-1 Instruct954±63K3.7%0.6%103 tps0.3s33K$0.15$0.15
83179GLM 4.7 Flash956±84.8K1.8%5.8%61 tps2.8s128K$0.07$0.39
84214Gemma 3 12B956±39.8K1.9%4.2%73 tps0.8s131K$0.05$0.12
85179Switchpoint Router957±48.5K2.0%1.7%71 tps4.9s131K$0.85$3.40
86170Kimi K2 0711957±223.3K2.3%1.6%29 tps1.3s131K$0.72$2.60
87201Mistral Small 24B Instruct958±46.8K2.1%1.5%84 tps0.4s33K$0.80$0.80
88214Qwen 2.5 VL 32B Instruct958±121.6K5.4%6.3%43 tps3.2s128K$0.35$0.62
89186Grok 3 Mini Fast958±226.4K4.4%1.6%44 tps0.5s131K$0.60$4.00
90194GLM 4.5 Flash960±161.4K4.8%12.2%15 tps2.2s131K$0$0
91209Seed 1.6 Flash 250715960±53.6K3.1%2.5%108 tps1.6s256K$0.07$0.30
92201Llama 3 8B960±213.1K1.8%6.0%85 tps0.7s8K$0.12$0.16
93161DeepSeek Prover v2961±63.3K1.8%5.2%14 tps1.3s164K$0.40$1.56
94201ERNIE 4.5 VL 424B A47B961±101.5K5.7%4.9%36 tps3.5s123K$0.42$1.25
95177OpenAI o3-mini962±233.6K4.2%0.8%143 tps3.3s200K$1.10$4.40
96194Magistral Small 2506966±317.5K1.5%1.6%156 tps0.5s40K$0.37$1.10
97175OpenAI o3-mini-low966±230.5K4.6%0.7%139 tps1.5s200K$1.10$4.40
98165DeepSeek R1T2 Chimera967±45.9K3.3%3.0%28 tps1.8s164K$0.13$0.45
99194Llama 3.2 11B Instruct967±29.6K1.9%1.5%152 tps0.5s8K$0.16$0.16
100175MiMo V2 Flash971±139004.3%7.2%24 tps1.9s262K$0.07$0.23
101179Qwen 2.5 72B972±45.6K2.1%1.2%96 tps1.2s131K$0.14$0.26
102194Mistral Small 3 24B Instruct972±47.7K1.5%2.6%77 tps0.6s33K$0.07$0.14
103157GPT-5 Nano974±310.1K6.0%3.2%113 tps20.9s400K$0.05$0.40
104186Gemma 3n E4B976±225.5K1.8%2.0%30 tps0.5s8K$0.01$0.02
105179Llama 3.1 70B Instruct976±149252.6%6.3%30 tps0.8s128K$0.17$0.22
106194Llama 3.3 70B976±310.8K4.1%0.3%500 tps0.5s8K$0.48$0.66
107186Jamba 1.6 Large977±215.8K1.3%2.0%59 tps1.2s256K$1.33$5.33
108179Inception Mercury979±228K1.8%0.4%257 tps1.1s32K$0.25$1.00
109179Amazon Nova Pro 1.0982±224.5K1.6%0.9%96 tps0.7s300K$0.80$1.70
110177Mistral Small 3.1 24B Instruct982±211.2K1.8%7.5%15 tps2.4s131K$0.06$0.18
111153OpenAI o1982±418.6K2.5%4.2%92 tps5.5s200K$15.00$60.00
112186Gemma 3 27B983±63.5K3.7%1.8%35 tps1.1s66K$0.06$0.10
113179Baichuan-M2-32B983±71.9K5.9%<0.1%32 tps3.3s131K$0.07$0.07
114148OpenAI o3987±312K2.6%0.9%85 tps6.8s128K$7.33$29.33
115133Kimi K2 0905988±316.2K3.9%4.0%30 tps1.4s262K$0.63$2.39
116148OpenAI o4-mini-high988±233.5K4.5%1.9%117 tps15.9s200K$1.10$4.40
117201Qwen 2.5 7B Turbo992±72.6K2.8%0.5%125 tps0.4s131K$0.30$0.30
118194Llama 3 70B993±91.9K1.3%4.5%21 tps1.7s8K$1.08$1.38
119165Pixtral Large994±49.9K2.6%2.5%57 tps1.3s128K$1.50$4.50
120170Mistral Small 3.2 24B994±315.2K2.5%2.8%141 tps0.7s33K$0.02$0.08
View All (288 models)