Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

995
Mistral Small 3.2 24B Instruct
996
Qwen3 8B
997
Llama 4 Scout
997
Devstral Medium
998
Qwen3 4B
998
Jamba 1.7 Large
998
Qwen3 Next 80B A3B Thinking
999
INTELLECT-3
999
Llama 4 Maverick
999
Qwen3 VL 235B A22B Thinking
1001
DeepSeek-R1
1003
OpenAI o4-mini
1004
Seed 2.0 Mini (Medium)
1006
Mistral Small 3.1
1008
Gemini 2.0 Flash Lite

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121186Mistral Small 3.2 24B Instruct995±92.1K5.0%1.9%113 tps1.1s131K$0.02$0.08
122161Qwen3 8B996±49.9K5.5%2.4%61 tps1.4s41K$0.02$0.07
123160Llama 4 Scout997±266.9K2.4%0.6%88 tps5.1s131K$0.18$0.46
124170Devstral Medium997±311.7K2.9%1.5%77 tps0.6s131K$0.40$2.00
125165Qwen3 4B998±312.9K6.5%1.9%94 tps1.5s128K$0.01$0.01
126186Jamba 1.7 Large998±52.8K4.9%1.3%58 tps1.0s256K$1.33$5.33
127157Qwen3 Next 80B A3B Thinking998±316.7K5.2%0.6%175 tps1.3s256K$0.21$2.26
128194INTELLECT-3999±108952.7%1.5%114 tps0.6s131K$0.20$1.10
129161Llama 4 Maverick999±173.4K2.5%1.2%88 tps2.4s1M$0.23$0.83
130126Qwen3 VL 235B A22B Thinking999±39.9K6.2%4.3%47 tps3.0s127K$0.47$3.31
131148DeepSeek-R11001±313.5K2.4%0.8%133 tps0.6s64K$0.91$3.07
132139OpenAI o4-mini1003±221.7K4.3%1.4%97 tps7.0s128K$1.10$4.40
133139Seed 2.0 Mini (Medium)1004±92.2K2.7%11.9%33 tps1.7s256K$0.15$0.60
134161Mistral Small 3.11006±39.7K2.0%7.4%13 tps2.6s32K$0.17$0.28
135143Gemini 2.0 Flash Lite1008±263.2K3.2%<0.1%42 tps0.5s1M$0.08$0.30
13671Gemini 3.1 Flash Lite Preview1008±102.5K2.5%1.0%8 tps1.2s1M$0.25$1.50
137129Qwen3 Max Thinking1010±58.9K1.0%13.5%32 tps2.3s256K$1.20$6.00
138153Qwen 2.5 32B Instruct1011±316.8K3.1%2.5%48 tps1.0s131K$0.21$0.25
139124Kimi K2 0905 Turbo1012±223.1K5.9%0.7%373 tps0.5s262K$1.70$6.50
140246Amazon Nova Micro 1.01015±161.3K1.6%4.1%193 tps0.6s128K$0.04$0.07
141157Cogito v2.1 671B1017±54.6K1.9%0.8%85 tps0.5s128K$1.25$1.25
142165Qwen3 VL 30B A3B Thinking1020±43.5K6.5%4.5%84 tps2.9s127K$0.20$1.47
143133DeepSeek-R1 05281020±312.3K2.1%1.3%93 tps0.5s64K$1.60$3.67
144148OpenAI o1-pro1021±99855.3%5.2%33 tps72.8s200K$150.00$600.00
145170Devstral Small 25071021±71.4K3.9%2.2%186 tps0.5s131K$0.10$0.30
146129DeepSeek V3.1 Thinking1022±312.7K6.2%7.1%18 tps1.8s131K$0.23$0.75
147139Qwen3 VL 30B A3B Instruct1022±72.1K4.8%1.8%80 tps2.6s129K$0.18$0.67
148165ERNIE 4.5 21B A3B1023±61.7K3.4%2.3%78 tps1.5s120K$0.05$0.19
149143Gemini 2.0 Flash1024±229.2K2.1%<0.1%76 tps0.5s1M$0.14$0.56
150139GLM 4.6V1024±210K2.3%6.4%21 tps1.8s128K$0.38$0.90
151133DeepSeek V3.2 Speciale1026±37.5K2.8%6.0%43 tps1.4s131K$0.84$1.52
152170Llama 3.1 8B Turbo1027±38.2K1.4%2.1%650 tps0.5s128K$0.13$0.14
153148Qwen3 30B A3B Thinking 25071027±37.7K2.3%0.5%124 tps1.2s131K$0.16$1.70
154113Gemini 2.5 Flash Lite Thinking1033±319K4.9%1.0%118 tps4.4s1M$0.03$0.13
155143Mistral Medium 31034±101.5K2.9%2.4%47 tps0.8s33K$0.40$2.00
156126DeepSeek V31035±257.9K1.7%0.9%69 tps1.1s64K$0.59$1.49
157113Kimi K2 Fast1037±2107.2K4.5%0.8%365 tps0.5s131K$1.00$3.00
158143Seed 1.6 2506151037±55K2.0%3.1%46 tps2.2s256K$0.25$2.00
159133Qwen3 14B1037±212.5K5.7%1.7%109 tps0.8s41K$0.04$0.15
160129Seed 2.0 Mini (Low)1038±137803.1%10.7%33 tps1.8s256K$0.20$0.80
View All (288 models)