Models


Last updated about 1 month ago

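| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | 210 | DeepSeek R1T2 Chimera | 876 | ±10 | 2.1K | 5.9% | 3.0% | 28 tps | 1.8s | 164K | $0.13 | $0.45 |
| 242 | 210 | Mistral Medium 3 | 875 | ±23 | 485 | 6.7% | 2.4% | 47 tps | 0.8s | 33K | $0.40 | $2.00 |
| 243 | 210 | Solar Mini 250422 | 874 | ±17 | 1.3K | 5.9% | 1.8% | 90 tps | 1.7s | 33K | $0.15 | $0.15 |
| 244 | 210 | GLM 4.7 Flash | 871 | ±28 | 610 | 4.7% | 5.8% | 61 tps | 2.8s | 128K | $0.07 | $0.39 |
| 245 | 210 | Mixtral 8x22B | 871 | ±22 | 1.3K | 5.0% | 1.2% | 140 tps | 0.6s | 64K | $2.00 | $6.00 |
| 246 | 293 | Devstral Small 2505 | 871 | ±15 | 1.7K | 6.2% | <0.1% | 141 tps | 1.3s | 33K | $0.03 | $0.09 |
| 247 | 293 | AFM 4.5B | 869 | ±7 | 4.4K | 8.9% | <0.1% | 81 tps | 0.3s | 66K | $0.05 | $0.20 |
| 248 | 293 | Claude Sonnet 3 | 869 | ±17 | 900 | 1.6% | <0.1% | 35 tps | 1.0s | 200K | $3.00 | $15.00 |
| 249 | 210 | Krutrim Spectre V2 | 868 | ±16 | 1.3K | 3.6% | <0.1% | 33 tps | 3.1s | 4K | $0.19 | $0.19 |
| 250 | 210 | GLM 4 32B | 868 | ±12 | 2.9K | 4.9% | 2.6% | 40 tps | 1.6s | 33K | $0.14 | $0.14 |
| 251 | 210 | Gemma 3 12B | 867 | ±11 | 2.5K | 4.9% | 4.2% | 73 tps | 0.8s | 131K | $0.05 | $0.12 |
| 252 | 210 | Moonshot V1 8k | 863 | ±13 | 915 | 5.2% | 1.0% | 55 tps | 1.5s | 8K | $0.20 | $2.00 |
| 253 | 210 | Qwen 2.5 14B Instruct | 861 | ±16 | 2.4K | 5.7% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 |
| 254 | 312 | DeepSeek-R1 0528 Qwen3 8B | 856 | ±8 | 4.9K | 6.5% | <0.1% | 45 tps | 2.4s | 128K | $0.05 | $0.09 |
| 255 | 210 | Ministral 3B 2512 | 854 | ±57 | 515 | 8.0% | 2.8% | 339 tps | 0.6s | 131K | $0.10 | $0.10 |
| 256 | 234 | Jamba 1.5 Large | 851 | ±9 | 2.9K | 4.0% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 257 | 312 | Wikipedia | 846 | ±7 | 9.8K | 4.9% | <0.1% | 47 tps | 2.1s | 32K | $0 | $0 |
| 258 | 324 | MAI-DS-R1 | 842 | ±7 | 3.5K | 11.7% | <0.1% | 73 tps | 3.2s | 64K | $0.10 | $0.40 |
| 259 | 234 | GPT-3.5 Turbo 16k | 838 | ±10 | 2.7K | 3.6% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |
| 260 | 324 | Cogito V2 671B | 838 | ±17 | 1.6K | 5.9% | <0.1% | 41 tps | 0.6s | 164K | $1.25 | $1.25 |
| 261 | 234 | ERNIE 4.5 21B A3B Thinking | 838 | ±23 | 1.1K | 6.9% | 1.8% | 87 tps | 1.5s | 120K | $0.07 | $0.28 |
| 262 | 240 | GLM 4.5 Flash | 834 | ±37 | 520 | 8.8% | 12.2% | 15 tps | 2.2s | 131K | $0 | $0 |
| 263 | 324 | NVIDIA Llama 3.1 Nemotron Ultra 253B v1 | 832 | ±16 | 2.2K | 4.1% | <0.1% | 40 tps | 0.8s | 128K | $0.30 | $0.90 |
| 264 | 240 | LFM2 2.6B | 826 | ±26 | 810 | 10.0% | 6.7% | 184 tps | 0.4s | 33K | $0.01 | $0.02 |
| 265 | 240 | Krutrim 2 | 825 | ±10 | 2.3K | 2.3% | 12.5% | 33 tps | 2.1s | 128K | $1.00 | $1.00 |
| 266 | 240 | Moonshot V1 32k | 820 | ±17 | 950 | 3.1% | 1.4% | 53 tps | 1.4s | 33K | $1.00 | $3.00 |
| 267 | 240 | LFM2 8B A1B | 818 | ±18 | 825 | 11.3% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 |
| 268 | 337 | GLM 4.1V 9B Thinking | 813 | ±16 | 1.1K | 4.2% | <0.1% | 69 tps | 1.3s | 66K | $0.04 | $0.14 |
| 269 | 337 | Qwen 2 72B Instruct | 805 | ±19 | 1K | 3.3% | <0.1% | 3 tps | N/A | 33K | $0.90 | $0.90 |
| 270 | 346 | Magistral Medium (Thinking) | 804 | ±10 | 2.2K | 5.7% | <0.1% | 67 tps | 0.8s | 41K | $2.00 | $5.00 |
| 271 | 252 | Magistral Small 2509 | 802 | ±18 | 1.8K | 7.5% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 272 | 252 | WizardLM-2 8x22B | 801 | ±12 | 1.9K | 3.1% | 11.6% | 11 tps | 2.5s | 66K | $0.77 | $0.77 |
| 273 | 252 | Hermes 4 405B FP8 | 797 | ±21 | 815 | 8.4% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 |
| 274 | 346 | Magistral Medium 2507 | 795 | ±25 | 665 | 14.2% | <0.1% | 86 tps | 0.7s | 41K | $2.00 | $5.00 |
| 275 | 252 | Mercury Coder | 793 | ±27 | 510 | 3.8% | <0.1% | 247 tps | 2.2s | 32K | $0.25 | $1.00 |
| 276 | 252 | GPT-3.5 Turbo Instruct | 787 | ±9 | 2K | 2.7% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 |
| 277 | 252 | Mistral Large | 785 | ±16 | 1.1K | 5.8% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 |
| 278 | 252 | Hermes 4 70B | 781 | ±29 | 460 | 8.9% | 1.1% | 67 tps | 0.6s | 131K | $0.12 | $0.39 |
| 279 | 262 | Baichuan-M2-32B | 770 | ±30 | 740 | 10.8% | <0.1% | 32 tps | 3.3s | 131K | $0.07 | $0.07 |
| 280 | 354 | OLMo 3 7B Think | 763 | ±21 | 770 | 7.8% | 4.2% | 77 tps | 0.4s | 66K | $0.12 | $0.20 |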
View All (305 models)