Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1006
Mistral Small 3.1
1004
Seed 2.0 Mini (Medium)
1004
Gemini 2.5 Flash Preview Thinking
1003
OpenAI o4-mini
1002
OLMo 3 7B Think
1001
DeepSeek-R1
1000
Solar Pro 2 250909
999
Qwen3 VL 235B A22B Thinking
999
Llama 4 Maverick
999
INTELLECT-3
998
Arcee AI Virtuoso-Large
998
Qwen3 Next 80B A3B Thinking
998
Jamba 1.7 Large
998
Qwen3 4B
997
Devstral Medium

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
201161Mistral Small 3.11006±39.7K2.0%7.4%13 tps2.6s32K$0.17$0.28
202139Seed 2.0 Mini (Medium)1004±92.2K2.7%11.9%33 tps1.7s256K$0.15$0.60
203182Gemini 2.5 Flash Preview Thinking1004±33.1K1.9%<0.1%26 tps1.8s1M$0.15$1.76
204139OpenAI o4-mini1003±221.7K4.3%1.4%97 tps7.0s128K$1.10$4.40
205241OLMo 3 7B Think1002±53.4K2.4%4.2%77 tps0.4s66K$0.12$0.20
206148DeepSeek-R11001±313.5K2.4%0.8%133 tps0.6s64K$0.91$3.07
207193Solar Pro 2 2509091000±119104.7%<0.1%84 tps1.1s66K$0.15$0.15
208126Qwen3 VL 235B A22B Thinking999±39.9K6.2%4.3%47 tps3.0s127K$0.47$3.31
209161Llama 4 Maverick999±173.4K2.5%1.2%88 tps2.4s1M$0.23$0.83
210194INTELLECT-3999±108952.7%1.5%114 tps0.6s131K$0.20$1.10
211219Arcee AI Virtuoso-Large998±212.6K2.6%<0.1%64 tps0.5s131K$0.75$1.20
212157Qwen3 Next 80B A3B Thinking998±316.7K5.2%0.6%175 tps1.3s256K$0.21$2.26
213186Jamba 1.7 Large998±52.8K4.9%1.3%58 tps1.0s256K$1.33$5.33
214165Qwen3 4B998±312.9K6.5%1.9%94 tps1.5s128K$0.01$0.01
215170Devstral Medium997±311.7K2.9%1.5%77 tps0.6s131K$0.40$2.00
216160Llama 4 Scout997±266.9K2.4%0.6%88 tps5.1s131K$0.18$0.46
217161Qwen3 8B996±49.9K5.5%2.4%61 tps1.4s41K$0.02$0.07
218186Mistral Small 3.2 24B Instruct995±92.1K5.0%1.9%113 tps1.1s131K$0.02$0.08
219233Llama 3.1 70B Instruct Turbo995±217.5K1.8%<0.1%110 tps0.8s128K$0.88$0.88
220170Mistral Small 3.2 24B994±315.2K2.5%2.8%141 tps0.7s33K$0.02$0.08
221165Pixtral Large994±49.9K2.6%2.5%57 tps1.3s128K$1.50$4.50
222238Tesslate UIGEN-X-32B-0727994±134856.7%<0.1%27 tps0.8s41K$0.02$0.08
223194Llama 3 70B993±91.9K1.3%4.5%21 tps1.7s8K$1.08$1.38
224201Qwen 2.5 7B Turbo992±72.6K2.8%0.5%125 tps0.4s131K$0.30$0.30
225213Claude Haiku 3.5991±219.2K3.0%0.8%40 tps2.8s200K$0.80$4.00
226292AFM 4.5B990±312.8K6.6%<0.1%81 tps0.3s66K$0.05$0.20
227241Arcee AI Blitz989±215.7K0.8%<0.1%6 tpsN/A33K$0.45$0.75
228148OpenAI o4-mini-high988±233.5K4.5%1.9%117 tps15.9s200K$1.10$4.40
229133Kimi K2 0905988±316.2K3.9%4.0%30 tps1.4s262K$0.63$2.39
230211Gemini 1.5 Pro988±39.7K2.2%<0.1%15 tps0.0s2M$0.78$3.13
231219Grok 3 Mini Beta988±38.8K0.8%<0.1%75 tps0.5s131K$0.45$2.25
232182GLM 4.5 Turbo987±91.2K6.0%<0.1%46 tps1.6s131K$1.00$3.00
233213DeepSeek R1T Chimera987±45.4K4.0%<0.1%46 tps1.1s164K$0.09$0.36
234148OpenAI o3987±312K2.6%0.9%85 tps6.8s128K$7.33$29.33
235159GLM 4.5 X986±157455.1%<0.1%48 tps2.8s131K$2.20$8.90
236200Claude Sonnet 3.5986±210.3K2.7%1.0%40 tps2.7s200K$3.00$15.00
237179Baichuan-M2-32B983±71.9K5.9%<0.1%32 tps3.3s131K$0.07$0.07
238186Gemma 3 27B983±63.5K3.7%1.8%35 tps1.1s66K$0.06$0.10
239153OpenAI o1982±418.6K2.5%4.2%92 tps5.5s200K$15.00$60.00
240331Marin 8B Instruct982±71K1.0%<0.1%170 tps0.2s131K$0.18$0.18
View All (432 models)