Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1005
GLM 4.5 Flash
1005
Mistral Small 3.1
1004
Qwen 2.5 72B Turbo
1004
Gemini 2.0 Flash
1003
Solar Pro 2 250909
1002
Devstral Small 2507
1001
R1 1776
1000
Amazon Nova Micro 1.0
999
Llama 3 8B Turbo
999
EXAONE Deep 32B
998
DeepSeek R1T2 Chimera
997
Pixtral Large
997
Claude Opus 4
994
Jamba 1.7 Large
994
Gemini 2.0 Flash Lite

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
201194GLM 4.5 Flash1005±111.1K2.3%12.2%15 tps2.2s131K$0$0
202161Mistral Small 3.11005±39.4K1.0%7.4%13 tps2.6s32K$0.17$0.28
203182Qwen 2.5 72B Turbo1004±62.9K1.2%<0.1%84 tps0.8s131K$0.60$0.60
204143Gemini 2.0 Flash1004±323.8K0.9%<0.1%76 tps0.5s1M$0.14$0.56
205193Solar Pro 2 2509091003±147252.7%<0.1%84 tps1.1s66K$0.15$0.15
206170Devstral Small 25071002±89802.0%2.2%186 tps0.5s131K$0.10$0.30
207253R1 17761001±52.2K2.2%<0.1%61 tps1.0s128K$2.00$8.00
208246Amazon Nova Micro 1.01000±236302.3%4.1%193 tps0.6s128K$0.04$0.07
209200Llama 3 8B Turbo999±72.5K1.4%<0.1%97 tps0.1s8K$0.12$0.13
210219EXAONE Deep 32B999±53.4K1.7%<0.1%24 tpsN/A33K$0$0
211165DeepSeek R1T2 Chimera998±55.6K1.7%3.0%28 tps1.8s164K$0.13$0.45
212165Pixtral Large997±57.6K1.8%2.5%57 tps1.3s128K$1.50$4.50
21321Claude Opus 4997±53.4K2.1%<0.1%25 tps1.5s200K$15.00$75.00
214186Jamba 1.7 Large994±92.1K2.5%1.3%58 tps1.0s256K$1.33$5.33
215143Gemini 2.0 Flash Lite994±355.3K2.8%<0.1%42 tps0.5s1M$0.08$0.30
216170Devstral Medium992±410.6K0.9%1.5%77 tps0.6s131K$0.40$2.00
217200K2 Think991±63.6K1.2%<0.1%418 tps2.8sN/A$0$0
218219NVIDIA Llama 3.3 Nemotron Super 49B v1991±214.5K0.5%<0.1%13 tpsN/A131K$0.07$0.20
219160Llama 4 Scout990±255K1.5%0.6%88 tps5.1s131K$0.18$0.46
220200NVIDIA Llama 3.1 Nemotron 70B990±320.1K0.8%<0.1%9 tps0.1s128K$0.33$0.39
221186Gemma 3 27B990±63.1K1.8%1.8%35 tps1.1s66K$0.06$0.10
222170Llama 3.1 8B Turbo989±57.3K1.5%2.1%650 tps0.5s128K$0.13$0.14
223157GPT-5 Nano989±36.6K3.1%3.2%113 tps20.9s400K$0.05$0.40
224161Llama 4 Maverick987±259.4K1.5%1.2%88 tps2.4s1M$0.23$0.83
225170Kimi K2 0711987±319K1.1%1.6%29 tps1.3s131K$0.72$2.60
22648Claude Sonnet 4 (Thinking)986±38.7K1.8%1.5%52 tps1.5s200K$3.00$13.67
227157Cogito v2.1 671B986±43.6K1.2%0.8%85 tps0.5s128K$1.25$1.25
228170Mistral Small 3.2 24B986±313K1.1%2.8%141 tps0.7s33K$0.02$0.08
229219Arcee AI Virtuoso-Large985±211.7K1.1%<0.1%64 tps0.5s131K$0.75$1.20
230213DeepSeek R1T Chimera980±62.7K2.0%<0.1%46 tps1.1s164K$0.09$0.36
231277GLM Z1 32B979±73K1.9%<0.1%18 tps9.3s33K$0.09$0.11
232200Claude Sonnet 3.5976±58.2K1.2%1.0%40 tps2.7s200K$3.00$15.00
233314DeepSeek-R1 0528 Qwen3 8B976±63.5K3.2%<0.1%45 tps2.4s128K$0.05$0.09
234213Claude Haiku 3.5976±315.2K1.5%0.8%40 tps2.8s200K$0.80$4.00
235211Gemini 1.5 Pro976±46.7K1.8%<0.1%15 tps0.0s2M$0.78$3.13
236314MAI-DS-R1974±45.1K2.9%<0.1%73 tps3.2s64K$0.10$0.40
237246DeepSeek-R1 Distill Llama 70B973±112.1K3.0%3.6%27 tps1.6s32K$0.73$0.95
238241OLMo 3 7B Think973±62.6K2.4%4.2%77 tps0.4s66K$0.12$0.20
239179Amazon Nova Pro 1.0971±222.3K0.8%0.9%96 tps0.7s300K$0.80$1.70
240179Qwen 2.5 72B970±45.3K1.0%1.2%96 tps1.2s131K$0.14$0.26
View All (410 models)