Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

936
Jamba 1.7 Mini
935
Dobby Unhinged Llama 3.3 70B
935
R1 1776
935
DeepSeek Prover v2
933
Llama 3.3 70B
933
Llama 3.1 70B Instruct Turbo
931
Codestral
928
NVIDIA Llama 3.1 Nemotron 70B
927
Grok 3 Mini
925
Mistral Small 3.1
924
Jamba 1.6 Large
922
Grok 3 Mini Beta
921
GLM Z1 32B
920
GPT-5 Nano Minimal
919
Solar Pro 2 250710 (Reasoning)

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
241230Jamba 1.7 Mini936±241K8.4%<0.1%84 tps0.9s256K$0.20$0.40
242230Dobby Unhinged Llama 3.3 70B935±198602.8%<0.1%41 tps0.4s128K$0.90$0.90
243230R1 1776935±93.3K4.2%<0.1%61 tps1.0s128K$2.00$8.00
244179DeepSeek Prover v2935±101.3K3.0%5.2%14 tps1.3s164K$0.40$1.56
245189Llama 3.3 70B933±103K6.6%0.3%500 tps0.5s8K$0.48$0.66
246245Llama 3.1 70B Instruct Turbo933±114.1K3.8%<0.1%110 tps0.8s128K$0.88$0.88
247189Codestral931±198556.0%5.2%151 tps0.9s262K$0.15$0.45
248245NVIDIA Llama 3.1 Nemotron 70B928±75.3K2.0%<0.1%9 tps0.1s128K$0.33$0.39
249189Grok 3 Mini927±69.9K6.4%1.2%43 tps0.5s131K$0.30$0.50
250189Mistral Small 3.1925±142.4K4.0%7.4%13 tps2.6s32K$0.17$0.28
251189Jamba 1.6 Large924±93.3K3.8%2.0%59 tps1.2s256K$1.33$5.33
252245Grok 3 Mini Beta922±141.8K1.9%<0.1%75 tps0.5s131K$0.45$2.25
253245GLM Z1 32B921±101.9K10.1%<0.1%18 tps9.3s33K$0.09$0.11
254245GPT-5 Nano Minimal920±131.4K10.8%<0.1%88 tps0.8s400K$0.05$0.40
255245Solar Pro 2 250710 (Reasoning)919±102.6K3.9%<0.1%9 tpsN/A66K$0.50$0.50
256189Llama 3.3 Swallow 70B Instruct919±83.5K5.5%1.4%153 tps1.3s131K$0.13$0.39
257189Open Mistral Nemo918±211.7K4.5%1.5%171 tps0.5s131K$0.15$0.15
258189Jamba 1.7 Large917±169758.0%1.3%58 tps1.0s256K$1.33$5.33
259189Devstral Small916±171.4K5.0%2.4%180 tps0.6s131K$0.10$0.30
260189Rnj-1 Instruct915±219706.7%0.6%103 tps0.3s33K$0.15$0.15
261189Seed 1.6 Flash 250715913±161.2K5.7%2.5%108 tps1.6s256K$0.07$0.30
262245Solar Pro 3 (Reasoning)913±185954.8%3.2%118 tps1.2s131K$0.15$0.60
263189Inception Mercury Coder Small Beta912±206103.2%1.7%270 tps1.4s32K$0.25$1.00
264264Arcee AI Spotlight910±84.6K4.3%<0.1%121 tps0.4s131K$0.18$0.18
265264YouTube910±132K5.5%<0.1%34 tps2.7s32K$0.99$0.99
266264OLMo 3 32B Think910±254706.0%<0.1%84 tps0.6s66K$0.15$0.50
267201Magistral Small 2506908±114.3K3.1%1.6%156 tps0.5s40K$0.37$1.10
268201GPT-3.5 Turbo908±151.2K2.5%1.3%74 tps0.9s16K$0.75$1.75
269264Fauna Fox908±103.4K8.2%<0.1%194 tps0.3s128K$0.04$0.15
270201Llama 3 8B904±103.2K3.5%6.0%85 tps0.7s8K$0.12$0.16
271201Mistral Small 3.2 24B Instruct903±208508.6%1.9%113 tps1.1s131K$0.02$0.08
272264DeepSeek R1T Chimera901±92.9K6.7%<0.1%46 tps1.1s164K$0.09$0.36
273264Grok 2901±72.3K2.7%<0.1%55 tps1.1s131K$2.00$10.00
274201GPT-4o mini900±142.8K5.3%2.1%71 tps1.7s128K$0.15$0.60
275201Moonshot V1 Auto899±229304.1%1.2%54 tps1.5s8K$2.00$5.00
276264Llama 3.1 405B Instruct Turbo896±112K3.9%<0.1%26 tps0.8s131K$3.50$3.50
277264Arcee AI Virtuoso-Medium896±122K2.6%<0.1%3 tpsN/A131K$0.50$0.80
278201Amazon Nova Pro 1.0894±105.7K4.0%0.9%96 tps0.7s300K$0.80$1.70
279264Exaone 3.5 32B Instruct893±216503.0%<0.1%17 tpsN/A33K$0$0
280201GLM 4.6V Flash892±102.5K7.6%3.7%64 tps2.1s128K$0.04$0.40
View All (404 models)