Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

980
Llama 4 Maverick
978
Gemini 2.5 Flash Lite Thinking Preview 0925
977
Fauna Fox
972
Kimi K2 0905 Turbo
972
MiniMax M2.5 Lightning
970
Qwen3 4B
966
OpenAI o4-mini-high
966
Amazon Nova 2 Lite
963
OpenAI o3-mini-low
956
DeepSeek V3.1 Thinking
955
Solar Pro 2 250710 (Reasoning)
954
Llama 3.1 8B Turbo
953
Mistral Small 3.2 24B
949
Mistral Small 3.1 24B Instruct
949
Grok 2

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
161161Llama 4 Maverick980±67.3K1.8%1.2%88 tps2.4s1M$0.23$0.83
16295Gemini 2.5 Flash Lite Thinking Preview 0925978±72.2K2.5%1.5%152 tps3.0s1M$0.10$0.40
163182Fauna Fox977±62K1.7%<0.1%194 tps0.3s128K$0.04$0.15
164124Kimi K2 0905 Turbo972±73.2K3.9%0.7%373 tps0.5s262K$1.70$6.50
16579MiniMax M2.5 Lightning972±179351.1%1.5%51 tps2.0s205K$0.60$2.40
166165Qwen3 4B970±83.1K2.7%1.9%94 tps1.5s128K$0.01$0.01
167148OpenAI o4-mini-high966±49.3K1.8%1.9%117 tps15.9s200K$1.10$4.40
16886Amazon Nova 2 Lite966±91.6K3.0%1.0%137 tps0.6s300K$0.35$2.95
169175OpenAI o3-mini-low963±88.8K1.9%0.7%139 tps1.5s200K$1.10$4.40
170129DeepSeek V3.1 Thinking956±102.2K2.4%7.1%18 tps1.8s131K$0.23$0.75
171270Solar Pro 2 250710 (Reasoning)955±72.3K2.3%<0.1%9 tpsN/A66K$0.50$0.50
172170Llama 3.1 8B Turbo954±206852.1%2.1%650 tps0.5s128K$0.13$0.14
173170Mistral Small 3.2 24B953±91.4K1.8%2.8%141 tps0.7s33K$0.02$0.08
174177Mistral Small 3.1 24B Instruct949±167452.0%7.5%15 tps2.4s131K$0.06$0.18
175277Grok 2949±166500.8%<0.1%55 tps1.1s131K$2.00$10.00
176165DeepSeek R1T2 Chimera947±206202.4%3.0%28 tps1.8s164K$0.13$0.45
177160Llama 4 Scout941±76.9K1.5%0.6%88 tps5.1s131K$0.18$0.46
178179GLM 4.7 Flash939±111.1K1.3%5.8%61 tps2.8s128K$0.07$0.39
179186Grok 3 Mini Fast939±113.9K2.3%1.6%44 tps0.5s131K$0.60$4.00
180253R1 1776938±103.1K0.9%<0.1%61 tps1.0s128K$2.00$8.00
181157Cogito v2.1 671B937±158851.7%0.8%85 tps0.5s128K$1.25$1.25
182177Llama 3 70B Turbo935±141K1.4%<0.1%31 tps0.0s8K$0.73$0.83
183148OpenAI o3935±54.3K1.7%0.9%85 tps6.8s128K$7.33$29.33
184161DeepSeek Prover v2933±178252.4%5.2%14 tps1.3s164K$0.40$1.56
185214Llama 3.3 70B Instruct Turbo931±275053.8%2.0%78 tps1.0s131K$0.88$0.88
186201Gemma 3 27B IT930±156552.2%2.0%60 tps0.8s128K$0.17$0.29
187157GPT-5 Nano928±101.8K3.0%3.2%113 tps20.9s400K$0.05$0.40
188170Kimi K2 0711928±93.2K2.2%1.6%29 tps1.3s131K$0.72$2.60
189253Magistral Medium927±165305.4%<0.1%95 tps0.5s41K$2.00$5.00
190139OpenAI o4-mini925±84K2.3%1.4%97 tps7.0s128K$1.10$4.40
191209Seed 1.6 Flash 250715922±165802.5%2.5%108 tps1.6s256K$0.07$0.30
192214OpenAI o3-mini-high922±67.5K2.0%2.4%231 tps10.5s200K$1.10$4.40
193177OpenAI o3-mini921±69K1.9%0.8%143 tps3.3s200K$1.10$4.40
194214Gemma 3 12B921±186353.1%4.2%73 tps0.8s131K$0.05$0.12
195219NVIDIA Llama 3.3 Nemotron Super 49B v1918±159201.1%<0.1%13 tpsN/A131K$0.07$0.20
196186Jamba 1.6 Large915±186601.5%2.0%59 tps1.2s256K$1.33$5.33
197200Claude Sonnet 3.5914±148302.4%1.0%40 tps2.7s200K$3.00$15.00
198113GLM 4.5 AirX913±235401.8%3.3%75 tps1.2s131K$1.10$4.50
199314DeepSeek-R1 0528 Qwen3 8B912±53.4K2.3%<0.1%45 tps2.4s128K$0.05$0.09
200211Gemini 1.5 Pro912±155802.5%<0.1%15 tps0.0s2M$0.78$3.13
View All (260 models)