
Last updated about 1 month ago

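| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 284 | MiniMax M1 | 570 | ±10 | 3K | 1.3% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 |
| 2 | 274 | DeepSeek-R1 Distill Qwen 32B | 638 | ±9 | 1.8K | 2.7% | 6.2% | 22 tps | 1.8s | 131K | $0.37 | $0.39 |
| 3 | 225 | GPT-3.5 Turbo 16k | 699 | ±17 | 690 | 0.7% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |
| 4 | 235 | GLM 4 32B | 751 | ±19 | 740 | 2.0% | 2.6% | 40 tps | 1.6s | 33K | $0.14 | $0.14 |
| 5 | 201 | GPT-4o mini | 803 | ±24 | 545 | 4.4% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 |
| 6 | 179 | Amazon Nova Pro 1.0 | 807 | ±19 | 1.4K | 1.7% | 0.9% | 96 tps | 0.7s | 300K | $0.80 | $1.70 |
| 7 | 222 | Jamba 1.5 Large | 819 | ±15 | 690 | 1.4% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 |
| 8 | 201 | Llama 3 8B | 826 | ±17 | 720 | 1.4% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 |
| 9 | 179 | Inception Mercury | 829 | ±10 | 2K | 1.5% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 |
| 10 | 265 | Magistral Small 2509 | 830 | ±23 | 825 | 5.7% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 |
| 11 | 209 | Qwen 2.5 14B Instruct | 830 | ±24 | 570 | 1.7% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 |
| 12 | 194 | Magistral Small 2506 | 847 | ±14 | 1.3K | 1.9% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 |
| 13 | 229 | Magistral Medium 2509 | 849 | ±16 | 990 | 3.9% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 14 | 209 | Llama 3.3 Swallow 70B Instruct | 854 | ±18 | 890 | 1.1% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 15 | 229 | ERNIE 4.5 21B A3B Thinking | 866 | ±16 | 685 | 2.8% | 1.8% | 87 tps | 1.5s | 120K | $0.07 | $0.28 |
| 16 | 179 | Switchpoint Router | 876 | ±16 | 675 | 1.5% | 1.7% | 71 tps | 4.9s | 131K | $0.85 | $3.40 |
| 17 | 186 | Gemma 3n E4B | 892 | ±10 | 2K | 2.6% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 |
| 18 | 170 | Devstral Medium | 899 | ±10 | 995 | 1.5% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 |
| 19 | 194 | Llama 3.3 70B | 900 | ±12 | 1.1K | 3.0% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 20 | 186 | Grok 3 Mini | 908 | ±9 | 3.8K | 2.3% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 21 | 113 | GLM 4.5 AirX | 913 | ±23 | 540 | 1.8% | 3.3% | 75 tps | 1.2s | 131K | $1.10 | $4.50 |
| 22 | 186 | Jamba 1.6 Large | 915 | ±18 | 660 | 1.5% | 2.0% | 59 tps | 1.2s | 256K | $1.33 | $5.33 |
| 23 | 214 | Gemma 3 12B | 921 | ±18 | 635 | 3.1% | 4.2% | 73 tps | 0.8s | 131K | $0.05 | $0.12 |
| 24 | 177 | OpenAI o3-mini | 921 | ±6 | 9K | 1.9% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 25 | 214 | OpenAI o3-mini-high | 922 | ±6 | 7.5K | 2.0% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 26 | 209 | Seed 1.6 Flash 250715 | 922 | ±16 | 580 | 2.5% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 |
| 27 | 139 | OpenAI o4-mini | 925 | ±8 | 4K | 2.3% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 28 | 170 | Kimi K2 0711 | 928 | ±9 | 3.2K | 2.2% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 29 | 157 | GPT-5 Nano | 928 | ±10 | 1.8K | 3.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 30 | 148 | OpenAI o3 | 935 | ±5 | 4.3K | 1.7% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 31 | 157 | Cogito v2.1 671B | 937 | ±15 | 885 | 1.7% | 0.8% | 85 tps | 0.5s | 128K | $1.25 | $1.25 |
| 32 | 186 | Grok 3 Mini Fast | 939 | ±11 | 3.9K | 2.3% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 33 | 179 | GLM 4.7 Flash | 939 | ±11 | 1.1K | 1.3% | 5.8% | 61 tps | 2.8s | 128K | $0.07 | $0.39 |
| 34 | 160 | Llama 4 Scout | 941 | ±7 | 6.9K | 1.5% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 35 | 165 | DeepSeek R1T2 Chimera | 947 | ±20 | 620 | 2.4% | 3.0% | 28 tps | 1.8s | 164K | $0.13 | $0.45 |
| 36 | 170 | Mistral Small 3.2 24B | 953 | ±9 | 1.4K | 1.8% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 37 | 129 | DeepSeek V3.1 Thinking | 956 | ±10 | 2.2K | 2.4% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 38 | 175 | OpenAI o3-mini-low | 963 | ±8 | 8.8K | 1.9% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 39 | 86 | Amazon Nova 2 Lite | 966 | ±9 | 1.6K | 3.0% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 40 | 148 | OpenAI o4-mini-high | 966 | ±4 | 9.3K | 1.8% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |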
Showing 40 of 142 models.