Models
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1029
Gemini 2.0 Flash Lite
1026
Amazon Nova 2 Lite
1021
DeepSeek V3.1 Nex N1
1020
OpenAI o3
1018
Gemini 2.0 Flash
1009
Qwen3 VL 235B A22B Thinking
1007
Qwen3 Coder Plus
1001
Qwen 2.5 VL 32B Instruct
1000
Qwen3 235B A22B Thinking 2507
999
OpenAI o3-mini-high
999
OpenAI o3-mini
995
OpenAI o4-mini-high
995
Seed 1.6 250615
989
GPT-5 Nano
988
OpenAI o3-mini-low

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
121135Gemini 2.0 Flash Lite1029±514.7K9.5%<0.1%42 tps0.5s1M$0.08$0.30
122135Amazon Nova 2 Lite1026±103.6K6.0%1.0%137 tps0.6s300K$0.35$2.95
123144DeepSeek V3.1 Nex N11021±195655.0%3.4%24 tps7.2s131K$0.14$0.50
124144OpenAI o31020±75.9K4.0%0.9%85 tps6.8s128K$7.33$29.33
125144Gemini 2.0 Flash1018±78.2K3.8%<0.1%76 tps0.5s1M$0.14$0.56
126148Qwen3 VL 235B A22B Thinking1009±64.6K8.3%4.3%47 tps3.0s127K$0.47$3.31
127148Qwen3 Coder Plus1007±226104.7%5.1%56 tps2.3s128K$1.80$9.80
128148Qwen 2.5 VL 32B Instruct1001±218654.9%6.3%43 tps3.2s128K$0.35$0.62
129148Qwen3 235B A22B Thinking 25071000±102.8K4.4%2.5%53 tps1.6s131K$0.59$5.70
130148OpenAI o3-mini-high999±58.3K4.1%2.4%231 tps10.5s200K$1.10$4.40
131148OpenAI o3-mini999±415K5.5%0.8%143 tps3.3s200K$1.10$4.40
132148OpenAI o4-mini-high995±713.6K6.2%1.9%117 tps15.9s200K$1.10$4.40
133148Seed 1.6 250615995±211.6K6.0%3.1%46 tps2.2s256K$0.25$2.00
134159GPT-5 Nano989±64.6K8.0%3.2%113 tps20.9s400K$0.05$0.40
135159OpenAI o3-mini-low988±612.2K6.4%0.7%139 tps1.5s200K$1.10$4.40
136159Grok Code Fast 1987±92.5K6.0%5.9%294 tps0.5s256K$0.20$1.50
137159GLM 4.6V986±83K5.5%6.4%21 tps1.8s128K$0.38$0.90
138159Kimi K2 0711981±67K4.5%1.6%29 tps1.3s131K$0.72$2.60
139159Seed 2.0 Mini (Medium)981±355705.8%11.9%33 tps1.7s256K$0.15$0.60
140167DeepSeek V3.1 Thinking976±95.2K9.5%7.1%18 tps1.8s131K$0.23$0.75
141167Nemotron 3 Nano974±465806.5%1.3%216 tps0.8s256K$0.05$4.94
142167Mistral Small 3.2 24B970±134.6K4.9%2.8%141 tps0.7s33K$0.02$0.08
143167Llama 4 Scout965±517.5K5.3%0.6%88 tps5.1s131K$0.18$0.46
144167Devstral Medium962±113.5K5.2%1.5%77 tps0.6s131K$0.40$2.00
145167Qwen 2.5 72B960±151.4K4.5%1.2%96 tps1.2s131K$0.14$0.26
146179Qwen3 30B A3B Thinking 2507953±103.5K4.7%0.5%124 tps1.2s131K$0.16$1.70
147179Switchpoint Router949±102.7K3.6%1.7%71 tps4.9s131K$0.85$3.40
148179Qwen3 8B948±94.2K8.2%2.4%61 tps1.4s41K$0.02$0.07
149179Ministral 14B 3.0948±168058.5%2.0%119 tps0.5s128K$0.20$0.20
150179Grok 3 Mini Fast943±79K7.0%1.6%44 tps0.5s131K$0.60$4.00
151179ERNIE 4.5 21B A3B943±285406.9%2.3%78 tps1.5s120K$0.05$0.19
152179ERNIE 4.5 VL 424B A47B942±187256.5%4.9%36 tps3.5s123K$0.42$1.25
153179NVIDIA Llama 3.3 Nemotron Super 49B v1.5941±191.4K6.9%2.0%50 tps0.6s131K$0.09$0.33
154189Llama 3.3 70B933±103K6.6%0.3%500 tps0.5s8K$0.48$0.66
155189Grok 3 Mini927±69.9K6.4%1.2%43 tps0.5s131K$0.30$0.50
156189Jamba 1.6 Large924±93.3K3.8%2.0%59 tps1.2s256K$1.33$5.33
157189Llama 3.3 Swallow 70B Instruct919±83.5K5.5%1.4%153 tps1.3s131K$0.13$0.39
158189Jamba 1.7 Large917±169758.0%1.3%58 tps1.0s256K$1.33$5.33
159189Devstral Small916±171.4K5.0%2.4%180 tps0.6s131K$0.10$0.30
160189Rnj-1 Instruct915±219706.7%0.6%103 tps0.3s33K$0.15$0.15
View All (210 models)