Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

1076
Gemini 2.5 Flash Thinking
1076
DeepSeek V3 0324 Turbo
1076
Apriel 1.5 15B Thinker
1074
Kimi K2 Thinking
1065
DeepSeek V3 0324
1065
Claude Sonnet 4
1064
Qwen3.5 35B A3B
1064
NVIDIA Llama 3.3 Nemotron Super 49B v1.5
1063
Grok 3
1060
Grok 3 Fast
1060
Gemini 2.5 Flash Lite
1059
Solar Pro 2 251215
1058
ERNIE 4.5 300B A47B
1058
Gemini 2.5 Flash
1057
GLM 4.5 AirX

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
8171Gemini 2.5 Flash Thinking1076±222.8K2.3%2.2%88 tps6.4s1M$0.30$2.50
8293DeepSeek V3 0324 Turbo1076±256.4K2.9%6.3%12 tps2.4s164K$0.73$1.79
83153Apriel 1.5 15B Thinker1076±52.1K1.6%2.4%146 tps0.4s131K$0$0
8495Kimi K2 Thinking1074±38.3K2.9%4.2%61 tps5.9s262K$0.24$1.03
85106DeepSeek V3 03241065±146.7K2.5%5.8%12 tps2.7s164K$0.38$0.93
8686Claude Sonnet 41065±2113.6K2.4%1.8%49 tps1.3s200K$3.00$15.00
87101Qwen3.5 35B A3B1064±82.1K2.3%2.1%116 tps2.1s256K$0.63$1.13
88121NVIDIA Llama 3.3 Nemotron Super 49B v1.51064±54.5K3.7%2.0%50 tps0.6s131K$0.09$0.33
89106Grok 31063±265.8K2.6%1.5%53 tps0.6s1M$3.67$18.33
90111Grok 3 Fast1060±312.4K1.1%1.7%52 tps2.4s131K$5.00$25.00
91101Gemini 2.5 Flash Lite1060±250.1K4.8%1.3%210 tps0.7s1M$0.10$0.40
92143Solar Pro 2 2512151059±99852.5%1.8%107 tps1.5s66K$0.15$0.60
93119ERNIE 4.5 300B A47B1058±251.6K1.9%4.7%23 tps2.3s123K$0.28$1.10
9495Gemini 2.5 Flash1058±1118.2K1.8%1.3%2 tps3.7s1M$0.30$2.50
95113GLM 4.5 AirX1057±44.1K3.0%3.3%75 tps1.2s131K$1.10$4.50
96118GPT-4.1 mini1055±267.2K2.2%1.1%67 tps0.9s1M$0.34$1.60
97106Claude Sonnet 3.5 v21055±221.4K1.9%<0.1%46 tps1.4s200K$3.00$15.00
98119GLM 4.7 FP81053±62.7K1.1%6.9%40 tps1.3s200K$0.30$1.20
99101GPT-5 (Low)1051±52.1K1.4%1.8%75 tps8.2s400K$1.25$10.00
10095Gemini 2.5 Flash Lite Thinking Preview 09251051±414.2K4.3%1.5%152 tps3.0s1M$0.10$0.40
10181OpenAI o3-pro1049±46.7K3.4%5.2%22 tps70.8s200K$20.00$80.00
102113Mistral Medium1048±237.8K2.5%1.8%48 tps0.6s33K$1.48$4.55
103113GLM 4.51044±215.4K5.0%3.7%46 tps1.4s131K$0.43$1.63
104124Qwen3 235B A22B Thinking 25071044±36.5K2.4%2.5%53 tps1.6s131K$0.59$5.70
105133GPT-4.1 nano1040±161.4K2.5%0.6%175 tps0.5s1M$0.10$0.40
106129Seed 2.0 Mini (Low)1038±137803.1%10.7%33 tps1.8s256K$0.20$0.80
107143Seed 1.6 2506151037±55K2.0%3.1%46 tps2.2s256K$0.25$2.00
108143Mistral Medium 31034±101.5K2.9%2.4%47 tps0.8s33K$0.40$2.00
109113Gemini 2.5 Flash Lite Thinking1033±319K4.9%1.0%118 tps4.4s1M$0.03$0.13
110148Qwen3 30B A3B Thinking 25071027±37.7K2.3%0.5%124 tps1.2s131K$0.16$1.70
111133DeepSeek V3.2 Speciale1026±37.5K2.8%6.0%43 tps1.4s131K$0.84$1.52
112139GLM 4.6V1024±210K2.3%6.4%21 tps1.8s128K$0.38$0.90
113143Gemini 2.0 Flash1024±229.2K2.1%<0.1%76 tps0.5s1M$0.14$0.56
114165ERNIE 4.5 21B A3B1023±61.7K3.4%2.3%78 tps1.5s120K$0.05$0.19
115129DeepSeek V3.1 Thinking1022±312.7K6.2%7.1%18 tps1.8s131K$0.23$0.75
116170Devstral Small 25071021±71.4K3.9%2.2%186 tps0.5s131K$0.10$0.30
117148OpenAI o1-pro1021±99855.3%5.2%33 tps72.8s200K$150.00$600.00
118157Cogito v2.1 671B1017±54.6K1.9%0.8%85 tps0.5s128K$1.25$1.25
119246Amazon Nova Micro 1.01015±161.3K1.6%4.1%193 tps0.6s128K$0.04$0.07
120124Kimi K2 0905 Turbo1012±223.1K5.9%0.7%373 tps0.5s262K$1.70$6.50
View All (208 models)