Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

613
Inception Mercury
635
Qwen 2.5 VL 72B Instruct
719
Llama 3.3 70B
724
Grok 3 Mini Fast
752
OpenAI o3-mini-low
765
Magistral Medium 2509
777
OpenAI o3-mini
778
OpenAI o3-mini-high
782
Llama 4 Scout
787
Qwen3 30B A3B Thinking 2507
793
Mistral Small 3.2 24B
799
Grok 3 Mini
813
Qwen3 8B
818
Qwen3 4B
837
GLM 4.6V

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1179Inception Mercury613±255006.5%0.4%257 tps1.1s32K$0.25$1.00
2265Qwen 2.5 VL 72B Instruct635±345055.6%5.3%25 tps3.7s128K$1.01$2.79
3194Llama 3.3 70B719±185503.5%0.3%500 tps0.5s8K$0.48$0.66
4186Grok 3 Mini Fast724±151.3K4.3%1.6%44 tps0.5s131K$0.60$4.00
5175OpenAI o3-mini-low752±171.4K4.3%0.7%139 tps1.5s200K$1.10$4.40
6229Magistral Medium 2509765±186103.9%4.0%58 tps0.9s131K$2.00$5.00
7177OpenAI o3-mini777±122K3.6%0.8%143 tps3.3s200K$1.10$4.40
8214OpenAI o3-mini-high778±176303.1%2.4%231 tps10.5s200K$1.10$4.40
9160Llama 4 Scout782±151.6K4.1%0.6%88 tps5.1s131K$0.18$0.46
10148Qwen3 30B A3B Thinking 2507787±146554.4%0.5%124 tps1.2s131K$0.16$1.70
11170Mistral Small 3.2 24B793±284755.9%2.8%141 tps0.7s33K$0.02$0.08
12186Grok 3 Mini799±231.9K2.6%1.2%43 tps0.5s131K$0.30$0.50
13161Qwen3 8B813±236105.4%2.4%61 tps1.4s41K$0.02$0.07
14165Qwen3 4B818±208704.9%1.9%94 tps1.5s128K$0.01$0.01
15139GLM 4.6V837±236405.2%6.4%21 tps1.8s128K$0.38$0.90
16133DeepSeek V3.2 Speciale842±384854.9%6.0%43 tps1.4s131K$0.84$1.52
17119ERNIE 4.5 300B A47B873±121.1K3.4%4.7%23 tps2.3s123K$0.28$1.10
18143Gemini 2.0 Flash885±168705.9%<0.1%76 tps0.5s1M$0.14$0.56
19157Qwen3 Next 80B A3B Thinking890±161.1K3.4%0.6%175 tps1.3s256K$0.21$2.26
20133GPT-4.1 nano896±111.8K3.2%0.6%175 tps0.5s1M$0.10$0.40
21157GPT-5 Nano901±91.2K5.0%3.2%113 tps20.9s400K$0.05$0.40
22143Gemini 2.0 Flash Lite902±149907.9%<0.1%42 tps0.5s1M$0.08$0.30
23124Qwen3 235B A22B Thinking 2507905±166952.1%2.5%53 tps1.6s131K$0.59$5.70
24153OpenAI o1926±159152.1%4.2%92 tps5.5s200K$15.00$60.00
25126Qwen3 VL 235B A22B Thinking939±159653.5%4.3%47 tps3.0s127K$0.47$3.31
26139OpenAI o4-mini947±111.2K2.5%1.4%97 tps7.0s128K$1.10$4.40
2771Seed 1.8 251228949±181.2K2.7%3.7%41 tps2.1s256K$0.25$2.00
2868GLM 4.7949±131.6K1.8%5.8%40 tps1.5s200K$0.77$1.73
29148OpenAI o4-mini-high950±121.5K3.8%1.9%117 tps15.9s200K$1.10$4.40
3056DeepSeek V3.1 Turbo957±148204.1%0.9%173 tps1.3s164K$2.00$3.75
31129DeepSeek V3.1 Thinking958±141K2.4%7.1%18 tps1.8s131K$0.23$0.75
32148OpenAI o3960±166002.4%0.9%85 tps6.8s128K$7.33$29.33
3384GPT-5 Mini Minimal968±177953.6%1.2%63 tps1.4s400K$0.25$2.00
34113GLM 4.5969±191.3K3.5%3.7%46 tps1.4s131K$0.43$1.63
35118GPT-4.1 mini976±132.7K1.8%1.1%67 tps0.9s1M$0.34$1.60
3695Kimi K2 Thinking985±216203.1%4.2%61 tps5.9s262K$0.24$1.03
37113Mistral Medium989±141.1K2.7%1.8%48 tps0.6s33K$1.48$4.55
38106DeepSeek V3 0324990±112.1K3.0%5.8%12 tps2.7s164K$0.38$0.93
39124Kimi K2 0905 Turbo991±121.6K1.8%0.7%373 tps0.5s262K$1.70$6.50
40113Gemini 2.5 Flash Lite Thinking996±112.5K3.7%1.0%118 tps4.4s1M$0.03$0.13
View All (107 models)