
Last updated about 1 month ago

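| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 288 | Qwen 2.5 VL 3B Instruct | 373 | ±46 | 955 | 7.7% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |
| 2 | 179 | Inception Mercury | 613 | ±25 | 500 | 6.5% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 |
| 3 | 265 | Qwen 2.5 VL 72B Instruct | 635 | ±34 | 505 | 5.6% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 |
| 4 | 274 | Pixtral 12B | 674 | ±33 | 720 | 5.9% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 |
| 5 | 194 | Llama 3.3 70B | 719 | ±18 | 550 | 3.5% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 |
| 6 | 186 | Grok 3 Mini Fast | 724 | ±15 | 1.3K | 4.3% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |
| 7 | 175 | OpenAI o3-mini-low | 752 | ±17 | 1.4K | 4.3% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |
| 8 | 229 | Magistral Medium 2509 | 765 | ±18 | 610 | 3.9% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 |
| 9 | 177 | OpenAI o3-mini | 777 | ±12 | 2K | 3.6% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 10 | 214 | OpenAI o3-mini-high | 778 | ±17 | 630 | 3.1% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 |
| 11 | 160 | Llama 4 Scout | 782 | ±15 | 1.6K | 4.1% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 12 | 148 | Qwen3 30B A3B Thinking 2507 | 787 | ±14 | 655 | 4.4% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 |
| 13 | 170 | Mistral Small 3.2 24B | 793 | ±28 | 475 | 5.9% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 |
| 14 | 165 | Pixtral Large | 796 | ±26 | 610 | 7.6% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 |
| 15 | 186 | Grok 3 Mini | 799 | ±23 | 1.9K | 2.6% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |
| 16 | 161 | Qwen3 8B | 813 | ±23 | 610 | 5.4% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 |
| 17 | 165 | Qwen3 4B | 818 | ±20 | 870 | 4.9% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 |
| 18 | 121 | QwQ 32B | 825 | ±16 | 1.3K | 5.5% | 5.4% | 41 tps | 2.1s | 16K | $0.43 | $0.56 |
| 19 | 126 | Qwen3 30B A3B | 832 | ±20 | 950 | 4.0% | 5.1% | 163 tps | 1.0s | 41K | $0.06 | $0.21 |
| 20 | 139 | GLM 4.6V | 837 | ±23 | 640 | 5.2% | 6.4% | 21 tps | 1.8s | 128K | $0.38 | $0.90 |
| 21 | 161 | Llama 4 Maverick | 838 | ±10 | 2.4K | 4.3% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 22 | 133 | DeepSeek V3.2 Speciale | 842 | ±38 | 485 | 4.9% | 6.0% | 43 tps | 1.4s | 131K | $0.84 | $1.52 |
| 23 | 121 | Qwen3 32B Fast | 863 | ±13 | 2K | 4.5% | 11.6% | 30 tps | 3.1s | 41K | $0.10 | $0.25 |
| 24 | 119 | ERNIE 4.5 300B A47B | 873 | ±12 | 1.1K | 3.4% | 4.7% | 23 tps | 2.3s | 123K | $0.28 | $1.10 |
| 25 | 143 | Gemini 2.0 Flash | 885 | ±16 | 870 | 5.9% | <0.1% | 76 tps | 0.5s | 1M | $0.14 | $0.56 |
| 26 | 157 | Qwen3 Next 80B A3B Thinking | 890 | ±16 | 1.1K | 3.4% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 27 | 133 | GPT-4.1 nano | 896 | ±11 | 1.8K | 3.2% | 0.6% | 175 tps | 0.5s | 1M | $0.10 | $0.40 |
| 28 | 157 | GPT-5 Nano | 901 | ±9 | 1.2K | 5.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 |
| 29 | 143 | Gemini 2.0 Flash Lite | 902 | ±14 | 990 | 7.9% | <0.1% | 42 tps | 0.5s | 1M | $0.08 | $0.30 |
| 30 | 133 | DeepSeek-R1 0528 | 904 | ±19 | 640 | 4.5% | 1.3% | 93 tps | 0.5s | 64K | $1.60 | $3.67 |
| 31 | 124 | Qwen3 235B A22B Thinking 2507 | 905 | ±16 | 695 | 2.1% | 2.5% | 53 tps | 1.6s | 131K | $0.59 | $5.70 |
| 32 | 65 | DeepSeek V3.2 Exp Chat | 909 | ±11 | 1.3K | 2.6% | 2.6% | 29 tps | 1.5s | 131K | $0.27 | $0.39 |
| 33 | 101 | gpt-oss-20b | 912 | ±12 | 1.5K | 4.1% | 0.5% | 216 tps | 0.5s | 131K | $0.06 | $0.26 |
| 34 | 153 | OpenAI o1 | 926 | ±15 | 915 | 2.1% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 |
| 35 | 148 | DeepSeek-R1 | 936 | ±21 | 705 | 4.1% | 0.8% | 133 tps | 0.6s | 64K | $0.91 | $3.07 |
| 36 | 126 | Qwen3 VL 235B A22B Thinking | 939 | ±15 | 965 | 3.5% | 4.3% | 47 tps | 3.0s | 127K | $0.47 | $3.31 |
| 37 | 133 | Qwen3 14B | 943 | ±16 | 825 | 4.1% | 1.7% | 109 tps | 0.8s | 41K | $0.04 | $0.15 |
| 38 | 139 | OpenAI o4-mini | 947 | ±11 | 1.2K | 2.5% | 1.4% | 97 tps | 7.0s | 128K | $1.10 | $4.40 |
| 39 | 129 | Command A | 948 | ±12 | 1.9K | 3.1% | 2.2% | 42 tps | 0.8s | 256K | $2.00 | $7.33 |
| 40 | 71 | Seed 1.8 251228 | 949 | ±18 | 1.2K | 2.7% | 3.7% | 41 tps | 2.1s | 256K | $0.25 | $2.00 |
| 41 | 68 | GLM 4.7 | 949 | ±13 | 1.6K | 1.8% | 5.8% | 40 tps | 1.5s | 200K | $0.77 | $1.73 |
| 42 | 148 | OpenAI o4-mini-high | 950 | ±12 | 1.5K | 3.8% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 |
| 43 | 126 | DeepSeek V3 | 956 | ±12 | 1.7K | 2.5% | 0.9% | 69 tps | 1.1s | 64K | $0.59 | $1.49 |
| 44 | 56 | DeepSeek V3.1 Turbo | 957 | ±14 | 820 | 4.1% | 0.9% | 173 tps | 1.3s | 164K | $2.00 | $3.75 |
| 45 | 129 | DeepSeek V3.1 Thinking | 958 | ±14 | 1K | 2.4% | 7.1% | 18 tps | 1.8s | 131K | $0.23 | $0.75 |
| 46 | 148 | OpenAI o3 | 960 | ±16 | 600 | 2.4% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 |
| 47 | 84 | GPT-5 Mini Minimal | 968 | ±17 | 795 | 3.6% | 1.2% | 63 tps | 1.4s | 400K | $0.25 | $2.00 |
| 48 | 113 | GLM 4.5 | 969 | ±19 | 1.3K | 3.5% | 3.7% | 46 tps | 1.4s | 131K | $0.43 | $1.63 |
| 49 | 118 | GPT-4.1 mini | 976 | ±13 | 2.7K | 1.8% | 1.1% | 67 tps | 0.9s | 1M | $0.34 | $1.60 |
| 50 | 106 | DeepSeek V3.1 Terminus Thinking | 979 | ±13 | 660 | 3.6% | 5.9% | 27 tps | 1.8s | 131K | $0.56 | $1.68 |
| 51 | 95 | Kimi K2 Thinking | 985 | ±21 | 620 | 3.1% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 |
| 52 | 44 | Kimi K2 Thinking Turbo | 986 | ±14 | 1.1K | 3.2% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 |
| 53 | 113 | Mistral Medium | 989 | ±14 | 1.1K | 2.7% | 1.8% | 48 tps | 0.6s | 33K | $1.48 | $4.55 |
| 54 | 86 | Qwen3 235B A22B | 989 | ±19 | 740 | 3.9% | 5.3% | 71 tps | 0.9s | 41K | $0.23 | $0.63 |
| 55 | 113 | Kimi K2 Fast | 989 | ±10 | 7.4K | 2.2% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |
| 56 | 106 | DeepSeek V3 0324 | 990 | ±11 | 2.1K | 3.0% | 5.8% | 12 tps | 2.7s | 164K | $0.38 | $0.93 |
| 57 | 124 | Kimi K2 0905 Turbo | 991 | ±12 | 1.6K | 1.8% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50 |
| 58 | 113 | Gemini 2.5 Flash Lite Thinking | 996 | ±11 | 2.5K | 3.7% | 1.0% | 118 tps | 4.4s | 1M | $0.03 | $0.13 |
| 59 | 95 | DeepSeek V3.2 Exp Thinking | 999 | ±22 | 775 | 1.9% | 7.2% | 26 tps | 3.0s | 131K | $0.28 | $0.42 |
| 60 | 170 | Kimi K2 0711 | 1002 | ±15 | 720 | 3.4% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 |
| 61 | 106 | Grok 3 | 1003 | ±9 | 2K | 2.6% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 |
| 62 | 86 | Amazon Nova 2 Lite | 1013 | ±18 | 815 | 4.7% | 1.0% | 137 tps | 0.6s | 300K | $0.35 | $2.95 |
| 63 | 101 | Gemini 2.5 Flash Lite | 1014 | ±9 | 5.3K | 3.9% | 1.3% | 210 tps | 0.7s | 1M | $0.10 | $0.40 |
| 64 | 133 | Kimi K2 0905 | 1014 | ±13 | 810 | 2.4% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 |
| 65 | 86 | DeepSeek V3.1 Chat | 1018 | ±12 | 1.1K | 3.1% | 2.8% | 21 tps | 1.6s | 131K | $0.38 | $1.00 |
| 66 | 79 | Qwen3 Max Thinking Preview | 1020 | ±10 | 1.2K | 2.4% | 3.1% | 40 tps | 2.1s | 256K | $1.20 | $6.00 |
| 67 | 93 | Qwen Max | 1021 | ±14 | 1.8K | 2.7% | 1.5% | 49 tps | 1.5s | 33K | $1.60 | $6.40 |
| 68 | 95 | DeepSeek-R1 Turbo | 1021 | ±13 | 485 | 3.0% | 2.6% | 29 tps | 1.8s | 64K | $2.85 | $4.75 |
| 69 | 65 | Mistral Large 3 | 1026 | ±22 | 1.1K | 4.1% | 2.1% | 51 tps | 1.0s | 256K | $0.50 | $1.50 |
| 70 | 129 | Qwen3 Max Thinking | 1029 | ±31 | 600 | 2.4% | 13.5% | 32 tps | 2.3s | 256K | $1.20 | $6.00 |
| 71 | 56 | DeepSeek V3.2 Thinking | 1033 | ±15 | 1.7K | 2.0% | 9.0% | 30 tps | 2.6s | 131K | $0.28 | $0.42 |
| 72 | 44 | DeepSeek V3.1 Terminus Chat | 1037 | ±9 | 1.3K | 2.2% | 3.4% | 27 tps | 1.5s | 131K | $0.86 | $1.80 |
| 73 | 65 | GLM 4.6 | 1041 | ±11 | 1.6K | 2.9% | 5.4% | 39 tps | 1.5s | 200K | $0.42 | $1.66 |
| 74 | 62 | MiniMax M2 | 1043 | ±9 | 1.8K | 4.2% | 2.2% | 39 tps | 2.3s | 205K | $0.21 | $0.85 |
| 75 | 95 | Gemini 2.5 Flash Lite Thinking Preview 0925 | 1044 | ±9 | 1.7K | 3.5% | 1.5% | 152 tps | 3.0s | 1M | $0.10 | $0.40 |
| 76 | 60 | MiniMax M2.1 | 1044 | ±12 | 1.7K | 2.8% | 2.1% | 66 tps | 2.6s | 205K | $0.30 | $1.20 |
| 77 | 81 | GPT-4o | 1046 | ±15 | 1.4K | 2.5% | 1.0% | 49 tps | 2.4s | 128K | $3.71 | $12.57 |
| 78 | 52 | Grok 4 Fast Non-Reasoning | 1054 | ±8 | 1.6K | 2.5% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 |
| 79 | 52 | Claude Haiku 4.5 | 1057 | ±6 | 3.4K | 3.4% | 1.1% | 100 tps | 0.9s | 200K | $1.00 | $5.00 |
| 80 | 40 | DeepSeek V3.2 | 1063 | ±16 | 1.1K | 2.5% | 1.4% | 83 tps | 5.1s | 131K | $0.43 | $1.09 |
| 81 | 42 | Qwen3 Max Instruct Preview | 1063 | ±7 | 2.7K | 1.5% | 1.1% | 31 tps | 1.7s | 256K | $1.43 | $6.61 |
| 82 | 86 | Claude Sonnet 4 | 1066 | ±8 | 5.3K | 2.5% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 |
| 83 | 71 | Gemini 2.5 Flash Lite Preview 0925 | 1066 | ±11 | 2.2K | 2.8% | 1.2% | 209 tps | 0.7s | 1M | $0.25 | $0.35 |
| 84 | 93 | DeepSeek V3 0324 Turbo | 1074 | ±14 | 2.1K | 1.9% | 6.3% | 12 tps | 2.4s | 164K | $0.73 | $1.79 |
| 85 | 62 | GPT-5.1 Instant | 1075 | ±9 | 2.2K | 2.6% | 1.3% | 50 tps | 1.9s | 400K | $1.25 | $10.00 |
| 86 | 71 | GPT-5 Mini | 1075 | ±9 | 2.1K | 4.3% | 2.6% | 66 tps | 14.2s | 400K | $0.25 | $2.00 |
| 87 | 44 | Grok 4.1 Fast Reasoning | 1076 | ±10 | 2.6K | 4.2% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 |
| 88 | 48 | gpt-oss-120b | 1083 | ±7 | 3.5K | 3.0% | 0.7% | 213 tps | 0.5s | 131K | $0.11 | $0.50 |
| 89 | 33 | Kimi K2.5 | 1083 | ±16 | 1.7K | 3.2% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |
| 90 | 56 | Gemini 3.1 Flash Lite Preview Thinking | 1083 | ±32 | 485 | 3.0% | 1.7% | 75 tps | 4.7s | 1M | $0.25 | $1.50 |
| 91 | 48 | Grok 4 Fast Reasoning | 1090 | ±11 | 2.1K | 3.1% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 |
| 92 | 68 | Grok 4 | 1093 | ±5 | 5.7K | 4.0% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 |
| 93 | 95 | Gemini 2.5 Flash | 1098 | ±9 | 4.8K | 2.7% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 |
| 94 | 37 | Kimi K2.5 Instant | 1101 | ±28 | 495 | 1.0% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 |
| 95 | 26 | GPT-5 (High) | 1110 | ±7 | 4.3K | 3.1% | 4.5% | 81 tps | 35.9s | 400K | $1.25 | $10.00 |
| 96 | 71 | Qwen3.5 397B A17B | 1112 | ±24 | 580 | 2.5% | 4.3% | 57 tps | 1.4s | 256K | $0.52 | $3.00 |
| 97 | 29 | Qwen3 VL 235B A22B Instruct | 1114 | ±8 | 1.3K | 2.5% | 3.1% | 75 tps | 1.9s | 129K | $0.37 | $1.81 |
| 98 | 26 | Claude Haiku 4.5 (Extended Thinking) | 1115 | ±12 | 2.2K | 2.7% | 1.4% | 115 tps | 0.7s | 200K | $1.00 | $5.00 |
| 99 | 68 | Qwen Plus (Aug'24) | 1115 | ±8 | 1.9K | 2.6% | 1.4% | 53 tps | 1.3s | 30K | $0.40 | $1.20 |
| 100 | 40 | Qwen3 235B A22B Instruct 2507 | 1118 | ±11 | 2.5K | 2.1% | 6.8% | 13 tps | 1.9s | 262K | $0.13 | $0.52 |
| 101 | 81 | OpenAI o3-pro | 1126 | ±10 | 2.3K | 3.1% | 5.2% | 22 tps | 70.8s | 200K | $20.00 | $80.00 |
| 102 | 33 | Qwen3 30B A3B Instruct 2507 | 1127 | ±9 | 2.5K | 2.9% | 1.2% | 55 tps | 1.3s | 131K | $0.13 | $0.72 |
| 103 | 26 | Grok 4.1 Fast Non-Reasoning | 1130 | ±15 | 2K | 4.3% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 |
| 104 | 60 | Gemini 2.5 Flash Preview 0925 | 1131 | ±10 | 2.1K | 2.7% | 1.2% | 5 tps | 0.9s | 1M | $0.13 | $0.97 |
| 105 | 29 | Nova Experimental Chat 12-10 | 1135 | ±19 | 720 | 2.0% | 2.4% | 84 tps | 12.9s | 98K | $0 | $0 |
| 106 | 33 | Qwen3 Next 80B A3B Instruct | 1142 | ±9 | 1.8K | 2.5% | 0.6% | 84 tps | 1.1s | 256K | $0.20 | $1.42 |
| 107 | 48 | Claude Sonnet 4 (Thinking) | 1144 | ±6 | 4.2K | 3.5% | 1.5% | 52 tps | 1.5s | 200K | $3.00 | $13.67 |
| 108 | 71 | Gemini 2.5 Flash Thinking | 1148 | ±7 | 2.6K | 4.2% | 2.2% | 88 tps | 6.4s | 1M | $0.30 | $2.50 |
| 109 | 37 | Claude Sonnet 4.5 | 1156 | ±5 | 5.1K | 2.8% | 1.4% | 41 tps | 1.3s | 200K | $1.80 | $9.00 |
| 110 | 52 | GPT-5 | 1157 | ±7 | 5.3K | 2.9% | 3.1% | 78 tps | 23.1s | 400K | $1.25 | $9.67 |
| 111 | 44 | Gemini 2.5 Pro | 1158 | ±5 | 7.8K | 3.3% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 |
| 112 | 13 | GPT-5.3 Instant | 1160 | ±19 | 1.5K | 1.9% | 0.9% | 63 tps | 0.8s | 400K | $1.75 | $14.00 |
| 113 | 42 | GPT-5.2 (Extra High) | 1168 | ±12 | 2.4K | 1.8% | 13.2% | 17 tps | 20.5s | 400K | $1.75 | $14.00 |
| 114 | 17 | GPT-5.2 (High) | 1175 | ±10 | 6.4K | 1.7% | 6.7% | 18 tps | 16.3s | 400K | $1.75 | $14.00 |
| 115 | 22 | GLM 5 | 1184 | ±22 | 795 | 2.5% | 3.4% | 36 tps | 2.7s | 200K | $0.72 | $2.55 |
| 116 | 32 | Gemini 2.5 Pro High | 1189 | ±6 | 4.5K | 2.5% | 1.5% | 48 tps | 2.3s | 1M | $1.25 | $10.00 |
| 117 | 17 | Gemini 3 Flash Preview | 1198 | ±14 | 2K | 2.0% | 1.3% | 138 tps | 1.4s | 1M | $0.50 | $3.00 |
| 118 | 14 | Gemini 3 Flash Preview Thinking | 1212 | ±8 | 4.2K | 1.8% | 1.6% | 3 tps | 6.2s | 1M | $0.50 | $3.00 |
| 119 | 16 | GPT-5.2 | 1213 | ±12 | 2.7K | 2.2% | 4.1% | 18 tps | 2.7s | 400K | $1.75 | $14.00 |
| 120 | 22 | GPT-5 Chat | 1220 | ±6 | 10.4K | 2.1% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 |
| 121 | 17 | Claude Opus 4.5 | 1228 | ±9 | 2.9K | 1.8% | 1.5% | 45 tps | 1.5s | 200K | $5.00 | $25.00 |
| 122 | 5 | Claude Sonnet 4.6 (Thinking) | 1235 | ±18 | 1.7K | 2.0% | 4.7% | 57 tps | 1.1s | 200K | $3.00 | $15.00 |
| 123 | 7 | Claude Opus 4.5 (Thinking) | 1248 | ±6 | 9.3K | 2.3% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 |
| 124 | 10 | Claude Sonnet 4.5 (Thinking) | 1260 | ±4 | 8.3K | 1.8% | 1.9% | 44 tps | 1.1s | 200K | $3.00 | $15.00 |
| 125 | 10 | Gemini 3 Pro | 1265 | ±5 | 12K | 1.3% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 |
| 126 | 8 | GPT-5.1 | 1268 | ±6 | 3.3K | 1.5% | 2.3% | 71 tps | 1.4s | 400K | $1.42 | $11.33 |
| 127 | 14 | Gemini 3 Pro (Low) | 1270 | ±12 | 3.5K | 2.4% | 2.4% | 51 tps | 3.5s | 1M | $2.00 | $12.00 |
| 128 | 8 | GPT-5.1 (High) | 1274 | ±6 | 4.2K | 2.5% | 3.2% | 76 tps | 6.9s | 400K | $1.25 | $10.00 |
| 129 | 10 | GPT-5.2 Instant | 1274 | ±8 | 3.9K | 2.2% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 |
| 130 | 4 | Claude Sonnet 4.6 | 1295 | ±16 | 1.8K | 0.8% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 |
| 131 | 6 | Gemini 3.1 Pro | 1344 | ±16 | 2.8K | 2.4% | 3.5% | 35 tps | 4.1s | 1M | $2.00 | $12.00 |
| 132 | 2 | Claude Opus 4.6 | 1483 | ±11 | 3.2K | 0.9% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 |
| 133 | 1 | Claude Opus 4.6 (Thinking) | 1485 | ±11 | 2.3K | 1.3% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00 |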