Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

659
OpenAI o3-mini-low
676
OpenAI o3-mini
719
Llama 4 Maverick
722
Llama 4 Scout
767
Qwen3 Next 80B A3B Thinking
768
OpenAI o4-mini
804
Gemini 2.5 Flash Lite Thinking
816
gpt-oss-20b
824
GPT-4.1 nano
836
Command A
836
GPT-5 Nano
843
DeepSeek V3 0324 Turbo
854
OpenAI o4-mini-high
856
Gemini 2.0 Flash Lite
859
Gemini 2.5 Flash Lite Thinking Preview 0925

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1175OpenAI o3-mini-low659±215052.9%0.7%139 tps1.5s200K$1.10$4.40
2177OpenAI o3-mini676±276901.4%0.8%143 tps3.3s200K$1.10$4.40
3161Llama 4 Maverick719±271K2.9%1.2%88 tps2.4s1M$0.23$0.83
4160Llama 4 Scout722±337002.1%0.6%88 tps5.1s131K$0.18$0.46
5157Qwen3 Next 80B A3B Thinking767±238102.4%0.6%175 tps1.3s256K$0.21$2.26
6139OpenAI o4-mini768±325450.9%1.4%97 tps7.0s128K$1.10$4.40
7113Gemini 2.5 Flash Lite Thinking804±187753.1%1.0%118 tps4.4s1M$0.03$0.13
8101gpt-oss-20b816±205551.8%0.5%216 tps0.5s131K$0.06$0.26
9133GPT-4.1 nano824±227352.0%0.6%175 tps0.5s1M$0.10$0.40
10129Command A836±158551.2%2.2%42 tps0.8s256K$2.00$7.33
11157GPT-5 Nano836±346854.2%3.2%113 tps20.9s400K$0.05$0.40
1293DeepSeek V3 0324 Turbo843±216350.8%6.3%12 tps2.4s164K$0.73$1.79
13148OpenAI o4-mini-high854±225601.8%1.9%117 tps15.9s200K$1.10$4.40
14143Gemini 2.0 Flash Lite856±245853.3%<0.1%42 tps0.5s1M$0.08$0.30
1595Gemini 2.5 Flash Lite Thinking Preview 0925859±161.2K2.5%1.5%152 tps3.0s1M$0.10$0.40
1671GPT-5 Mini870±159404.1%2.6%66 tps14.2s400K$0.25$2.00
17106Grok 3872±267451.3%1.5%53 tps0.6s1M$3.67$18.33
18124Kimi K2 0905 Turbo881±177102.1%0.7%373 tps0.5s262K$1.70$6.50
1965Mistral Large 3881±274953.9%2.1%51 tps1.0s256K$0.50$1.50
20129DeepSeek V3.1 Thinking886±165102.9%7.1%18 tps1.8s131K$0.23$0.75
21113Kimi K2 Fast887±141.6K1.0%0.8%365 tps0.5s131K$1.00$3.00
2262MiniMax M2900±247202.7%2.2%39 tps2.3s205K$0.21$0.85
23118GPT-4.1 mini900±169501.0%1.1%67 tps0.9s1M$0.34$1.60
24126DeepSeek V3910±385651.7%0.9%69 tps1.1s64K$0.59$1.49
2544Kimi K2 Thinking Turbo917±275301.9%2.0%75 tps1.4s262K$1.15$8.00
26106DeepSeek V3 0324920±255700.9%5.8%12 tps2.7s164K$0.38$0.93
27126Qwen3 VL 235B A22B Thinking922±196453.0%4.3%47 tps3.0s127K$0.47$3.31
2879Qwen3 Max Thinking Preview925±265301.9%3.1%40 tps2.1s256K$1.20$6.00
2956DeepSeek V3.2 Thinking942±267052.8%9.0%30 tps2.6s131K$0.28$0.42
3081GPT-4o945±315053.8%1.0%49 tps2.4s128K$3.71$12.57
3171Gemini 2.5 Flash Lite Preview 0925948±161.1K2.2%1.2%209 tps0.7s1M$0.25$0.35
32101Gemini 2.5 Flash Lite948±161.6K2.7%1.3%210 tps0.7s1M$0.10$0.40
3384GPT-5 Mini Minimal953±165953.3%1.2%63 tps1.4s400K$0.25$2.00
3452GPT-5957±201.6K2.9%3.1%78 tps23.1s400K$1.25$9.67
3556DeepSeek V3.1 Turbo969±376651.5%0.9%173 tps1.3s164K$2.00$3.75
3693Qwen Max979±196952.1%1.5%49 tps1.5s33K$1.60$6.40
3768GLM 4.7992±336352.3%5.8%40 tps1.5s200K$0.77$1.73
3848gpt-oss-120b1000±151.1K1.3%0.7%213 tps0.5s131K$0.11$0.50
3971Gemini 2.5 Flash Thinking1000±181K1.9%2.2%88 tps6.4s1M$0.30$2.50
4086Claude Sonnet 41011±191.8K1.4%1.8%49 tps1.3s200K$3.00$15.00
4144Grok 4.1 Fast Reasoning1016±181.4K2.0%1.5%58 tps7.3s2M$0.20$0.50
4248Grok 4 Fast Reasoning1022±141.2K2.0%2.1%102 tps3.1s2M$0.30$0.75
4368Grok 41022±102.1K2.5%3.9%29 tps11.1s256K$3.00$15.00
4452Grok 4 Fast Non-Reasoning1023±168701.7%1.5%93 tps0.6s2M$0.27$0.67
4526Grok 4.1 Fast Non-Reasoning1023±218201.8%0.9%101 tps0.5s2M$0.20$0.50
4660Gemini 2.5 Flash Preview 09251025±131.2K2.0%1.2%5 tps0.9s1M$0.13$0.97
4760MiniMax M2.11036±226951.4%2.1%66 tps2.6s205K$0.30$1.20
4833Qwen3 Next 80B A3B Instruct1038±159202.6%0.6%84 tps1.1s256K$0.20$1.42
4937Claude Sonnet 4.51040±82K3.2%1.4%41 tps1.3s200K$1.80$9.00
5062GPT-5.1 Instant1040±139152.7%1.3%50 tps1.9s400K$1.25$10.00
5168Qwen Plus (Aug'24)1048±227302.0%1.4%53 tps1.3s30K$0.40$1.20
5295Gemini 2.5 Flash1049±182.1K1.9%1.3%2 tps3.7s1M$0.30$2.50
5340Qwen3 235B A22B Instruct 25071053±196800.7%6.8%13 tps1.9s262K$0.13$0.52
5433Qwen3 30B A3B Instruct 25071056±188102.4%1.2%55 tps1.3s131K$0.13$0.72
5565GLM 4.61059±256402.3%5.4%39 tps1.5s200K$0.42$1.66
5652Claude Haiku 4.51060±131.6K3.1%1.1%100 tps0.9s200K$1.00$5.00
5726GPT-5 (High)1061±92.5K2.7%4.5%81 tps35.9s400K$1.25$10.00
5844DeepSeek V3.1 Terminus Chat1078±125801.7%3.4%27 tps1.5s131K$0.86$1.80
5942Qwen3 Max Instruct Preview1083±171.1K1.7%1.1%31 tps1.7s256K$1.43$6.61
6048Claude Sonnet 4 (Thinking)1093±141.6K2.4%1.5%52 tps1.5s200K$3.00$13.67
6129Qwen3 VL 235B A22B Instruct1094±156752.2%3.1%75 tps1.9s129K$0.37$1.81
6210Claude Sonnet 4.5 (Thinking)1102±133.2K3.6%1.9%44 tps1.1s200K$3.00$15.00
6342GPT-5.2 (Extra High) 1107±248902.7%13.2%17 tps20.5s400K$1.75$14.00
6433Kimi K2.51110±267202.0%6.5%33 tps1.7s262K$0.34$2.57
6513GPT-5.3 Instant1110±335151.0%0.9%63 tps0.8s400K$1.75$14.00
6632Gemini 2.5 Pro High1119±102.5K2.4%1.5%48 tps2.3s1M$1.25$10.00
6726Claude Haiku 4.5 (Extended Thinking)1123±191.1K1.9%1.4%115 tps0.7s200K$1.00$5.00
6844Gemini 2.5 Pro1125±63.1K3.7%2.3%45 tps2.6s1M$1.25$10.00
6917Claude Opus 4.51135±211.1K1.4%1.5%45 tps1.5s200K$5.00$25.00
7017GPT-5.2 (High)1145±152.2K1.6%6.7%18 tps16.3s400K$1.75$14.00
7116GPT-5.21162±187851.9%4.1%18 tps2.7s400K$1.75$14.00
7222GPT-5 Chat1164±123.5K1.6%1.3%95 tps0.9s400K$1.25$10.00
7317Gemini 3 Flash Preview1165±216751.5%1.3%138 tps1.4s1M$0.50$3.00
7414Gemini 3 Flash Preview Thinking1167±171.4K1.7%1.6%3 tps6.2s1M$0.50$3.00
755Claude Sonnet 4.6 (Thinking)1222±236301.6%4.7%57 tps1.1s200K$3.00$15.00
768GPT-5.11230±131.3K1.9%2.3%71 tps1.4s400K$1.42$11.33
778GPT-5.1 (High)1231±151.8K1.7%3.2%76 tps6.9s400K$1.25$10.00
7814Gemini 3 Pro (Low)1240±191.2K0.8%2.4%51 tps3.5s1M$2.00$12.00
796Gemini 3.1 Pro1244±231.4K1.7%3.5%35 tps4.1s1M$2.00$12.00
8010Gemini 3 Pro1248±163.5K1.5%2.1%50 tps3.6s1M$2.00$12.00
8110GPT-5.2 Instant1256±151.3K1.8%1.7%52 tps2.0s400K$1.75$14.00
824Claude Sonnet 4.61266±276501.5%1.6%47 tps1.2s200K$3.00$15.00
837Claude Opus 4.5 (Thinking)1280±142.8K1.4%1.8%49 tps1.4s200K$5.00$25.00
842Claude Opus 4.61424±169501.0%2.1%48 tps1.7s200K$5.00$25.00
851Claude Opus 4.6 (Thinking)1524±169801.0%2.5%56 tps1.6s200K$5.00$25.00
Show Less