Models
Topics
Language
More

More filters

Show inactive models

Hide models that are no longer actively available on Yupp.

Turns

Filter model performance by the number of turns in a conversation.

Open license models

Filter the leaderboard to only show models that have an open license.

659
OpenAI o3-mini-low
676
OpenAI o3-mini
722
Llama 4 Scout
767
Qwen3 Next 80B A3B Thinking
768
OpenAI o4-mini
804
Gemini 2.5 Flash Lite Thinking
824
GPT-4.1 nano
836
GPT-5 Nano
843
DeepSeek V3 0324 Turbo
854
OpenAI o4-mini-high
856
Gemini 2.0 Flash Lite
859
Gemini 2.5 Flash Lite Thinking Preview 0925
870
GPT-5 Mini
872
Grok 3
881
Kimi K2 0905 Turbo

Last updated about 1 month ago

RankOverallNameVIBE
Score
Confidence
Interval
VotesDownvote %Abort %SpeedLatencyContextCost
(Input)
Cost
(Output)
1175OpenAI o3-mini-low659±215052.9%0.7%139 tps1.5s200K$1.10$4.40
2177OpenAI o3-mini676±276901.4%0.8%143 tps3.3s200K$1.10$4.40
3160Llama 4 Scout722±337002.1%0.6%88 tps5.1s131K$0.18$0.46
4157Qwen3 Next 80B A3B Thinking767±238102.4%0.6%175 tps1.3s256K$0.21$2.26
5139OpenAI o4-mini768±325450.9%1.4%97 tps7.0s128K$1.10$4.40
6113Gemini 2.5 Flash Lite Thinking804±187753.1%1.0%118 tps4.4s1M$0.03$0.13
7133GPT-4.1 nano824±227352.0%0.6%175 tps0.5s1M$0.10$0.40
8157GPT-5 Nano836±346854.2%3.2%113 tps20.9s400K$0.05$0.40
993DeepSeek V3 0324 Turbo843±216350.8%6.3%12 tps2.4s164K$0.73$1.79
10148OpenAI o4-mini-high854±225601.8%1.9%117 tps15.9s200K$1.10$4.40
11143Gemini 2.0 Flash Lite856±245853.3%<0.1%42 tps0.5s1M$0.08$0.30
1295Gemini 2.5 Flash Lite Thinking Preview 0925859±161.2K2.5%1.5%152 tps3.0s1M$0.10$0.40
1371GPT-5 Mini870±159404.1%2.6%66 tps14.2s400K$0.25$2.00
14106Grok 3872±267451.3%1.5%53 tps0.6s1M$3.67$18.33
15124Kimi K2 0905 Turbo881±177102.1%0.7%373 tps0.5s262K$1.70$6.50
16129DeepSeek V3.1 Thinking886±165102.9%7.1%18 tps1.8s131K$0.23$0.75
1762MiniMax M2900±247202.7%2.2%39 tps2.3s205K$0.21$0.85
18118GPT-4.1 mini900±169501.0%1.1%67 tps0.9s1M$0.34$1.60
19106DeepSeek V3 0324920±255700.9%5.8%12 tps2.7s164K$0.38$0.93
20126Qwen3 VL 235B A22B Thinking922±196453.0%4.3%47 tps3.0s127K$0.47$3.31
2179Qwen3 Max Thinking Preview925±265301.9%3.1%40 tps2.1s256K$1.20$6.00
2281GPT-4o945±315053.8%1.0%49 tps2.4s128K$3.71$12.57
2371Gemini 2.5 Flash Lite Preview 0925948±161.1K2.2%1.2%209 tps0.7s1M$0.25$0.35
24101Gemini 2.5 Flash Lite948±161.6K2.7%1.3%210 tps0.7s1M$0.10$0.40
2584GPT-5 Mini Minimal953±165953.3%1.2%63 tps1.4s400K$0.25$2.00
2652GPT-5957±201.6K2.9%3.1%78 tps23.1s400K$1.25$9.67
2756DeepSeek V3.1 Turbo969±376651.5%0.9%173 tps1.3s164K$2.00$3.75
2893Qwen Max979±196952.1%1.5%49 tps1.5s33K$1.60$6.40
2968GLM 4.7992±336352.3%5.8%40 tps1.5s200K$0.77$1.73
3071Gemini 2.5 Flash Thinking1000±181K1.9%2.2%88 tps6.4s1M$0.30$2.50
3186Claude Sonnet 41011±191.8K1.4%1.8%49 tps1.3s200K$3.00$15.00
3244Grok 4.1 Fast Reasoning1016±181.4K2.0%1.5%58 tps7.3s2M$0.20$0.50
3348Grok 4 Fast Reasoning1022±141.2K2.0%2.1%102 tps3.1s2M$0.30$0.75
3468Grok 41022±102.1K2.5%3.9%29 tps11.1s256K$3.00$15.00
3552Grok 4 Fast Non-Reasoning1023±168701.7%1.5%93 tps0.6s2M$0.27$0.67
3626Grok 4.1 Fast Non-Reasoning1023±218201.8%0.9%101 tps0.5s2M$0.20$0.50
3760Gemini 2.5 Flash Preview 09251025±131.2K2.0%1.2%5 tps0.9s1M$0.13$0.97
3860MiniMax M2.11036±226951.4%2.1%66 tps2.6s205K$0.30$1.20
3937Claude Sonnet 4.51040±82K3.2%1.4%41 tps1.3s200K$1.80$9.00
4062GPT-5.1 Instant1040±139152.7%1.3%50 tps1.9s400K$1.25$10.00
View All (74 models)