Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41 | 214 | 901 | ±3 | 15.8K | 3.0% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 | |
| 42 | 235 | 902 | ±5 | 6.6K | 2.2% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 | |
| 43 | 253 | 903 | ±13 | 785 | 4.3% | 4.7% | 21 tps | 1.9s | 128K | $10.00 | $30.00 | |
| 44 | 246 | 903 | ±5 | 5.2K | 2.1% | 1.2% | 140 tps | 0.6s | 64K | $2.00 | $6.00 | |
| 45 | 235 | 904 | ±2 | 11.7K | 2.1% | 2.6% | 40 tps | 1.6s | 33K | $0.14 | $0.14 | |
| 46 | 253 | 906 | ±3 | 6.9K | 1.8% | 1.4% | 44 tps | 1.4s | 8K | $0.80 | $0.80 | |
| 47 | 256 | 909 | ±4 | 7.1K | 3.0% | 0.6% | 176 tps | 1.0s | 33K | $0.06 | $0.10 | |
| 48 | 235 | 909 | ±2 | 12.6K | 1.9% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 | |
| 49 | 246 | 911 | ±2 | 9.8K | 1.1% | 11.6% | 11 tps | 2.5s | 66K | $0.77 | $0.77 | |
| 50 | 240 | 911 | ±6 | 4K | 1.8% | 1.0% | 55 tps | 1.5s | 8K | $0.20 | $2.00 | |
| 51 | 229 | 912 | ±4 | 7.4K | 1.1% | <0.1% | 33 tps | 3.1s | 4K | $0.19 | $0.19 | |
| 52 | 229 | 915 | ±8 | 1.3K | 2.9% | 1.9% | 61 tps | 1.0s | 8K | $0.07 | $0.09 | |
| 53 | 214 | 917 | ±5 | 4.7K | 1.7% | 1.4% | 54 tps | 1.5s | 131K | $2.00 | $5.00 | |
| 54 | 225 | 920 | ±3 | 9.8K | 2.0% | 5.8% | 54 tps | 0.6s | 128K | $0.30 | $0.99 | |
| 55 | 229 | 922 | ±3 | 7.7K | 2.5% | 1.4% | 177 tps | 0.4s | 128K | $0.14 | $0.14 | |
| 56 | 235 | 923 | ±4 | 5.3K | 2.2% | 2.2% | 142 tps | 0.6s | 33K | $0.23 | $0.23 | |
| 57 | 225 | 923 | ±3 | 10.7K | 1.6% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 | |
| 58 | 222 | Sky T1 32B Preview | 923 | ±3 | 11.2K | 1.6% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 59 | 214 | 925 | ±5 | 4.2K | 3.3% | 2.0% | 78 tps | 1.0s | 131K | $0.88 | $0.88 | |
| 60 | 240 | 926 | ±3 | 8K | 1.4% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 | |
| 61 | 240 | 927 | ±11 | 900 | 2.7% | 5.3% | 28 tps | 1.3s | 128K | $0.38 | $0.55 | |
| 62 | 240 | 928 | ±3 | 4.2K | 1.2% | <0.1% | 112 tps | 0.4s | 131K | $0.07 | $0.13 | |
| 63 | 229 | 928 | ±6 | 4K | 1.6% | 1.2% | 54 tps | 1.5s | 8K | $2.00 | $5.00 | |
| 64 | 246 | 929 | ±3 | 8.6K | 2.2% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 | |
| 65 | 229 | 930 | ±6 | 2.7K | 3.6% | 1.8% | 87 tps | 1.5s | 120K | $0.07 | $0.28 | |
| 66 | 214 | 930 | ±2 | 17.9K | 1.6% | 1.5% | 43 tps | 0.5s | 128K | $0.50 | $1.50 | |
| 67 | 201 | 932 | ±4 | 9K | 3.4% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 | |
| 68 | 186 | 933 | ±4 | 7.9K | 3.7% | 3.7% | 64 tps | 2.1s | 128K | $0.04 | $0.40 | |
| 69 | 214 | 934 | ±4 | 7.5K | 2.3% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 | |
| 70 | 201 | 938 | ±3 | 9.7K | 1.7% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 | |
| 71 | 209 | Llama 3.3 Swallow 70B Instruct | 938 | ±3 | 10.7K | 3.2% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 72 | 225 | 940 | ±3 | 14K | 1.9% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 | |
| 73 | 209 | 944 | ±3 | 9K | 2.3% | 2.4% | 40 tps | 1.6s | 1M | $0.40 | $1.61 | |
| 74 | 222 | 944 | ±3 | 13.6K | 1.6% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 | |
| 75 | 235 | 945 | ±2 | 8.8K | 1.0% | <0.1% | 76 tps | 1.0s | 131K | $0.08 | $0.09 | |
| 76 | 225 | 946 | ±3 | 7.3K | 2.1% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 | |
| 77 | 214 | 947 | ±2 | 11.8K | 0.6% | 12.5% | 33 tps | 2.1s | 128K | $1.00 | $1.00 | |
| 78 | 186 | 948 | ±2 | 28.5K | 3.9% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 | |
| 79 | 209 | 950 | ±2 | 6K | 1.0% | 1.3% | 74 tps | 0.9s | 16K | $0.75 | $1.75 | |
| 80 | 209 | 952 | ±19 | 495 | 3.9% | 5.8% | 64 tps | 0.7s | 256K | $0.10 | $0.15 |