Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41 | 246 | 886 | ±5 | 7.5K | 1.4% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 | |
| 42 | 222 | 888 | ±8 | 2.4K | 4.5% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 | |
| 43 | 229 | 889 | ±5 | 4.4K | 3.3% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 | |
| 44 | 253 | 889 | ±4 | 6.5K | 1.2% | 1.4% | 44 tps | 1.4s | 8K | $0.80 | $0.80 | |
| 45 | 201 | 891 | ±4 | 7K | 1.9% | 2.1% | 71 tps | 1.7s | 128K | $0.15 | $0.60 | |
| 46 | 240 | 891 | ±11 | 1.6K | 3.0% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 | |
| 47 | 256 | 892 | ±5 | 6.1K | 1.9% | 0.6% | 176 tps | 1.0s | 33K | $0.06 | $0.10 | |
| 48 | 229 | 893 | ±3 | 6.8K | 1.1% | <0.1% | 33 tps | 3.1s | 4K | $0.19 | $0.19 | |
| 49 | 240 | 893 | ±6 | 3.8K | 0.9% | 1.0% | 55 tps | 1.5s | 8K | $0.20 | $2.00 | |
| 50 | 246 | 894 | ±3 | 9.3K | 0.8% | 11.6% | 11 tps | 2.5s | 66K | $0.77 | $0.77 | |
| 51 | 235 | 894 | ±3 | 10.8K | 1.2% | 2.6% | 40 tps | 1.6s | 33K | $0.14 | $0.14 | |
| 52 | 274 | 896 | ±10 | 1.4K | 2.1% | 6.2% | 22 tps | 1.8s | 131K | $0.37 | $0.39 | |
| 53 | 246 | 897 | ±10 | 1K | 1.9% | 1.1% | 67 tps | 0.6s | 131K | $0.12 | $0.39 | |
| 54 | 229 | 898 | ±8 | 3.8K | 1.0% | 1.2% | 54 tps | 1.5s | 8K | $2.00 | $5.00 | |
| 55 | 235 | 899 | ±6 | 4.7K | 1.3% | 2.2% | 142 tps | 0.6s | 33K | $0.23 | $0.23 | |
| 56 | 240 | 903 | ±3 | 7K | 0.6% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 | |
| 57 | 235 | 904 | ±5 | 6.3K | 1.3% | 2.8% | 36 tps | 0.7s | 128K | $2.08 | $9.45 | |
| 58 | 229 | 905 | ±4 | 6.8K | 1.4% | 1.4% | 177 tps | 0.4s | 128K | $0.14 | $0.14 | |
| 59 | 222 | Sky T1 32B Preview | 905 | ±4 | 10.5K | 1.1% | 7.8% | 73 tps | 0.6s | 16K | $0.12 | $0.18 |
| 60 | 235 | 908 | ±3 | 8.3K | 0.7% | <0.1% | 76 tps | 1.0s | 131K | $0.08 | $0.09 | |
| 61 | 194 | INTELLECT-3 | 909 | ±14 | 570 | 2.6% | 1.5% | 114 tps | 0.6s | 131K | $0.20 | $1.10 |
| 62 | 235 | 909 | ±4 | 11.3K | 1.0% | 1.3% | 138 tps | 0.7s | 131K | $0.02 | $0.04 | |
| 63 | 240 | 910 | ±5 | 3.8K | 0.5% | <0.1% | 112 tps | 0.4s | 131K | $0.07 | $0.13 | |
| 64 | 225 | 910 | ±5 | 6.7K | 1.1% | 1.5% | 171 tps | 0.5s | 131K | $0.15 | $0.15 | |
| 65 | 225 | 913 | ±3 | 9.6K | 1.5% | 5.8% | 54 tps | 0.6s | 128K | $0.30 | $0.99 | |
| 66 | 201 | 917 | ±4 | 5.1K | 1.3% | 2.4% | 180 tps | 0.6s | 131K | $0.10 | $0.30 | |
| 67 | 214 | 919 | ±8 | 3.7K | 1.5% | 2.0% | 78 tps | 1.0s | 131K | $0.88 | $0.88 | |
| 68 | 214 | 920 | ±4 | 9K | 1.3% | 4.2% | 73 tps | 0.8s | 131K | $0.05 | $0.12 | |
| 69 | 177 | 921 | ±3 | 19.4K | 2.3% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 | |
| 70 | 209 | 921 | ±5 | 2.5K | 1.9% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 | |
| 71 | 214 | 922 | ±5 | 4.4K | 0.9% | 1.4% | 54 tps | 1.5s | 131K | $2.00 | $5.00 | |
| 72 | 222 | 924 | ±4 | 13K | 1.0% | 1.7% | 48 tps | 0.9s | 256K | $1.50 | $6.00 | |
| 73 | 214 | 924 | ±4 | 6.9K | 1.4% | 3.7% | 40 tps | 1.9s | 131K | $0.08 | $0.27 | |
| 74 | 175 | 925 | ±3 | 18K | 2.6% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 | |
| 75 | 201 | 927 | ±3 | 8.9K | 0.9% | 2.0% | 60 tps | 0.8s | 128K | $0.17 | $0.29 | |
| 76 | 186 | 927 | ±2 | 21.1K | 1.5% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 | |
| 77 | 209 | 928 | ±4 | 5.8K | 0.6% | 1.3% | 74 tps | 0.9s | 16K | $0.75 | $1.75 | |
| 78 | 209 | Llama 3.3 Swallow 70B Instruct | 928 | ±3 | 9.7K | 1.1% | 1.4% | 153 tps | 1.3s | 131K | $0.13 | $0.39 |
| 79 | 225 | 928 | ±3 | 12.7K | 1.1% | 1.1% | 76 tps | 0.4s | 128K | $0.04 | $0.15 | |
| 80 | 225 | 929 | ±3 | 9.2K | 0.6% | <0.1% | 22 tps | 0.6s | 16K | $3.00 | $4.00 |