Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 161 | 194 | INTELLECT-3 | 999 | ±10 | 895 | 2.7% | 1.5% | 114 tps | 0.6s | 131K | $0.20 | $1.10 |
| 162 | 157 | 998 | ±3 | 16.7K | 5.2% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 | |
| 163 | 186 | 998 | ±5 | 2.8K | 4.9% | 1.3% | 58 tps | 1.0s | 256K | $1.33 | $5.33 | |
| 164 | 165 | 998 | ±3 | 12.9K | 6.5% | 1.9% | 94 tps | 1.5s | 128K | $0.01 | $0.01 | |
| 165 | 170 | 997 | ±3 | 11.7K | 2.9% | 1.5% | 77 tps | 0.6s | 131K | $0.40 | $2.00 | |
| 166 | 160 | 997 | ±2 | 66.9K | 2.4% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 | |
| 167 | 161 | 996 | ±4 | 9.9K | 5.5% | 2.4% | 61 tps | 1.4s | 41K | $0.02 | $0.07 | |
| 168 | 186 | 995 | ±9 | 2.1K | 5.0% | 1.9% | 113 tps | 1.1s | 131K | $0.02 | $0.08 | |
| 169 | 170 | 994 | ±3 | 15.2K | 2.5% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 | |
| 170 | 165 | 994 | ±4 | 9.9K | 2.6% | 2.5% | 57 tps | 1.3s | 128K | $1.50 | $4.50 | |
| 171 | 194 | 993 | ±9 | 1.9K | 1.3% | 4.5% | 21 tps | 1.7s | 8K | $1.08 | $1.38 | |
| 172 | 201 | 992 | ±7 | 2.6K | 2.8% | 0.5% | 125 tps | 0.4s | 131K | $0.30 | $0.30 | |
| 173 | 148 | 988 | ±2 | 33.5K | 4.5% | 1.9% | 117 tps | 15.9s | 200K | $1.10 | $4.40 | |
| 174 | 133 | 988 | ±3 | 16.2K | 3.9% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 | |
| 175 | 148 | 987 | ±3 | 12K | 2.6% | 0.9% | 85 tps | 6.8s | 128K | $7.33 | $29.33 | |
| 176 | 179 | Baichuan-M2-32B | 983 | ±7 | 1.9K | 5.9% | <0.1% | 32 tps | 3.3s | 131K | $0.07 | $0.07 |
| 177 | 186 | 983 | ±6 | 3.5K | 3.7% | 1.8% | 35 tps | 1.1s | 66K | $0.06 | $0.10 | |
| 178 | 153 | 982 | ±4 | 18.6K | 2.5% | 4.2% | 92 tps | 5.5s | 200K | $15.00 | $60.00 | |
| 179 | 177 | 982 | ±2 | 11.2K | 1.8% | 7.5% | 15 tps | 2.4s | 131K | $0.06 | $0.18 | |
| 180 | 179 | 982 | ±2 | 24.5K | 1.6% | 0.9% | 96 tps | 0.7s | 300K | $0.80 | $1.70 | |
| 181 | 179 | 979 | ±2 | 28K | 1.8% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 | |
| 182 | 186 | 977 | ±2 | 15.8K | 1.3% | 2.0% | 59 tps | 1.2s | 256K | $1.33 | $5.33 | |
| 183 | 194 | 976 | ±3 | 10.8K | 4.1% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 | |
| 184 | 179 | 976 | ±14 | 925 | 2.6% | 6.3% | 30 tps | 0.8s | 128K | $0.17 | $0.22 | |
| 185 | 186 | 976 | ±2 | 25.5K | 1.8% | 2.0% | 30 tps | 0.5s | 8K | $0.01 | $0.02 | |
| 186 | 157 | 974 | ±3 | 10.1K | 6.0% | 3.2% | 113 tps | 20.9s | 400K | $0.05 | $0.40 | |
| 187 | 194 | 972 | ±4 | 7.7K | 1.5% | 2.6% | 77 tps | 0.6s | 33K | $0.07 | $0.14 | |
| 188 | 179 | 972 | ±4 | 5.6K | 2.1% | 1.2% | 96 tps | 1.2s | 131K | $0.14 | $0.26 | |
| 189 | 175 | 971 | ±13 | 900 | 4.3% | 7.2% | 24 tps | 1.9s | 262K | $0.07 | $0.23 | |
| 190 | 194 | 967 | ±2 | 9.6K | 1.9% | 1.5% | 152 tps | 0.5s | 8K | $0.16 | $0.16 | |
| 191 | 165 | DeepSeek R1T2 Chimera | 967 | ±4 | 5.9K | 3.3% | 3.0% | 28 tps | 1.8s | 164K | $0.13 | $0.45 |
| 192 | 175 | 966 | ±2 | 30.5K | 4.6% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 | |
| 193 | 194 | 966 | ±3 | 17.5K | 1.5% | 1.6% | 156 tps | 0.5s | 40K | $0.37 | $1.10 | |
| 194 | 177 | 962 | ±2 | 33.6K | 4.2% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 | |
| 195 | 201 | 961 | ±10 | 1.5K | 5.7% | 4.9% | 36 tps | 3.5s | 123K | $0.42 | $1.25 | |
| 196 | 161 | 961 | ±6 | 3.3K | 1.8% | 5.2% | 14 tps | 1.3s | 164K | $0.40 | $1.56 | |
| 197 | 201 | 960 | ±2 | 13.1K | 1.8% | 6.0% | 85 tps | 0.7s | 8K | $0.12 | $0.16 | |
| 198 | 209 | 960 | ±5 | 3.6K | 3.1% | 2.5% | 108 tps | 1.6s | 256K | $0.07 | $0.30 | |
| 199 | 194 | 960 | ±16 | 1.4K | 4.8% | 12.2% | 15 tps | 2.2s | 131K | $0 | $0 | |
| 200 | 186 | 958 | ±2 | 26.4K | 4.4% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 |