Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 241 | 229 | 889 | ±5 | 4.4K | 3.3% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 | |
| 242 | 222 | 888 | ±8 | 2.4K | 4.5% | 0.6% | 103 tps | 0.3s | 33K | $0.15 | $0.15 | |
| 243 | 246 | 886 | ±5 | 7.5K | 1.4% | 0.8% | 248 tps | 0.4s | 131K | $0.08 | $0.08 | |
| 244 | 246 | 886 | ±6 | 4.7K | 1.1% | 1.2% | 140 tps | 0.6s | 64K | $2.00 | $6.00 | |
| 245 | 265 | 876 | ±9 | 1.3K | 3.4% | 2.8% | 339 tps | 0.6s | 131K | $0.10 | $0.10 | |
| 246 | 240 | 872 | ±4 | 3.8K | 0.8% | 1.4% | 53 tps | 1.4s | 33K | $1.00 | $3.00 | |
| 247 | 246 | 871 | ±4 | 5.4K | 1.6% | 1.8% | 142 tps | 0.7s | 66K | $0.45 | $0.45 | |
| 248 | 229 | 868 | ±9 | 1.2K | 2.4% | 1.9% | 61 tps | 1.0s | 8K | $0.07 | $0.09 | |
| 249 | 256 | 867 | ±3 | 7.7K | 1.2% | 5.1% | 28 tps | 1.3s | 128K | $0.10 | $0.32 | |
| 250 | 260 | 859 | ±6 | 4.4K | 1.6% | 1.7% | 142 tps | 0.6s | 32K | $0.43 | $1.30 | |
| 251 | 265 | 858 | ±4 | 6.5K | 0.8% | 0.6% | 50 tps | 3.2s | 8K | $2.50 | $10.00 | |
| 252 | 256 | 856 | ±8 | 3.2K | 2.0% | 1.8% | 90 tps | 1.7s | 33K | $0.15 | $0.15 | |
| 253 | 260 | 849 | ±6 | 4.8K | 1.2% | 0.7% | 176 tps | 0.4s | 33K | $0.25 | $0.25 | |
| 254 | 265 | 849 | ±5 | 5K | 1.5% | 1.3% | 54 tps | 0.4s | 33K | $0.60 | $0.60 | |
| 255 | 214 | 847 | ±16 | 840 | 4.0% | 6.3% | 43 tps | 3.2s | 128K | $0.35 | $0.62 | |
| 256 | 256 | 844 | ±5 | 5.7K | 1.3% | 0.2% | 79 tps | 0.7s | 33K | $0.23 | $0.31 | |
| 257 | 284 | 843 | ±5 | 2.7K | 1.6% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 | |
| 258 | 260 | Apriel 1.6 15B Thinker | 840 | ±12 | 855 | 2.3% | 2.6% | 92 tps | 0.4s | 131K | $0 | $0 |
| 259 | 214 | 839 | ±6 | 3.6K | 1.6% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 | |
| 260 | 240 | 839 | ±18 | 675 | 2.2% | 5.3% | 28 tps | 1.3s | 128K | $0.38 | $0.55 | |
| 261 | 271 | 837 | ±4 | 6.8K | 0.9% | 1.1% | 33 tps | 3.4s | 8K | $2.50 | $10.00 | |
| 262 | 260 | 826 | ±4 | 5K | 3.5% | 3.6% | 32 tps | 0.8s | 131K | $1.00 | $3.00 | |
| 263 | 274 | 824 | ±9 | 995 | 3.4% | 2.5% | 50 tps | 1.0s | 33K | $0.06 | $0.25 | |
| 264 | 271 | 818 | ±6 | 4.3K | 1.6% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 | |
| 265 | 265 | 816 | ±8 | 1.7K | 5.2% | 6.7% | 184 tps | 0.4s | 33K | $0.01 | $0.02 | |
| 266 | 271 | 814 | ±4 | 5.5K | 1.2% | 2.3% | 20 tps | 1.1s | 131K | $0.80 | $0.80 | |
| 267 | 265 | 808 | ±12 | 1.3K | 5.5% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 | |
| 268 | 281 | 796 | ±4 | 9K | 1.6% | 1.2% | 22 tps | 1.1s | 4K | $0.18 | $0.18 | |
| 269 | 265 | 786 | ±11 | 1.7K | 3.7% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 | |
| 270 | 281 | 779 | ±7 | 1.3K | 3.6% | <0.1% | 100 tps | 0.4s | 8K | $0.09 | $0.09 | |
| 271 | 274 | 778 | ±15 | 885 | 2.7% | 0.9% | 61 tps | 0.4s | 8K | $0.50 | $1.50 | |
| 272 | 285 | 777 | ±5 | 4.1K | 2.4% | 2.3% | 67 tps | 2.0s | 33K | $0.01 | $0.01 | |
| 273 | 285 | 774 | ±5 | 3.7K | 2.0% | 7.4% | 40 tps | 1.1s | 128K | $0.07 | $0.30 | |
| 274 | 274 | 767 | ±11 | 880 | 1.7% | <0.1% | 108 tps | 0.7s | 205K | $0.30 | $1.20 | |
| 275 | 274 | 757 | ±9 | 1.9K | 4.8% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 | |
| 276 | 274 | 748 | ±15 | 2.3K | 5.1% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 | |
| 277 | 281 | 741 | ±9 | 2.6K | 2.1% | 2.7% | 21 tps | 2.2s | 6K | $6.56 | $9.38 | |
| 278 | 287 | 707 | ±10 | 1K | 2.8% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 | |
| 279 | 274 | 649 | ±31 | 515 | 6.4% | 3.1% | 44 tps | 3.8s | 131K | $2.00 | $5.00 | |
| 280 | 289 | UI-TARS 1.5 7B | 646 | ±17 | 915 | 4.2% | 4.0% | 75 tps | 0.9s | 128K | $0.10 | $0.20 |