Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 281 | 599 | ±21 | 1K | 7.1% | 7.4% | 40 tps | 1.1s | 128K | $0.07 | $0.30 | |
| 282 | 588 | ±22 | 1.6K | 9.2% | 2.3% | 67 tps | 2.0s | 33K | $0.01 | $0.01 | |
| 283 | 573 | ±17 | 2.1K | 5.5% | 21.0% | 29 tps | 1.0s | 33K | $0.06 | $0.25 | |
| 284 | 523 | ±25 | 4.1K | 6.1% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 | |
| 285 | CodeLlama 7B Instruct Solidity | 463 | ±54 | 485 | 8.5% | 3.6% | 33 tps | 0.7s | 16K | $0.80 | $1.20 |
| 286 | 447 | ±15 | 3.4K | 12.0% | 9.7% | 30 tps | 0.9s | 128K | $0.07 | $0.30 |