Filter model performance by the number of turns in a conversation.
Filter the leaderboard to show only models with an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 157 | | 767 | ±23 | 810 | 2.4% | 0.6% | 175 tps | 1.3s | 256K | $0.21 | $2.26 |
| 82 | 160 | | 722 | ±33 | 700 | 2.1% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 |
| 83 | 161 | | 719 | ±27 | 1K | 2.9% | 1.2% | 88 tps | 2.4s | 1M | $0.23 | $0.83 |
| 84 | 177 | | 676 | ±27 | 690 | 1.4% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 |
| 85 | 175 | | 659 | ±21 | 505 | 2.9% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 |