Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 121 | 170 | 793 | ±28 | 475 | 5.9% | 2.8% | 141 tps | 0.7s | 33K | $0.02 | $0.08 | |
| 122 | 148 | 787 | ±14 | 655 | 4.4% | 0.5% | 124 tps | 1.2s | 131K | $0.16 | $1.70 | |
| 123 | 160 | 782 | ±15 | 1.6K | 4.1% | 0.6% | 88 tps | 5.1s | 131K | $0.18 | $0.46 | |
| 124 | 214 | 778 | ±17 | 630 | 3.1% | 2.4% | 231 tps | 10.5s | 200K | $1.10 | $4.40 | |
| 125 | 177 | 777 | ±12 | 2K | 3.6% | 0.8% | 143 tps | 3.3s | 200K | $1.10 | $4.40 | |
| 126 | 229 | 765 | ±18 | 610 | 3.9% | 4.0% | 58 tps | 0.9s | 131K | $2.00 | $5.00 | |
| 127 | 175 | 752 | ±17 | 1.4K | 4.3% | 0.7% | 139 tps | 1.5s | 200K | $1.10 | $4.40 | |
| 128 | 186 | 724 | ±15 | 1.3K | 4.3% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 | |
| 129 | 194 | 719 | ±18 | 550 | 3.5% | 0.3% | 500 tps | 0.5s | 8K | $0.48 | $0.66 | |
| 130 | 274 | 674 | ±33 | 720 | 5.9% | 2.2% | 101 tps | 1.2s | 131K | $0.08 | $0.08 | |
| 131 | 265 | 635 | ±34 | 505 | 5.6% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 | |
| 132 | 179 | 613 | ±25 | 500 | 6.5% | 0.4% | 257 tps | 1.1s | 32K | $0.25 | $1.00 | |
| 133 | 288 | 373 | ±46 | 955 | 7.7% | 3.0% | 44 tps | 2.5s | 128K | $0.21 | $0.63 |