Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 159 | 981 | ±6 | 7K | 4.5% | 1.6% | 29 tps | 1.3s | 131K | $0.72 | $2.60 | |
| 2 | 128 | 1042 | ±10 | 3.3K | 5.1% | 4.2% | 61 tps | 5.9s | 262K | $0.24 | $1.03 | |
| 3 | 112 | 1070 | ±6 | 7.5K | 9.1% | 0.7% | 373 tps | 0.5s | 262K | $1.70 | $6.50 | |
| 4 | 112 | 1073 | ±5 | 35K | 6.4% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 | |
| 5 | 112 | 1074 | ±7 | 8.7K | 4.3% | 4.0% | 30 tps | 1.4s | 262K | $0.63 | $2.39 | |
| 6 | 49 | 1192 | ±6 | 20.3K | 3.4% | 2.0% | 75 tps | 1.4s | 262K | $1.15 | $8.00 | |
| 7 | 36 | 1210 | ±8 | 1.8K | 3.2% | 2.9% | 32 tps | 3.0s | 262K | $0.50 | $3.00 | |
| 8 | 19 | 1291 | ±11 | 16.5K | 3.4% | 6.5% | 33 tps | 1.7s | 262K | $0.34 | $2.57 |