Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 10 | 1256 | ±15 | 1.3K | 1.8% | 1.7% | 52 tps | 2.0s | 400K | $1.75 | $14.00 | |
| 82 | 4 | 1266 | ±27 | 650 | 1.5% | 1.6% | 47 tps | 1.2s | 200K | $3.00 | $15.00 | |
| 83 | 7 | 1280 | ±14 | 2.8K | 1.4% | 1.8% | 49 tps | 1.4s | 200K | $5.00 | $25.00 | |
| 84 | 2 | 1424 | ±16 | 950 | 1.0% | 2.1% | 48 tps | 1.7s | 200K | $5.00 | $25.00 | |
| 85 | 1 | 1524 | ±16 | 980 | 1.0% | 2.5% | 56 tps | 1.6s | 200K | $5.00 | $25.00 |