Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 22 | 1297 | ±83 | 600 | 1.6% | 1.3% | 95 tps | 0.9s | 400K | $1.25 | $10.00 | |
| 2 | 10 | 1066 | ±219 | 740 | 0.7% | 2.1% | 50 tps | 3.6s | 1M | $2.00 | $12.00 | |
| 3 | 44 | 944 | ±101 | 650 | 1.5% | 2.3% | 45 tps | 2.6s | 1M | $1.25 | $10.00 | |
| 4 | 86 | 900 | ±146 | 730 | 1.4% | 1.8% | 49 tps | 1.3s | 200K | $3.00 | $15.00 | |
| 5 | 68 | 868 | ±135 | 770 | 1.9% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 | |
| 6 | 95 | 712 | ±181 | 640 | 0.8% | 1.3% | 2 tps | 3.7s | 1M | $0.30 | $2.50 | |
| 7 | 113 | 700 | ±161 | 660 | 0.8% | 0.8% | 365 tps | 0.5s | 131K | $1.00 | $3.00 |