Filter model performance by the number of turns in a conversation.
Filter the leaderboard to only show models that have an open license.
Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 31 | 1239 | ±6 | 9.4K | 5.4% | 0.9% | 101 tps | 0.5s | 2M | $0.20 | $0.50 | |
| 2 | 49 | 1185 | ±5 | 8.1K | 7.1% | 1.5% | 93 tps | 0.6s | 2M | $0.27 | $0.67 | |
| 3 | 60 | 1178 | ±7 | 39.5K | 4.4% | 1.5% | 58 tps | 7.3s | 2M | $0.20 | $0.50 | |
| 4 | 60 | 1177 | ±3 | 14.5K | 5.0% | 2.1% | 102 tps | 3.1s | 2M | $0.30 | $0.75 | |
| 5 | 60 | 1167 | ±22 | 1.2K | 4.1% | 1.1% | 77 tps | 4.5s | 2M | $2.00 | $5.50 | |
| 6 | 77 | 1129 | ±19 | 945 | 3.6% | 1.2% | 56 tps | 8.8s | 2M | $2.00 | $6.00 | |
| 7 | 77 | 1125 | ±3 | 39.6K | 4.4% | 3.9% | 29 tps | 11.1s | 256K | $3.00 | $15.00 | |
| 8 | 90 | 1102 | ±14 | 2.5K | 4.7% | 1.7% | 52 tps | 2.4s | 131K | $5.00 | $25.00 | |
| 9 | 98 | 1098 | ±4 | 19.1K | 5.5% | 1.5% | 53 tps | 0.6s | 1M | $3.67 | $18.33 | |
| 10 | 112 | 1063 | ±36 | 500 | 4.8% | 1.1% | 151 tps | 0.6s | 2M | $2.00 | $6.00 | |
| 11 | 159 | 987 | ±9 | 2.5K | 6.0% | 5.9% | 294 tps | 0.5s | 256K | $0.20 | $1.50 | |
| 12 | 179 | 943 | ±7 | 9K | 7.0% | 1.6% | 44 tps | 0.5s | 131K | $0.60 | $4.00 | |
| 13 | 189 | 927 | ±6 | 9.9K | 6.4% | 1.2% | 43 tps | 0.5s | 131K | $0.30 | $0.50 |