Last updated about 1 month ago
| Rank | Overall | Name | VIBE Score | Confidence Interval | Votes | Downvote % | Abort % | Speed | Latency | Context | Cost (Input) | Cost (Output) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 404 | 329 | ±41 | 700 | 4.1% | <0.1% | 25 tps | N/A | 32K | $0.99 | $0.99 | |
| 2 | 402 | 406 | ±35 | 1K | 10.9% | <0.1% | 34 tps | 3.3s | 33K | $0.02 | $0.07 | |
| 3 | 399 | Mistral Nemo 12B Inferor v0.0 | 454 | ±28 | 565 | 1.7% | <0.1% | 83 tps | 0.8s | 16K | $0.80 | $1.20 |
| 4 | 279 | 588 | ±22 | 1.6K | 9.2% | 2.3% | 67 tps | 2.0s | 33K | $0.01 | $0.01 | |
| 5 | 390 | 595 | ±19 | 1.2K | 5.6% | <0.1% | 17 tps | N/A | 32K | $0.04 | $0.04 | |
| 6 | 390 | 602 | ±40 | 685 | 11.0% | <0.1% | 85 tps | 2.2s | 120K | $0 | $0 | |
| 7 | 279 | UI-TARS 1.5 7B | 610 | ±40 | 530 | 11.7% | 4.0% | 75 tps | 0.9s | 128K | $0.10 | $0.20 |
| 8 | 386 | Shisa V2 Llama 3.3 70B | 623 | ±24 | 585 | 9.3% | <0.1% | 8 tps | 2.0s | 33K | $0.03 | $0.09 |
| 9 | 386 | 652 | ±19 | 1.1K | 2.6% | <0.1% | 20 tps | 1.5s | 16K | $0.90 | $0.90 | |
| 10 | 386 | 655 | ±13 | 1.2K | 3.5% | <0.1% | 31 tps | 0.4s | 32K | $0.49 | $0.49 | |
| 11 | 276 | 686 | ±13 | 3.8K | 5.3% | <0.1% | 31 tps | 2.8s | 1M | $0.55 | $2.20 | |
| 12 | 383 | ArliAI QwQ 32B Arliai RpR V1 | 686 | ±40 | 635 | 9.3% | <0.1% | 34 tps | 1.8s | 33K | $0.02 | $0.07 |
| 13 | 276 | 696 | ±20 | 2K | 5.5% | 6.2% | 22 tps | 1.8s | 131K | $0.37 | $0.39 | |
| 14 | 374 | 701 | ±16 | 890 | 7.8% | <0.1% | 13 tps | 0.1s | 33K | $0.03 | $0.09 | |
| 15 | 269 | 706 | ±30 | 715 | 5.9% | 2.5% | 50 tps | 1.0s | 33K | $0.06 | $0.25 | |
| 16 | 269 | 719 | ±18 | 1.5K | 4.1% | 1.1% | 33 tps | 3.4s | 8K | $2.50 | $10.00 | |
| 17 | 374 | 720 | ±19 | 530 | 6.2% | <0.1% | 13 tps | 0.6s | 33K | $0 | $0 | |
| 18 | 374 | Mistral Nemo 12B Celeste V1.9 | 725 | ±18 | 1.1K | 3.5% | <0.1% | 6 tps | 10.2s | 8K | $0.80 | $1.20 |
| 19 | 361 | 730 | ±28 | 850 | 9.6% | <0.1% | 36 tps | 1.8s | 131K | $0 | $0 | |
| 20 | 361 | 734 | ±39 | 965 | 9.8% | <0.1% | 92 tps | 1.2s | 131K | $0 | $0 | |
| 21 | 269 | 737 | ±24 | 1.5K | 5.0% | 0.6% | 50 tps | 3.2s | 8K | $2.50 | $10.00 | |
| 22 | 262 | 746 | ±20 | 2.1K | 6.0% | 5.3% | 25 tps | 3.7s | 128K | $1.01 | $2.79 | |
| 23 | 361 | 751 | ±22 | 605 | 2.4% | <0.1% | 35 tps | N/A | 32K | $0.99 | $0.99 | |
| 24 | 361 | 756 | ±16 | 1.9K | 6.3% | <0.1% | 44 tps | 1.7s | 64K | $0.63 | $0.63 | |
| 25 | 262 | 762 | ±18 | 1.3K | 4.7% | 0.7% | 176 tps | 0.4s | 33K | $0.25 | $0.25 | |
| 26 | 354 | 763 | ±21 | 770 | 7.8% | 4.2% | 77 tps | 0.4s | 66K | $0.12 | $0.20 | |
| 27 | 262 | Baichuan-M2-32B | 770 | ±30 | 740 | 10.8% | <0.1% | 32 tps | 3.3s | 131K | $0.07 | $0.07 |
| 28 | 252 | 781 | ±29 | 460 | 8.9% | 1.1% | 67 tps | 0.6s | 131K | $0.12 | $0.39 | |
| 29 | 252 | 785 | ±16 | 1.1K | 5.8% | 1.5% | 54 tps | 0.7s | 33K | $2.00 | $6.00 | |
| 30 | 252 | 787 | ±9 | 2K | 2.7% | <0.1% | 46 tps | 1.2s | 4K | $1.50 | $2.00 | |
| 31 | 252 | 793 | ±27 | 510 | 3.8% | <0.1% | 247 tps | 2.2s | 32K | $0.25 | $1.00 | |
| 32 | 346 | 795 | ±25 | 665 | 14.2% | <0.1% | 86 tps | 0.7s | 41K | $2.00 | $5.00 | |
| 33 | 252 | 797 | ±21 | 815 | 8.4% | 3.5% | 31 tps | 0.9s | 131K | $0.52 | $1.73 | |
| 34 | 252 | 801 | ±12 | 1.9K | 3.1% | 11.6% | 11 tps | 2.5s | 66K | $0.77 | $0.77 | |
| 35 | 252 | 802 | ±18 | 1.8K | 7.5% | 2.7% | 116 tps | 0.6s | 131K | $0.50 | $1.50 | |
| 36 | 346 | 804 | ±10 | 2.2K | 5.7% | <0.1% | 67 tps | 0.8s | 41K | $2.00 | $5.00 | |
| 37 | 337 | 805 | ±19 | 1K | 3.3% | <0.1% | 3 tps | N/A | 33K | $0.90 | $0.90 | |
| 38 | 337 | 813 | ±16 | 1.1K | 4.2% | <0.1% | 69 tps | 1.3s | 66K | $0.04 | $0.14 | |
| 39 | 240 | 818 | ±18 | 825 | 11.3% | <0.1% | 142 tps | 0.3s | 33K | $0.01 | $0.02 | |
| 40 | 240 | 820 | ±17 | 950 | 3.1% | 1.4% | 53 tps | 1.4s | 33K | $1.00 | $3.00 |