Spaces:
Running
Running
Update organization README
Browse files
README.md
CHANGED
|
@@ -1,5 +1,4 @@
|
|
| 1 |
-
|
| 2 |
-
|
| 3 |
|
| 4 |
```
|
| 5 |
βββββββββββββββββββ Energy-efficient inference engine for running AI on mobile devices
|
|
@@ -95,21 +94,21 @@ graph.hard_reset();
|
|
| 95 |
|
| 96 |
## Benchmark (INT8)
|
| 97 |
|
| 98 |
-
| Device | LFM2.5-1.2B<br>(1k-Prefill/100-Decode) | LFM2.5-VL-1.6B<br>(256px-Latency & Decode) | Whisper-Small<br>(30s-audio-Latency & Decode)
|
| 99 |
|--------|--------|--------|----------|
|
| 100 |
-
| Mac M4 Pro |
|
| 101 |
| iPad/Mac M4 | - | - | - |
|
| 102 |
-
| iPhone 17 Pro |
|
| 103 |
-
| Galaxy S25 Ultra |
|
| 104 |
| Pixel 10 Pro | - | - | - |
|
| 105 |
| Vivo X200 Pro | - | - | - |
|
| 106 |
|
| 107 |
-
| Device | LFM2-350m<br>(1k-Prefill/100-Decode) | LFM2-VL-450m<br>(256px-Latency & Decode) | Moonshine-Base<br>(30s-audio-Latency & Decode)
|
| 108 |
|--------|--------|--------|----------|
|
| 109 |
| iPad/Mac M1 | - | - | - |
|
| 110 |
| iPhone 13 Mini | - | - | - |
|
| 111 |
| Galaxy A56 | - | - | - |
|
| 112 |
-
| Pixel 6a |
|
| 113 |
| Nothing CMF | - | - | - |
|
| 114 |
| Raspberry Pi 5 | - | - | - |
|
| 115 |
|
|
@@ -184,5 +183,15 @@ git clone https://github.com/cactus-compute/cactus && cd cactus && source ./setu
|
|
| 184 |
8. [University of Michigan Ann-Arbor MSAIL](https://msail.github.io/)
|
| 185 |
9. [University of Colorado Boulder's AI Club](https://www.cuaiclub.org/)
|
| 186 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 187 |
## Join The Community
|
| 188 |
- [Reddit Channel](https://www.reddit.com/r/cactuscompute/)
|
|
|
|
| 1 |
+
<img src="assets/banner.jpg" alt="Logo" style="border-radius: 30px; width: 100%;">
|
|
|
|
| 2 |
|
| 3 |
```
|
| 4 |
βββββββββββββββββββ Energy-efficient inference engine for running AI on mobile devices
|
|
|
|
| 94 |
|
| 95 |
## Benchmark (INT8)
|
| 96 |
|
| 97 |
+
| Device | LFM2.5-1.2B<br>(1k-Prefill/100-Decode) | LFM2.5-VL-1.6B<br>(256px-Latency & Decode) | Whisper-Small-244m<br>(30s-audio-Latency & Decode)
|
| 98 |
|--------|--------|--------|----------|
|
| 99 |
+
| Mac M4 Pro | 582tps/77tps (76MB RAM) | 0.2s/76tps (87MB RAM) | 0.1s/111tps (73MB RAM) |
|
| 100 |
| iPad/Mac M4 | - | - | - |
|
| 101 |
+
| iPhone 17 Pro | 300tps/33tps (108MB RAM)| 0.3s/33tps (156MB RAM) | 0.3s/114tps (177MB RAM)|
|
| 102 |
+
| Galaxy S25 Ultra | 226tps/36tps (1.2GB RAM) | 2.6s/33tps (2GB RAM) | 2.3s/90tps (363MB RAM) |
|
| 103 |
| Pixel 10 Pro | - | - | - |
|
| 104 |
| Vivo X200 Pro | - | - | - |
|
| 105 |
|
| 106 |
+
| Device | LFM2-350m<br>(1k-Prefill/100-Decode) | LFM2-VL-450m<br>(256px-Latency & Decode) | Moonshine-Base-67m<br>(30s-audio-Latency & Decode)
|
| 107 |
|--------|--------|--------|----------|
|
| 108 |
| iPad/Mac M1 | - | - | - |
|
| 109 |
| iPhone 13 Mini | - | - | - |
|
| 110 |
| Galaxy A56 | - | - | - |
|
| 111 |
+
| Pixel 6a | 218tps/44tps (395MB RAM)| 2.5s/36tps (631MB RAM) | 1.5s/189tps (111MB RAM)|
|
| 112 |
| Nothing CMF | - | - | - |
|
| 113 |
| Raspberry Pi 5 | - | - | - |
|
| 114 |
|
|
|
|
| 183 |
8. [University of Michigan Ann-Arbor MSAIL](https://msail.github.io/)
|
| 184 |
9. [University of Colorado Boulder's AI Club](https://www.cuaiclub.org/)
|
| 185 |
|
| 186 |
+
## Contributing to Cactus
|
| 187 |
+
|
| 188 |
+
- **C++ Standard**: Use C++20 features where appropriate
|
| 189 |
+
- **Formatting**: Follow the existing code style in the project, one header per folder.
|
| 190 |
+
- **Comments**: Avoid comments, make your code read like plain english
|
| 191 |
+
- **AI-Generated Code**: Do not bindly PR AI slop, this codebase is very complex, they miss details.
|
| 192 |
+
- **Update docs**: Please update docs when necessary, be intuitive and straightforward.
|
| 193 |
+
- **Keep It Simple**: Do not go beyond the scope of the GH issue, avoid bloated PRs, keep codes lean.
|
| 194 |
+
- **Benchmark Your Changes**: Test performance impact, Cactus is performance-critical.
|
| 195 |
+
|
| 196 |
## Join The Community
|
| 197 |
- [Reddit Channel](https://www.reddit.com/r/cactuscompute/)
|