jakmro commited on
Commit
1996fb7
Β·
verified Β·
1 Parent(s): 070e145

Update organization README

Browse files
Files changed (1) hide show
  1. README.md +17 -8
README.md CHANGED
@@ -1,5 +1,4 @@
1
- ![banner](https://cdn-uploads.huggingface.co/production/uploads/6690e4cacadc8dd5b9008614/Iw9rww1TDVTT9LS7kGLiv.jpeg)
2
-
3
 
4
  ```
5
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” Energy-efficient inference engine for running AI on mobile devices
@@ -95,21 +94,21 @@ graph.hard_reset();
95
 
96
  ## Benchmark (INT8)
97
 
98
- | Device | LFM2.5-1.2B<br>(1k-Prefill/100-Decode) | LFM2.5-VL-1.6B<br>(256px-Latency & Decode) | Whisper-Small<br>(30s-audio-Latency & Decode)
99
  |--------|--------|--------|----------|
100
- | Mac M4 Pro | 582/77 tps| 0.2s & 76tps | 0.1s & 111tps |
101
  | iPad/Mac M4 | - | - | - |
102
- | iPhone 17 Pro | 300/33 tps | 0.3s & 33tps | 0.6s & 114tps |
103
- | Galaxy S25 Ultra | 226/36 tps | 2.6s & 33tps | 2.3s & 90tps |
104
  | Pixel 10 Pro | - | - | - |
105
  | Vivo X200 Pro | - | - | - |
106
 
107
- | Device | LFM2-350m<br>(1k-Prefill/100-Decode) | LFM2-VL-450m<br>(256px-Latency & Decode) | Moonshine-Base<br>(30s-audio-Latency & Decode)
108
  |--------|--------|--------|----------|
109
  | iPad/Mac M1 | - | - | - |
110
  | iPhone 13 Mini | - | - | - |
111
  | Galaxy A56 | - | - | - |
112
- | Pixel 6a | 218/44 tps | 2.5s & 36 tps | 1.5s & 189 tps |
113
  | Nothing CMF | - | - | - |
114
  | Raspberry Pi 5 | - | - | - |
115
 
@@ -184,5 +183,15 @@ git clone https://github.com/cactus-compute/cactus && cd cactus && source ./setu
184
  8. [University of Michigan Ann-Arbor MSAIL](https://msail.github.io/)
185
  9. [University of Colorado Boulder's AI Club](https://www.cuaiclub.org/)
186
 
 
 
 
 
 
 
 
 
 
 
187
  ## Join The Community
188
  - [Reddit Channel](https://www.reddit.com/r/cactuscompute/)
 
1
+ <img src="assets/banner.jpg" alt="Logo" style="border-radius: 30px; width: 100%;">
 
2
 
3
  ```
4
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β” Energy-efficient inference engine for running AI on mobile devices
 
94
 
95
  ## Benchmark (INT8)
96
 
97
+ | Device | LFM2.5-1.2B<br>(1k-Prefill/100-Decode) | LFM2.5-VL-1.6B<br>(256px-Latency & Decode) | Whisper-Small-244m<br>(30s-audio-Latency & Decode)
98
  |--------|--------|--------|----------|
99
+ | Mac M4 Pro | 582tps/77tps (76MB RAM) | 0.2s/76tps (87MB RAM) | 0.1s/111tps (73MB RAM) |
100
  | iPad/Mac M4 | - | - | - |
101
+ | iPhone 17 Pro | 300tps/33tps (108MB RAM)| 0.3s/33tps (156MB RAM) | 0.3s/114tps (177MB RAM)|
102
+ | Galaxy S25 Ultra | 226tps/36tps (1.2GB RAM) | 2.6s/33tps (2GB RAM) | 2.3s/90tps (363MB RAM) |
103
  | Pixel 10 Pro | - | - | - |
104
  | Vivo X200 Pro | - | - | - |
105
 
106
+ | Device | LFM2-350m<br>(1k-Prefill/100-Decode) | LFM2-VL-450m<br>(256px-Latency & Decode) | Moonshine-Base-67m<br>(30s-audio-Latency & Decode)
107
  |--------|--------|--------|----------|
108
  | iPad/Mac M1 | - | - | - |
109
  | iPhone 13 Mini | - | - | - |
110
  | Galaxy A56 | - | - | - |
111
+ | Pixel 6a | 218tps/44tps (395MB RAM)| 2.5s/36tps (631MB RAM) | 1.5s/189tps (111MB RAM)|
112
  | Nothing CMF | - | - | - |
113
  | Raspberry Pi 5 | - | - | - |
114
 
 
183
  8. [University of Michigan Ann-Arbor MSAIL](https://msail.github.io/)
184
  9. [University of Colorado Boulder's AI Club](https://www.cuaiclub.org/)
185
 
186
+ ## Contributing to Cactus
187
+
188
+ - **C++ Standard**: Use C++20 features where appropriate
189
+ - **Formatting**: Follow the existing code style in the project, one header per folder.
190
+ - **Comments**: Avoid comments, make your code read like plain english
191
+ - **AI-Generated Code**: Do not bindly PR AI slop, this codebase is very complex, they miss details.
192
+ - **Update docs**: Please update docs when necessary, be intuitive and straightforward.
193
+ - **Keep It Simple**: Do not go beyond the scope of the GH issue, avoid bloated PRs, keep codes lean.
194
+ - **Benchmark Your Changes**: Test performance impact, Cactus is performance-critical.
195
+
196
  ## Join The Community
197
  - [Reddit Channel](https://www.reddit.com/r/cactuscompute/)