royleibov
/

Jamba-v0.1-ZipNN-Compressed

Text Generation

Mixture of Experts

Model card Files Files and versions

royleibov commited on Sep 15, 2024

Commit

ef9dc41

·

verified ·

1 Parent(s): 279671a

Add ZipNN text

Files changed (1) hide show

README.md +14 -0

README.md CHANGED Viewed

@@ -38,6 +38,20 @@ zipnn_hf()
 tokenizer = AutoTokenizer.from_pretrained("royleibov/Jamba-v0.1-ZipNN-Compressed", trust_remote_code=True)
 model = AutoModelForCausalLM.from_pretrained("royleibov/Jamba-v0.1-ZipNN-Compressed", trust_remote_code=True)
 ```
 # Model Card for Jamba

 tokenizer = AutoTokenizer.from_pretrained("royleibov/Jamba-v0.1-ZipNN-Compressed", trust_remote_code=True)
 model = AutoModelForCausalLM.from_pretrained("royleibov/Jamba-v0.1-ZipNN-Compressed", trust_remote_code=True)
 ```
+### ZipNN
+ZipNN also allows you to seemlessly save local disk space in your cache after the model is downloaded.
+To compress the cached model, simply run:
+```bash
+python zipnn_compress_path.py safetensors --model royleibov/Jamba-v0.1-ZipNN-Compressed --hf_cache
+```
+The model will be decompressed automatically and safely as long as `zipnn_hf()` is added at the top of the file like in the [example above](#use-this-model).
+To decompress manualy, simply run:
+```bash
+python zipnn_decompress_path.py --model royleibov/Jamba-v0.1-ZipNN-Compressed --hf_cache
+```
 # Model Card for Jamba