Commit · 08d7494
1 Parent(s): fc9184a
Update README.md
README.md CHANGED
@@ -80,9 +80,10 @@ There are many factors to consider when choosing a university. Here are some tip
 
 ```bash
 git clone https://github.com/neuralmagic/sparseml
-pip install -e "sparseml[transformers]"
+pip install -e "sparseml[transformers]"
 python sparseml/src/sparseml/transformers/sparsification/obcq/obcq.py TinyLlama/TinyLlama-1.1B-Chat-v0.4 open_platypus --recipe recipe.yaml --save True
 python sparseml/src/sparseml/transformers/sparsification/obcq/export.py --task text-generation --model_path obcq_deployment
 cp deployment/model.onnx deployment/model-orig.onnx
+wget https://huggingface.co/nm-testing/TinyLlama-1.1B-Chat-v0.4-pruned50-quant/raw/main/recipe.yaml # download recipe
 python onnx_kv_inject.py --input-file deployment/model-orig.onnx --output-file deployment/model.onnx
 ```