Update README.md
README.md CHANGED
@@ -66,6 +66,10 @@ vLLM also supports OpenAI-compatible serving. See the [documentation](https://do
 This model was created with [llm-compressor](https://github.com/vllm-project/llm-compressor) by running the code snippet below.
 
 
+```bash
+python quantize.py --model_id ibm-granite/granite-3.1-2b-base --save_path "output_dir/"
+```
+
 ```python
 import argparse
 from transformers import AutoModelForCausalLM, AutoTokenizer