Update README.md
Browse files
README.md
CHANGED
|
@@ -4,7 +4,7 @@ tags:
|
|
| 4 |
---
|
| 5 |
# Sparse MPT-7B-Chat - DeepSparse
|
| 6 |
|
| 7 |
-
Chat-aligned MPT 7b model pruned to 50% and quantized for inference with DeepSparse
|
| 8 |
|
| 9 |
```python
|
| 10 |
from deepsparse import TextGeneration
|
|
|
|
| 4 |
---
|
| 5 |
# Sparse MPT-7B-Chat - DeepSparse
|
| 6 |
|
| 7 |
+
[Chat-aligned MPT 7b model](https://huggingface.co/mosaicml/mpt-7b-chat) pruned to 50% and quantized using SparseGPT for inference with DeepSparse
|
| 8 |
|
| 9 |
```python
|
| 10 |
from deepsparse import TextGeneration
|