Update README.md
README.md CHANGED
@@ -6,7 +6,7 @@ base_model:
 
 ## Model Details
 
-This model card is for mxfp8/nvfp4 quantization of [meta-llama/Llama-3.1-
+This model card is for mxfp8/nvfp4 quantization of [meta-llama/Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) based on [intel/auto-round](https://github.com/intel/auto-round).
 The models are not able to be published due to license limitation. Please follow the INC example README to generate and evaluate the low precision models.
 
 ## How to Use