Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -67,6 +67,8 @@ Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated sys
|
|
| 67 |
**Preferred Operating System(s):** <br>
|
| 68 |
* Linux <br>
|
| 69 |
|
|
|
|
|
|
|
| 70 |
## Model Version(s):
|
| 71 |
The model version is NVFP4 1.0 version and is quantized with nvidia-modelopt **v0.42.0** <br>
|
| 72 |
|
|
@@ -96,7 +98,7 @@ The model version is NVFP4 1.0 version and is quantized with nvidia-modelopt **v
|
|
| 96 |
|
| 97 |
|
| 98 |
## Inference:
|
| 99 |
-
**Engine:** SGLang <br>
|
| 100 |
**Test Hardware:** B200 <br>
|
| 101 |
|
| 102 |
## Post Training Quantization
|
|
|
|
| 67 |
**Preferred Operating System(s):** <br>
|
| 68 |
* Linux <br>
|
| 69 |
|
| 70 |
+
The integration of foundation and fine-tuned models into AI systems requires additional testing using use-case-specific data to ensure safe and effective deployment. Following the V-model methodology, iterative testing and validation at both unit and system levels are essential to mitigate risks, meet technical and functional requirements, and ensure compliance with safety and ethical standards before deployment.
|
| 71 |
+
|
| 72 |
## Model Version(s):
|
| 73 |
The model version is NVFP4 1.0 version and is quantized with nvidia-modelopt **v0.42.0** <br>
|
| 74 |
|
|
|
|
| 98 |
|
| 99 |
|
| 100 |
## Inference:
|
| 101 |
+
**Acceleration Engine:** SGLang <br>
|
| 102 |
**Test Hardware:** B200 <br>
|
| 103 |
|
| 104 |
## Post Training Quantization
|