nvidia
/

Wan2.2-T2V-A14B-Diffusers-FP8

@@ -1,7 +1,7 @@
 ---
 pipeline_tag: text-to-video
 base_model:
-- Wan-AI/Wan2.2-T2V-A14B
 license: apache-2.0
 library_name: Model Optimizer
 tags:
@@ -18,14 +18,14 @@ tags:
 # Model Overview
 ## Description:
-The NVIDIA Wan2.2-T2V-A14B-Diffusers FP8 model is the quantized version of Wan-AI's Wan2.2-T2V-A14B model, which is a text-to-video diffusion transformer. For more information, please check [here](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B). The NVIDIA Wan2.2-T2V-A14B-Diffusers FP8 model is quantized with [Model Optimizer](https://github.com/NVIDIA/Model-Optimizer).
 <br>
 This model is ready for commercial/non-commercial use.<br>
 ## Third-Party Community Consideration
-This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party's requirements for this application and use case; see link to Non-NVIDIA [(Wan2.2-T2V-A14B) Model Card](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B).
 ### License/Terms of Use:
 [Apache license 2.0](https://choosealicense.com/licenses/apache-2.0/)
@@ -42,7 +42,7 @@ Hugging Face 05/08/2026 via https://huggingface.co/nvidia/Wan2.2-T2V-A14B-Diffus
 ## Model Architecture:
 **Architecture Type:** Diffusion Transformer (DiT) with Mixture-of-Experts (MoE) <br>
 **Network Architecture:** Wan2.2-T2V-A14B <br>
-**This model was developed based on [Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B) <br>
 **Number of Model Parameters:** 27B total parameters, 14B active parameters per denoising step <br>
 ## Input:
@@ -116,7 +116,7 @@ trtllm-serve nvidia/Wan2.2-T2V-A14B-Diffusers-FP8 --extra_visual_gen_options ./e
 ```
 ### Model Characteristics
-The original `Wan2.2-T2V-A14B` model uses a Mixture-of-Experts design with separate high-noise and low-noise experts across denoising timesteps. This enables larger total capacity (27B parameters) while keeping the active parameters per step at roughly 14B. See the original model card for more details: [Wan-AI/Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B).
 ## Model Limitations:
 The base model was trained on internet-scale image and video data that may contain societal biases or undesirable content patterns. Therefore, the model may amplify those biases and may generate videos that are inaccurate, inconsistent with the prompt, low quality, or inappropriate, even when prompts are benign. Generated outputs can also reflect limitations in motion coherence, temporal consistency, and prompt adherence. This model is not designed for factual information generation or safety-critical applications without additional safeguards and testing.

 ---
 pipeline_tag: text-to-video
 base_model:
+- Wan-AI/Wan2.2-T2V-A14B-Diffusers
 license: apache-2.0
 library_name: Model Optimizer
 tags:
 # Model Overview
 ## Description:
+The NVIDIA Wan2.2-T2V-A14B-Diffusers FP8 model is the quantized version of Wan-AI's Wan2.2-T2V-A14B model, which is a text-to-video diffusion transformer. For more information, please check [here](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers). The NVIDIA Wan2.2-T2V-A14B-Diffusers FP8 model is quantized with [Model Optimizer](https://github.com/NVIDIA/Model-Optimizer).
 <br>
 This model is ready for commercial/non-commercial use.<br>
 ## Third-Party Community Consideration
+This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party's requirements for this application and use case; see link to Non-NVIDIA [(Wan2.2-T2V-A14B) Model Card](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers).
 ### License/Terms of Use:
 [Apache license 2.0](https://choosealicense.com/licenses/apache-2.0/)
 ## Model Architecture:
 **Architecture Type:** Diffusion Transformer (DiT) with Mixture-of-Experts (MoE) <br>
 **Network Architecture:** Wan2.2-T2V-A14B <br>
+**This model was developed based on [Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers) <br>
 **Number of Model Parameters:** 27B total parameters, 14B active parameters per denoising step <br>
 ## Input:
 ```
 ### Model Characteristics
+The original `Wan2.2-T2V-A14B` model uses a Mixture-of-Experts design with separate high-noise and low-noise experts across denoising timesteps. This enables larger total capacity (27B parameters) while keeping the active parameters per step at roughly 14B. See the original model card for more details: [Wan-AI/Wan2.2-T2V-A14B-Diffusers](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers).
 ## Model Limitations:
 The base model was trained on internet-scale image and video data that may contain societal biases or undesirable content patterns. Therefore, the model may amplify those biases and may generate videos that are inaccurate, inconsistent with the prompt, low quality, or inappropriate, even when prompts are benign. Generated outputs can also reflect limitations in motion coherence, temporal consistency, and prompt adherence. This model is not designed for factual information generation or safety-critical applications without additional safeguards and testing.