jingyux-nv commited on
Commit
de6d036
·
verified ·
1 Parent(s): 0da675c

Point base-model links to Wan-AI/Wan2.2-T2V-A14B-Diffusers

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -1,7 +1,7 @@
1
  ---
2
  pipeline_tag: text-to-video
3
  base_model:
4
- - Wan-AI/Wan2.2-T2V-A14B
5
  license: apache-2.0
6
  library_name: Model Optimizer
7
  tags:
@@ -18,14 +18,14 @@ tags:
18
  # Model Overview
19
 
20
  ## Description:
21
- The NVIDIA Wan2.2-T2V-A14B-Diffusers FP8 model is the quantized version of Wan-AI's Wan2.2-T2V-A14B model, which is a text-to-video diffusion transformer. For more information, please check [here](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B). The NVIDIA Wan2.2-T2V-A14B-Diffusers FP8 model is quantized with [Model Optimizer](https://github.com/NVIDIA/Model-Optimizer).
22
  <br>
23
 
24
  This model is ready for commercial/non-commercial use.<br>
25
 
26
 
27
  ## Third-Party Community Consideration
28
- This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party's requirements for this application and use case; see link to Non-NVIDIA [(Wan2.2-T2V-A14B) Model Card](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B).
29
 
30
  ### License/Terms of Use:
31
  [Apache license 2.0](https://choosealicense.com/licenses/apache-2.0/)
@@ -42,7 +42,7 @@ Hugging Face 05/08/2026 via https://huggingface.co/nvidia/Wan2.2-T2V-A14B-Diffus
42
  ## Model Architecture:
43
  **Architecture Type:** Diffusion Transformer (DiT) with Mixture-of-Experts (MoE) <br>
44
  **Network Architecture:** Wan2.2-T2V-A14B <br>
45
- **This model was developed based on [Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B) <br>
46
  **Number of Model Parameters:** 27B total parameters, 14B active parameters per denoising step <br>
47
 
48
  ## Input:
@@ -116,7 +116,7 @@ trtllm-serve nvidia/Wan2.2-T2V-A14B-Diffusers-FP8 --extra_visual_gen_options ./e
116
  ```
117
 
118
  ### Model Characteristics
119
- The original `Wan2.2-T2V-A14B` model uses a Mixture-of-Experts design with separate high-noise and low-noise experts across denoising timesteps. This enables larger total capacity (27B parameters) while keeping the active parameters per step at roughly 14B. See the original model card for more details: [Wan-AI/Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B).
120
 
121
  ## Model Limitations:
122
  The base model was trained on internet-scale image and video data that may contain societal biases or undesirable content patterns. Therefore, the model may amplify those biases and may generate videos that are inaccurate, inconsistent with the prompt, low quality, or inappropriate, even when prompts are benign. Generated outputs can also reflect limitations in motion coherence, temporal consistency, and prompt adherence. This model is not designed for factual information generation or safety-critical applications without additional safeguards and testing.
 
1
  ---
2
  pipeline_tag: text-to-video
3
  base_model:
4
+ - Wan-AI/Wan2.2-T2V-A14B-Diffusers
5
  license: apache-2.0
6
  library_name: Model Optimizer
7
  tags:
 
18
  # Model Overview
19
 
20
  ## Description:
21
+ The NVIDIA Wan2.2-T2V-A14B-Diffusers FP8 model is the quantized version of Wan-AI's Wan2.2-T2V-A14B model, which is a text-to-video diffusion transformer. For more information, please check [here](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers). The NVIDIA Wan2.2-T2V-A14B-Diffusers FP8 model is quantized with [Model Optimizer](https://github.com/NVIDIA/Model-Optimizer).
22
  <br>
23
 
24
  This model is ready for commercial/non-commercial use.<br>
25
 
26
 
27
  ## Third-Party Community Consideration
28
+ This model is not owned or developed by NVIDIA. This model has been developed and built to a third-party's requirements for this application and use case; see link to Non-NVIDIA [(Wan2.2-T2V-A14B) Model Card](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers).
29
 
30
  ### License/Terms of Use:
31
  [Apache license 2.0](https://choosealicense.com/licenses/apache-2.0/)
 
42
  ## Model Architecture:
43
  **Architecture Type:** Diffusion Transformer (DiT) with Mixture-of-Experts (MoE) <br>
44
  **Network Architecture:** Wan2.2-T2V-A14B <br>
45
+ **This model was developed based on [Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers) <br>
46
  **Number of Model Parameters:** 27B total parameters, 14B active parameters per denoising step <br>
47
 
48
  ## Input:
 
116
  ```
117
 
118
  ### Model Characteristics
119
+ The original `Wan2.2-T2V-A14B` model uses a Mixture-of-Experts design with separate high-noise and low-noise experts across denoising timesteps. This enables larger total capacity (27B parameters) while keeping the active parameters per step at roughly 14B. See the original model card for more details: [Wan-AI/Wan2.2-T2V-A14B-Diffusers](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers).
120
 
121
  ## Model Limitations:
122
  The base model was trained on internet-scale image and video data that may contain societal biases or undesirable content patterns. Therefore, the model may amplify those biases and may generate videos that are inaccurate, inconsistent with the prompt, low quality, or inappropriate, even when prompts are benign. Generated outputs can also reflect limitations in motion coherence, temporal consistency, and prompt adherence. This model is not designed for factual information generation or safety-critical applications without additional safeguards and testing.