Update README.md
README.md
CHANGED
@@ -1,4 +1,8 @@
 ---
+license: other
+license_name: nvidia-open-model-license
+license_link: >-
+  https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
 library_name: sana, sana-sprint
 tags:
 - text-to-image
@@ -71,7 +75,6 @@ Source code is available at https://github.com/NVlabs/Sana.
 - **Model size:** 1.6B parameters
 - **Model precision:** torch.bfloat16 (BF16)
 - **Model resolution:** This model is developed to generate 1024px-based images with multi-scale height and width.
-- **License:** [NSCL v2-custom](./LICENSE.txt). Governing Terms: NVIDIA License. Additional Information: [Gemma Terms of Use | Google AI for Developers](https://ai.google.dev/gemma/terms) for Gemma-2-2B-IT, [Gemma Prohibited Use Policy | Google AI for Developers](https://ai.google.dev/gemma/prohibited_use_policy).
 - **Model Description:** This is a model that can be used to generate and modify images based on text prompts.
 It is a Linear Diffusion Transformer that uses one fixed, pretrained text encoder ([Gemma2-2B-IT](https://huggingface.co/google/gemma-2-2b-it))
 and one 32x spatial-compressed latent feature encoder ([DC-AE](https://hanlab.mit.edu/projects/dc-ae)).
@@ -85,6 +88,9 @@ For research purposes, we recommend our `generative-models` Github repository (h
 - **Demo:** https://nv-sana.mit.edu/sprint
 - **Guidance:** https://github.com/NVlabs/Sana/asset/docs/sana_sprint.md
 
+## License/Terms of Use
+
+GOVERNING TERMS: This trial service is governed by the [NVIDIA API Trial Terms of Service](https://assets.ngc.nvidia.com/products/api-catalog/legal/NVIDIA%20API%20Trial%20Terms%20of%20Service.pdf). Use of this model is governed by the [NVIDIA Open Model License Agreement](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/).
 
 ## Uses
 
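The `license_link` field added to the front matter uses YAML's folded block scalar with strip chomping (`>-`), which joins the indented continuation lines into a single string and drops the trailing newline, so the long URL can be wrapped without changing the parsed value. A minimal check of that behavior, assuming PyYAML is available:

```python
import yaml  # PyYAML; assumed available for this sketch

# The front-matter fragment added by this commit.
front_matter = """\
license: other
license_name: nvidia-open-model-license
license_link: >-
  https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
"""

meta = yaml.safe_load(front_matter)

# '>-' folds the indented line into one string and strips the final newline,
# so the value is exactly the URL.
print(meta["license_link"])
```

Without the `-` chomping indicator, a plain `>` would keep a trailing newline on the value, which link-checking tools can choke on.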