---
base_model: nvidia/Cosmos-Predict2-14B-Text2Image
library_name: gguf
quantized_by: city96
tags:
  - nvidia
  - cosmos
  - diffusers
license: other
license_name: nvidia-open-model-license
license_link: >-
  https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license
---

This is a direct GGUF conversion of [nvidia/Cosmos-Predict2-14B-Text2Image](https://huggingface.co/nvidia/Cosmos-Predict2-14B-Text2Image).

The model files can be used in [ComfyUI](https://github.com/comfyanonymous/ComfyUI/) with the [ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF) custom node. Place the required model(s) in the following folders:

| Type         | Name                           | Location                          | Download         |
| ------------ | ------------------------------ | --------------------------------- | ---------------- |
| Main Model   | Cosmos-Predict2-14B-Text2Image | `ComfyUI/models/diffusion_models` | GGUF (this repo) |
| Text Encoder | (old) T5-XXL-Encoder           | `ComfyUI/models/text_encoders`    | [Safetensors](https://huggingface.co/comfyanonymous/cosmos_1.0_text_encoder_and_VAE_ComfyUI/tree/main/text_encoders) |
| VAE          | Wan 2.1 VAE                    | `ComfyUI/models/vae`              | [Safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/vae/wan_2.1_vae.safetensors) |
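
The folder layout in the table can be checked with a small script. This is a minimal sketch, not part of ComfyUI or ComfyUI-GGUF: the names `MODEL_DIRS`, `target_path`, and `missing_files` are hypothetical helpers introduced here for illustration, and the filenames you pass in depend on which quantization and text encoder file you actually downloaded.

```python
from pathlib import Path

# Sub-folders from the table above, relative to the ComfyUI root.
# The dictionary keys are illustrative labels, not ComfyUI identifiers.
MODEL_DIRS = {
    "main_model": Path("models/diffusion_models"),
    "text_encoder": Path("models/text_encoders"),
    "vae": Path("models/vae"),
}


def target_path(comfy_root, kind, filename):
    """Return the location where a downloaded file of this kind belongs."""
    return Path(comfy_root) / MODEL_DIRS[kind] / filename


def missing_files(comfy_root, files):
    """Return the kinds whose file is not present in its expected folder.

    `files` maps a kind ("main_model", "text_encoder", "vae") to the
    filename you downloaded for it.
    """
    return [
        kind
        for kind, name in files.items()
        if not target_path(comfy_root, kind, name).is_file()
    ]
```

For example, `missing_files("ComfyUI", {"vae": "wan_2.1_vae.safetensors"})` returns `["vae"]` until the VAE file has been placed in `ComfyUI/models/vae`.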

[**Example workflow**](media/cosmos-predict2-14b-text2image_workflow.json) - based on the [official example workflow](https://github.com/comfyanonymous/ComfyUI/pull/8517)

[**Example outputs**](media/cosmos-predict2-14b-text2image.png) - sample size of 1, not strictly representative

![sample](media/cosmos-predict2-14b-text2image.png)

### Notes

*As this is a quantized model, not a finetune, the same restrictions and original license terms still apply.*