DuoNeural
/

Cosmos3-Nano-GPTQ-4bit

Cosmos3OmniPipeline

video-generation

mixture-of-transformers

4-bit precision

Model card Files Files and versions

Cosmos3-Nano-GPTQ-4bit

11.1 GB

Ctrl+K

Ctrl+K

1 contributor

History: 7 commits

DuoNeural's picture

Add Kilonova UMA inference test: 14.4GB VRAM, int32 nibble format, path key fix, AMD ROCm flags

68569a0 verified about 2 months ago

.gitattributes

1.52 kB
initial commit about 2 months ago
README.md

12.3 kB
Add Kilonova UMA inference test: 14.4GB VRAM, int32 nibble format, path key fix, AMD ROCm flags about 2 months ago
config.json

1.8 kB
DuoNeural Cosmos3-Nano GPTQ int4: custom nibble-packed, 2.74x compression (30.3GB→11.1GB), 330 linear layers quantized about 2 months ago
model-00001-packed.safetensors

5.36 GB
xet

DuoNeural Cosmos3-Nano GPTQ int4: custom nibble-packed, 2.74x compression (30.3GB→11.1GB), 330 linear layers quantized about 2 months ago
model-00002-packed.safetensors

5.36 GB
xet

DuoNeural Cosmos3-Nano GPTQ int4: custom nibble-packed, 2.74x compression (30.3GB→11.1GB), 330 linear layers quantized about 2 months ago
model-00003-packed.safetensors

335 MB
xet

DuoNeural Cosmos3-Nano GPTQ int4: custom nibble-packed, 2.74x compression (30.3GB→11.1GB), 330 linear layers quantized about 2 months ago
model_index.json

514 Bytes
Add model_index.json (transformer-only quantized release) about 2 months ago