---
license: apache-2.0
tags:
- anima
---

# Anima in diffusers

Anima weights for the diffusers environment. Copy "[anima-preview.safetensors](https://huggingface.co/circlestone-labs/Anima/tree/main/split_files/diffusion_models)" into the "transformer" directory.

I'm aware that other diffusers-compatible code has been uploaded elsewhere. However, if its llm_adapter cannot handle padded tokens, that implementation is incorrect. This repo relies on the assumption that the adapter model can handle padded input tokens, as it should.

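To make the padding point concrete, here is a minimal, self-contained sketch. The linear layer is a hypothetical stand-in for the adapter, not the actual llm_adapter: a token-wise module followed by pooling yields identical results on unpadded and padded input, as long as pad positions are masked out.

```python
import torch

torch.manual_seed(0)
adapter = torch.nn.Linear(8, 8)  # hypothetical stand-in for a token-wise adapter

tokens = torch.randn(1, 5, 8)                              # 5 real tokens
padded = torch.cat([tokens, torch.zeros(1, 3, 8)], dim=1)  # same tokens + 3 pad tokens
mask = torch.tensor([[1, 1, 1, 1, 1, 0, 0, 0]])            # 1 = real token, 0 = pad

def masked_mean(x, m):
    # Pool only over real tokens; pad positions contribute nothing.
    m = m.unsqueeze(-1).to(x.dtype)
    return (x * m).sum(dim=1) / m.sum(dim=1)

out_unpadded = adapter(tokens).mean(dim=1)
out_masked = masked_mean(adapter(padded), mask)
print(torch.allclose(out_unpadded, out_masked, atol=1e-6))  # True
```

An adapter that attends to or pools over the pad positions would produce a different embedding for the padded batch, which is the failure mode described above.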
## Inference

```python
import torch

from cosmos_predict2 import CosmosPredict2Pipeline

pipeline = CosmosPredict2Pipeline('/path/to/this/bundle')
# Unlike in diffusers, we don't call a single long forward function here.
text = 'your prompt here'
qwen_embeds, t5_input_ids = pipeline.prepare_text_embeds(text)
crossattn_emb = pipeline.transformer.preprocess_text_embeds(qwen_embeds, t5_input_ids)
pipeline.transformer.to('cuda')
# Sampling with FlowMatchEulerDiscreteScheduler.
output = pipeline.sample_fm(
    torch.randn((1, 16, 1, 64, 64), device='cuda', dtype=torch.bfloat16),
    num_inference_steps=30,
    crossattn_emb=crossattn_emb.to('cuda'),
)
pipeline.vae.to('cuda')
image = pipeline.decode_vae(output)
```
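To view or save the decoded `image`, you typically convert it to 8-bit RGB. A hedged sketch follows; the value range and layout are assumptions (`decode_vae` is assumed to return floats in [-1, 1] with shape (B, C, H, W)), and the random tensor stands in for the real output:

```python
import torch

image = torch.rand(1, 3, 64, 64) * 2 - 1  # stand-in for pipeline.decode_vae(output)
rgb = ((image[0].permute(1, 2, 0).clamp(-1, 1) + 1) * 127.5).round().to(torch.uint8)
print(rgb.shape)  # torch.Size([64, 64, 3])
```

The resulting HWC uint8 tensor can be handed to `PIL.Image.fromarray(rgb.numpy())` for saving.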