Upload folder using huggingface_hub

e13f5a4 verified about 1 year ago

4.42 kB

	# Prince Xiang HunyuanVideo LoRA

	This repository contains the necessary setup and scripts to generate videos using the HunyuanVideo model with a LoRA (Low-Rank Adaptation) fine-tuned for Prince Xiang. Below are the instructions to install dependencies, download models, and run the demo.

	---

	## Installation

	### Step 1: Install System Dependencies
	Run the following command to install required system packages:
	```bash
	sudo apt-get update && sudo apt-get install git-lfs ffmpeg cbm
	```

	### Step 2: Clone the Repository
	Clone the repository and navigate to the project directory:
	```bash
	git clone https://huggingface.co/svjack/Prince_Xiang_ConsistentID_HunyuanVideo_lora
	cd Prince_Xiang_ConsistentID_HunyuanVideo_lora
	```

	### Step 3: Install Python Dependencies
	Install the required Python packages:
	```bash
	conda create -n py310 python=3.10
	conda activate py310
	pip install ipykernel
	python -m ipykernel install --user --name py310 --display-name "py310"

	pip install -r requirements.txt
	pip install ascii-magic matplotlib tensorboard huggingface_hub
	pip install moviepy==1.0.3
	pip install sageattention==1.0.6

	pip install torch==2.5.0 torchvision
	```

	---

	## Download Models

	### Step 1: Download HunyuanVideo Model
	Download the HunyuanVideo model and place it in the `ckpts` directory:
	```bash
	huggingface-cli download tencent/HunyuanVideo --local-dir ./ckpts
	```

	### Step 2: Download LLaVA Model
	Download the LLaVA model and preprocess it:
	```bash
	cd ckpts
	huggingface-cli download xtuner/llava-llama-3-8b-v1_1-transformers --local-dir ./llava-llama-3-8b-v1_1-transformers
	wget https://raw.githubusercontent.com/Tencent/HunyuanVideo/refs/heads/main/hyvideo/utils/preprocess_text_encoder_tokenizer_utils.py
	python preprocess_text_encoder_tokenizer_utils.py --input_dir llava-llama-3-8b-v1_1-transformers --output_dir text_encoder
	```

	### Step 3: Download CLIP Model
	Download the CLIP model for the text encoder:
	```bash
	huggingface-cli download openai/clip-vit-large-patch14 --local-dir ./text_encoder_2
	```

	---

	## Demo

	### Generate Video 1: Prince Xiang
	Run the following command to generate a video of Prince Xiang:
	```bash
	python hv_generate_video.py \
	--fp8 \
	--video_size 544 960 \
	--video_length 60 \
	--infer_steps 30 \
	--prompt "Unreal 5 render of a handsome man img. warm atmosphere, at home, bedroom. a small fishing village on a pier in the background." \
	--save_path . \
	--output_type both \
	--dit ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \
	--attn_mode sdpa \
	--vae ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt \
	--vae_chunk_size 32 \
	--vae_spatial_tile_sample_min_size 128 \
	--text_encoder1 ckpts/text_encoder \
	--text_encoder2 ckpts/text_encoder_2 \
	--seed 1234 \
	--lora_multiplier 1.0 \
	--lora_weight Xiang_Consis_im_lora_dir/Xiang_Consis_im_lora-000006.safetensors
	```


	<video controls autoplay src="https://huggingface.co/svjack/Prince_Xiang_ConsistentID_HunyuanVideo_lora/resolve/main/20250209-123847_1234.mp4 "></video>


	### Generate Video 2: Prince Xiang
	Run the following command to generate a video of Prince Xiang:
	```bash
	python hv_generate_video.py \
	--fp8 \
	--video_size 544 960 \
	--video_length 60 \
	--infer_steps 30 \
	--prompt "Unreal 5 render of a handsome man, warm atmosphere, in a lush, vibrant forest. The scene is bathed in golden sunlight filtering through the dense canopy." \
	--save_path . \
	--output_type both \
	--dit ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \
	--attn_mode sdpa \
	--vae ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt \
	--vae_chunk_size 32 \
	--vae_spatial_tile_sample_min_size 128 \
	--text_encoder1 ckpts/text_encoder \
	--text_encoder2 ckpts/text_encoder_2 \
	--seed 1234 \
	--lora_multiplier 1.0 \
	--lora_weight Xiang_Consis_im_lora_dir/Xiang_Consis_im_lora-000006.safetensors

	```


	<video controls autoplay src="https://huggingface.co/svjack/Prince_Xiang_ConsistentID_HunyuanVideo_lora/resolve/main/20250209-131316_1234.mp4"></video>


	---

	## Notes
	- Ensure you have sufficient GPU resources for video generation.
	- Adjust the `--video_size`, `--video_length`, and `--infer_steps` parameters as needed for different output qualities and lengths.
	- The `--prompt` parameter can be modified to generate videos with different scenes or actions.

	---