| # Prince Xiang HunyuanVideo LoRA | |
| This repository contains the setup instructions and scripts for generating videos with the HunyuanVideo model and a LoRA (Low-Rank Adaptation) fine-tuned on Prince Xiang. The sections below cover installing dependencies, downloading models, and running the demo. | |
| --- | |
| ## Installation | |
| ### Step 1: Install System Dependencies | |
| Run the following command to install required system packages: | |
| ```bash | |
| sudo apt-get update && sudo apt-get install git-lfs ffmpeg cbm | |
| ``` | |
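Before moving on, it can help to confirm that the installed tools are actually on your `PATH`. The snippet below is a minimal sketch, not part of this repository: `check_cmds` is a hypothetical helper name, and it simply reports which of the command names you pass it can be found.

```bash
# Hypothetical helper (not part of this repository): report which of the
# given commands are available on PATH. Returns non-zero if any are missing.
check_cmds() {
    missing=0
    for cmd in "$@"; do
        if command -v "$cmd" >/dev/null 2>&1; then
            echo "ok: $cmd"
        else
            echo "missing: $cmd"
            missing=1
        fi
    done
    return "$missing"
}

# Example: check_cmds git-lfs ffmpeg
```

Calling `check_cmds git-lfs ffmpeg` after the `apt-get` step should print an `ok:` line for each tool; a `missing:` line suggests the install did not complete.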
| ### Step 2: Clone the Repository | |
| Clone the repository and navigate to the project directory: | |
| ```bash | |
| git clone https://huggingface.co/svjack/Prince_Xiang_ConsistentID_HunyuanVideo_lora | |
| cd Prince_Xiang_ConsistentID_HunyuanVideo_lora | |
| ``` | |
| ### Step 3: Install Python Dependencies | |
| Create a Python 3.10 conda environment, register it as a Jupyter kernel, and install the required Python packages: | |
| ```bash | |
| conda create -n py310 python=3.10 | |
| conda activate py310 | |
| pip install ipykernel | |
| python -m ipykernel install --user --name py310 --display-name "py310" | |
| pip install -r requirements.txt | |
| pip install ascii-magic matplotlib tensorboard huggingface_hub | |
| pip install moviepy==1.0.3 | |
| pip install sageattention==1.0.6 | |
| pip install torch==2.5.0 torchvision | |
| ``` | |
| --- | |
| ## Download Models | |
| ### Step 1: Download HunyuanVideo Model | |
| Download the HunyuanVideo model and place it in the `ckpts` directory: | |
| ```bash | |
| huggingface-cli download tencent/HunyuanVideo --local-dir ./ckpts | |
| ``` | |
| ### Step 2: Download LLaVA Model | |
| Download the LLaVA model, then run the HunyuanVideo preprocessing script to extract its text encoder into `ckpts/text_encoder`: | |
| ```bash | |
| cd ckpts | |
| huggingface-cli download xtuner/llava-llama-3-8b-v1_1-transformers --local-dir ./llava-llama-3-8b-v1_1-transformers | |
| wget https://raw.githubusercontent.com/Tencent/HunyuanVideo/refs/heads/main/hyvideo/utils/preprocess_text_encoder_tokenizer_utils.py | |
| python preprocess_text_encoder_tokenizer_utils.py --input_dir llava-llama-3-8b-v1_1-transformers --output_dir text_encoder | |
| ``` | |
| ### Step 3: Download CLIP Model | |
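| Download the CLIP model, which serves as the second text encoder. Run this from inside `ckpts` (entered in Step 2), then return to the project root so the demo paths resolve: | |
| ```bash | |
| huggingface-cli download openai/clip-vit-large-patch14 --local-dir ./text_encoder_2 | |
| cd .. | |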
| ``` | |
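Since the demo commands refer to several fixed paths under `ckpts`, a quick layout check can catch a download that landed in the wrong directory. The following is a sketch under that assumption: `check_ckpts` is a hypothetical helper, not a script shipped with this repository, and the paths it checks are the ones passed to `--dit`, `--vae`, `--text_encoder1`, and `--text_encoder2` in the Demo section.

```bash
# Hypothetical helper (not part of this repository): check that the paths
# used by the demo commands exist under the given checkpoint root.
check_ckpts() {
    root="${1:-ckpts}"
    rc=0
    for rel in \
        hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \
        hunyuan-video-t2v-720p/vae/pytorch_model.pt \
        text_encoder \
        text_encoder_2
    do
        if [ -e "$root/$rel" ]; then
            echo "found:   $root/$rel"
        else
            echo "missing: $root/$rel"
            rc=1
        fi
    done
    return "$rc"
}
```

Run `check_ckpts ckpts` from the project root (or `check_ckpts .` from inside `ckpts`). Any `missing:` line points at the download step to repeat.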
| --- | |
| ## Demo | |
| ### Generate Video 1: Prince Xiang | |
| Run the following command to generate a video of Prince Xiang: | |
| ```bash | |
| python hv_generate_video.py \ | |
| --fp8 \ | |
| --video_size 544 960 \ | |
| --video_length 60 \ | |
| --infer_steps 30 \ | |
| --prompt "Unreal 5 render of a handsome man img. warm atmosphere, at home, bedroom. a small fishing village on a pier in the background." \ | |
| --save_path . \ | |
| --output_type both \ | |
| --dit ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \ | |
| --attn_mode sdpa \ | |
| --vae ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt \ | |
| --vae_chunk_size 32 \ | |
| --vae_spatial_tile_sample_min_size 128 \ | |
| --text_encoder1 ckpts/text_encoder \ | |
| --text_encoder2 ckpts/text_encoder_2 \ | |
| --seed 1234 \ | |
| --lora_multiplier 1.0 \ | |
| --lora_weight Xiang_Consis_im_lora_dir/Xiang_Consis_im_lora-000006.safetensors | |
| ``` | |
| <video controls autoplay src="https://huggingface.co/svjack/Prince_Xiang_ConsistentID_HunyuanVideo_lora/resolve/main/20250209-123847_1234.mp4"></video> | |
| ### Generate Video 2: Prince Xiang | |
| Run the following command to generate a second video of Prince Xiang, this time in a forest scene: | |
| ```bash | |
| python hv_generate_video.py \ | |
| --fp8 \ | |
| --video_size 544 960 \ | |
| --video_length 60 \ | |
| --infer_steps 30 \ | |
| --prompt "Unreal 5 render of a handsome man, warm atmosphere, in a lush, vibrant forest. The scene is bathed in golden sunlight filtering through the dense canopy." \ | |
| --save_path . \ | |
| --output_type both \ | |
| --dit ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \ | |
| --attn_mode sdpa \ | |
| --vae ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt \ | |
| --vae_chunk_size 32 \ | |
| --vae_spatial_tile_sample_min_size 128 \ | |
| --text_encoder1 ckpts/text_encoder \ | |
| --text_encoder2 ckpts/text_encoder_2 \ | |
| --seed 1234 \ | |
| --lora_multiplier 1.0 \ | |
| --lora_weight Xiang_Consis_im_lora_dir/Xiang_Consis_im_lora-000006.safetensors | |
| ``` | |
| <video controls autoplay src="https://huggingface.co/svjack/Prince_Xiang_ConsistentID_HunyuanVideo_lora/resolve/main/20250209-131316_1234.mp4"></video> | |
| --- | |
| ## Notes | |
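| - Video generation is GPU-memory intensive. If you run out of memory, try reducing `--video_size` or `--video_length`. | |
| - Adjust `--video_size` (height and width in pixels), `--video_length` (number of frames), and `--infer_steps` (number of denoising steps) to trade output quality and length against generation time. | |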
| - The `--prompt` parameter can be modified to generate videos with different scenes or actions. | |
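To try several prompts with the same settings, the demo command can be wrapped in a small loop. This is only a sketch: `generate_for_prompts`, `run_or_print`, and the `DRY_RUN`, `VIDEO_LENGTH`, and `INFER_STEPS` environment variables are hypothetical names introduced here, while every flag and path is copied from the demo commands above.

```bash
# Hypothetical helpers (not part of this repository).
# run_or_print: execute a command, or just print it when DRY_RUN=1.
# Note: the printed form drops shell quoting, so it is for inspection only.
run_or_print() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# generate_for_prompts: run the demo command once per prompt argument.
# VIDEO_LENGTH and INFER_STEPS default to the values used in the demo.
generate_for_prompts() {
    for prompt in "$@"; do
        run_or_print python hv_generate_video.py \
            --fp8 \
            --video_size 544 960 \
            --video_length "${VIDEO_LENGTH:-60}" \
            --infer_steps "${INFER_STEPS:-30}" \
            --prompt "$prompt" \
            --save_path . \
            --output_type both \
            --dit ckpts/hunyuan-video-t2v-720p/transformers/mp_rank_00_model_states.pt \
            --attn_mode sdpa \
            --vae ckpts/hunyuan-video-t2v-720p/vae/pytorch_model.pt \
            --vae_chunk_size 32 \
            --vae_spatial_tile_sample_min_size 128 \
            --text_encoder1 ckpts/text_encoder \
            --text_encoder2 ckpts/text_encoder_2 \
            --seed 1234 \
            --lora_multiplier 1.0 \
            --lora_weight Xiang_Consis_im_lora_dir/Xiang_Consis_im_lora-000006.safetensors
    done
}
```

For example, `DRY_RUN=1 INFER_STEPS=20 generate_for_prompts "first prompt" "second prompt"` prints the two commands without running them; dropping `DRY_RUN=1` runs them from the project root.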
| --- |