| # π¬ AI Video Generator Pro - Hailuo-Inspired Edition | |
| A powerful, feature-rich AI video generation application inspired by Hailuo AI, supporting multiple state-of-the-art models from Hugging Face. | |
| ## β¨ Key Features | |
| ### π― Multiple Generation Modes | |
| - **Text-to-Video**: Create videos from text descriptions | |
| - **Image-to-Video**: Animate static images with AI | |
| ### π€ Multiple AI Models | |
| - **CogVideoX-5B**: High-quality 6-second videos at 720p | |
| - **LTX Video**: Fast and efficient generation | |
| - **Stable Video Diffusion**: Professional image animation | |
| - **AnimateDiff**: Advanced motion control | |
| - **Zeroscope V2 XL**: Fast and reliable baseline | |
| ### π₯ Hailuo-Inspired Features | |
| - **Camera Movements**: Zoom, pan, tilt, tracking shots, dolly, crane, shake | |
| - **Visual Effects**: Cinematic lighting, fog, rain, slow motion | |
| - **Video Styles**: Realistic, anime, cartoon, 3D render, vintage, sci-fi, fantasy | |
| - **Enhanced Prompts**: Automatic prompt enhancement with cinematic tags | |
| ## π Quick Start | |
| ### Installation | |
| ```bash | |
| cd /Users/sravyalu/VideoAI/hailuo-clone | |
| pip install -r requirements.txt | |
| ``` | |
| ### Running | |
| **Enhanced version (recommended)**: | |
| ```bash | |
| python backend_enhanced.py | |
| ``` | |
| Then open `index_enhanced.html` in your browser. | |
| **Basic version**: | |
| ```bash | |
| python backend.py | |
| ``` | |
| Then open `index.html` in your browser. | |
| ## π Usage Guide | |
| ### Text-to-Video | |
| 1. Select "Text to Video" mode | |
| 2. Choose an AI model | |
| 3. Enter your prompt (3-1000 characters) | |
| 4. Add camera movements, effects, or styles (optional) | |
| 5. Click "Generate Video" | |
| 6. Wait 30-120 seconds | |
| 7. Download or share | |
| ### Image-to-Video | |
| 1. Select "Image to Video" mode | |
| 2. Upload an image | |
| 3. Add animation prompt (optional) | |
| 4. Select image-to-video model | |
| 5. Generate | |
| ## π― Example Prompts | |
| **Nature**: "A majestic waterfall cascading down mossy rocks in a lush rainforest" | |
| **Action**: "A sports car drifting around a corner" + Camera: [Tracking shot] | |
| **Fantasy**: "A dragon flying over a medieval castle at dawn" + Style: fantasy, magical | |
| ## π οΈ Available Models | |
| | Model | Type | Resolution | Best For | | |
| |-------|------|------------|----------| | |
| | CogVideoX-5B | T2V | 720x480 | High quality | | |
| | LTX Video | T2V | 704x480 | Fast generation | | |
| | Stable Video Diffusion | I2V | 576x576 | Image animation | | |
| | AnimateDiff | I2V | 512x512 | Motion control | | |
| | Zeroscope | T2V | 512x320 | Quick tests | | |
| ## π Project Files | |
| - `backend_enhanced.py` - Enhanced backend with multiple models | |
| - `index_enhanced.html` - Full-featured frontend | |
| - `models_config.py` - Model configurations | |
| - `requirements.txt` - Dependencies | |
| ## π§ API Endpoints | |
| **GET /health** - Server health check | |
| **GET /models** - List available models and options | |
| **POST /generate-video** - Text-to-video generation | |
| **POST /generate-video-from-image** - Image-to-video generation | |
| ## π‘ Tips | |
| - Start with Zeroscope for quick tests | |
| - Use CogVideoX-5B for final high-quality output | |
| - Combine camera movements with visual effects | |
| - Keep prompts descriptive and specific | |
| - Image-to-video works best with clear, well-lit images | |
| ## π Troubleshooting | |
| **Connection errors**: Check internet and Hugging Face availability | |
| **Timeouts**: Service may be busy, try again or use faster model | |
| **Slow generation**: Normal for high-quality models (30-120s) | |
| ## π What's New vs Basic Version | |
| β Multiple AI models (5 models vs 1) | |
| β Image-to-video capability | |
| β Camera movement controls (Hailuo-style) | |
| β Visual effects and styles | |
| β Enhanced prompt building | |
| β Better UI with dual panels | |
| β Categorized example prompts | |
| β Model information display | |
| --- | |
| **Made with β€οΈ inspired by Hailuo AI** | |