Spaces:
Running on Zero
Running on Zero
| title: Generative Video Editor GPU | |
| short_description: Uses natural language to generate and control edits | |
| emoji: 💻 | |
| colorFrom: green | |
| colorTo: green | |
| sdk: gradio | |
| sdk_version: 6.2.0 | |
| app_file: app.py | |
| pinned: false | |
| disable_embedding: false | |
| models: | |
| - zai-org/GLM-4.7-Flash | |
| tags: | |
| - ffmpeg | |
| - video-editing | |
| - natural-language | |
| - ai-video | |
| - no-code | |
| - video-processing | |
| - media-converter | |
| - slideshow-maker | |
| - video-merger | |
| - command-generator | |
| - waveform-visualization | |
| - audio-to-video | |
| - image-to-video | |
| # 🏞 AI Video Composer - FFMPEG in Plain English | |
| Describe what you want in plain English, like "create a slideshow from these images with background music" or "add a waveform visualization to this audio", and AI generates the exact FFMPEG command to make it happen. | |
| ## How It Works | |
| 1. **Upload Media Files**: | |
| - Supports multiple file formats including: | |
| - Images: .png, .jpg, .jpeg, .tiff, .bmp, .gif, .svg | |
| - Audio: .mp3, .wav, .ogg | |
| - Video: .mp4, .avi, .mov, .mkv, .flv, .wmv, .webm, and more | |
| - File size limit: 10MB per file | |
| - Video duration limit: 2 minutes | |
| 2. **Provide Instructions**: | |
| - Write natural language instructions describing how you want to process your media | |
| - Examples: | |
| - "Convert these images into a slideshow with 1 second per image" | |
| - "Add this audio track to the video" | |
| - "Make the video play 2x faster" | |
| - "Create a waveform visualization for this audio file" | |
| 3. **Advanced Parameters**: | |
| - Top-p (nucleus sampling): Controls diversity of generated commands (0-1) | |
| - Temperature: Controls randomness in command generation (0-5) | |
| 4. **Processing**: | |
| - The app analyzes your files and instructions | |
| - Generates an optimized FFmpeg command using your chosen AI model | |
| - Executes the command and returns the processed video | |
| - Displays the generated FFmpeg command for transparency | |
| ## Features | |
| - **Smart Command Generation**: Automatically generates optimal FFmpeg commands based on natural language input | |
| - **Error Handling**: Validates commands before execution and retries with alternative approaches if needed | |
| - **Multiple Asset Support**: Process multiple media files in a single operation | |
| - **Waveform Visualization**: Special support for audio visualization with customizable parameters | |
| - **Image Sequence Processing**: Efficient handling of image sequences for slideshow creation | |
| - **Format Conversion**: Support for various input/output format conversions | |
| - **Example Gallery**: Built-in examples demonstrating common use cases | |
| ## Technical Details | |
| - Built with Gradio for the user interface | |
| - Uses FFmpeg for media processing | |
| - Powered by GLM-4.7 or Kimi-K2 for command generation | |
| - Implements robust error handling and command validation | |
| - Processes files in a temporary directory for safety | |
| - Supports both simple operations and complex media transformations | |
| ## Limitations | |
| - Maximum file size: 10MB per file | |
| - Maximum video duration: 2 minutes | |
| - Output format: Always MP4 | |
| - Processing time may vary based on input complexity |