Spaces:

RafaG
/

ViralCutterPRO

Sleeping

App Files Files Community

ViralCutterPRO / README_en.md

RafaG

Upload 85 files

80b326d verified 19 days ago

preview code

raw

history blame contribute delete

7.01 kB

ViralCutter: Viral Video Generator

English | Português

Description

ViralCutter is an innovative tool designed to generate viral videos from existing content. With advanced video and audio processing techniques, ViralCutter cuts and edits video segments that are perfect for sharing on social media. Using the WhisperX model for transcription and automatic caption generation, it adapts videos to the 9:16 (vertical) format, ideal for platforms like TikTok, Instagram Reels, and YouTube Shorts.

What's New & Updates (Changelog)

Check out the latest improvements:

New WebUI (Gradio): Modern graphical interface inspired by OpusClip, making it easier to use all tools.
Fast Installation (UV): New .bat script that uses uv to install dependencies much faster.
Performance Optimization: Transcription "slicing" implemented. The video is transcribed only once, and cuts reuse the data, eliminating reprocessing.
Flexible AI Support: Native integration with Gemini API and experimental support for G4F (GPT-4 Free), plus a Manual mode.
External Configuration: api_config.json and prompt.txt files for easy customization without touching the code.
Face Fix: MediaPipe fix for precise face tracking without relying on "Center Crop".
Subtitle Improvements: Smart positioning for 2-face videos (split screen) and style corrections.

(See changelog.md for full details)

Interface

Main Screen: OpusClip-style gallery and intuitive controls

Settings: AI adjustment, captions, and log viewer

Features

Video Download: Downloads YouTube videos via a provided URL.
Audio Transcription: Converts audio to text using the WhisperX model.
Viral Segment Identification: Uses AI to detect parts of the video with high viral potential.
Cutting & Formatting: Cuts selected segments and adjusts the aspect ratio to 9:16.
Smart Cropping: Keeps the speaker in focus (Face Tracking) or uses automatic Split Screen (2-Faces) mode.
Audio/Video Merging: Combines transcribed audio with processed video clips.
Batch Export: Generates a ZIP file with all created viral videos, facilitating download and sharing.
Custom Captions: Create custom captions with colors, highlights, no highlights, or word-by-word styles, offering extensive editing possibilities.

How to Use

Open the link and follow the steps in order(Only Portuguese, sorry):
I couldn't get the video to download/play correctly on Gradio in Google Colab, but it's functional.

Limitations

The quality of generated videos may vary based on the quality of the original video.
Processing time depends heavily on your GPU.
The G4F model may be unstable or have request limits. Use Gemini for greater stability (requires an api_key).

Inspiration

This project was inspired by the following repositories:

TODO📝

Release code
Huggingface SpaceDemo
Two face in the cut
Custom caption and burn
Make the code faster
More types of framing beyond 9:16
The cut follows the face as it moves
Automatic translation
Satisfying video on the side
Background music
Watermark at user's choice
Upload directly to YouTube channel

Examples

Installation and Local Usage

Prerequisites

Python 3.10+
FFmpeg installed and in the system PATH.
NVIDIA GPU recommended (with CUDA installed) for WhisperX.

Configuration

Install dependencies:

Option A (Recommended - Fast): Run the install_dependencies.bat file. It will use uv to install everything quickly.

Option B (Manual):
```
pip install -r requirements.txt
```
(Note: WhisperX and Torch may require specific installation instructions for your CUDA version).

Configure API (Optional but Recommended): Edit the api_config.json file in the root folder:

{
    "selected_api": "gemini",
    "gemini": {
        "api_key": "YOUR_KEY_HERE"
    }
}

Running

Graphical Interface (WebUI)

To use the new visual interface: Double-click run_webui.bat or run:

.\run_webui.bat

Interactive Mode (Simple)

Just run the script and follow the on-screen instructions:

python main_improved.py

CLI Mode (Advanced)

You can pass all arguments via command line for automation:

python main_improved.py --url "https://youtu.be/EXAMPLE" --segments 3 --ai-backend gemini --model large-v3-turbo

Main Arguments:

--url: YouTube video URL.
--segments: Number of cuts to generate.
--ai-backend: gemini (Recommended), g4f, or manual.
--viral: Activates automatic viral search mode.
--face-mode: auto, 1 (one face), or 2 (two faces/split).
--workflow: 1 (Full) or 2 (Cut Only, no captions/crop).

Contributions

Want to help make ViralCutter even better? If you have suggestions or want to contribute to the code, feel free to open an issue or submit a pull request on our GitHub repository.

Version

0.7v Alpha
A free alternative to opus.pro and vidyo.ai.