--- title: ShortSmith v2 emoji: 🎬 colorFrom: purple colorTo: blue sdk: gradio sdk_version: "4.44.1" app_file: app.py pinned: false license: mit tags: - video - highlight-detection - ai - qwen - computer-vision - audio-analysis short_description: AI-Powered Video Highlight Extractor --- # ShortSmith v2 Extract the most engaging highlight clips from your videos automatically using AI. ## Features - Multi-modal analysis (visual + audio + motion) - Domain-optimized presets (Sports, Music, Vlogs, etc.) - Person-specific filtering - Scene-aware clip cutting - Trained on Mr. HiSum "Most Replayed" data ## Usage 1. Upload a video (up to 500MB, max 1 hour) 2. Select content domain (Sports, Music, Vlogs, etc.) 3. Choose number of clips and duration 4. (Optional) Upload reference image for person filtering 5. Click "Extract Highlights" 6. Download your clips! ## Tech Stack - **Visual**: Qwen2-VL-2B (INT4 quantized) - **Audio**: Librosa + Wav2Vec 2.0 - **Face Recognition**: InsightFace (SCRFD + ArcFace) - **Hype Scoring**: MLP trained on Mr. HiSum dataset - **Scene Detection**: PySceneDetect Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference