Spaces:
Paused
Paused
| title: ShortSmith v2 | |
| emoji: 🎬 | |
| colorFrom: purple | |
| colorTo: blue | |
| sdk: gradio | |
| sdk_version: "4.44.1" | |
| app_file: app.py | |
| pinned: false | |
| license: mit | |
| tags: | |
| - video | |
| - highlight-detection | |
| - ai | |
| - qwen | |
| - computer-vision | |
| - audio-analysis | |
| short_description: AI-Powered Video Highlight Extractor | |
| # ShortSmith v2 | |
| Extract the most engaging highlight clips from your videos automatically using AI. | |
| ## Features | |
| - Multi-modal analysis (visual + audio + motion) | |
| - Domain-optimized presets (Sports, Music, Vlogs, etc.) | |
| - Person-specific filtering | |
| - Scene-aware clip cutting | |
| - Trained on Mr. HiSum "Most Replayed" data | |
| ## Usage | |
| 1. Upload a video (up to 500MB, max 1 hour) | |
| 2. Select content domain (Sports, Music, Vlogs, etc.) | |
| 3. Choose number of clips and duration | |
| 4. (Optional) Upload reference image for person filtering | |
| 5. Click "Extract Highlights" | |
| 6. Download your clips! | |
| ## Tech Stack | |
| - **Visual**: Qwen2-VL-2B (INT4 quantized) | |
| - **Audio**: Librosa + Wav2Vec 2.0 | |
| - **Face Recognition**: InsightFace (SCRFD + ArcFace) | |
| - **Hype Scoring**: MLP trained on Mr. HiSum dataset | |
| - **Scene Detection**: PySceneDetect | |
| Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference | |