dev_caio / README.md
Chaitanya-aitf's picture
Update README.md
cec57cb verified

A newer version of the Gradio SDK is available: 6.6.0

Upgrade
metadata
title: ShortSmith v2
emoji: 🎬
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: false
license: mit
tags:
  - video
  - highlight-detection
  - ai
  - qwen
  - computer-vision
  - audio-analysis
short_description: AI-Powered Video Highlight Extractor

ShortSmith v2

Extract the most engaging highlight clips from your videos automatically using AI.

Features

  • Multi-modal analysis (visual + audio + motion)
  • Domain-optimized presets (Sports, Music, Vlogs, etc.)
  • Person-specific filtering
  • Scene-aware clip cutting
  • Trained on Mr. HiSum "Most Replayed" data

Usage

  1. Upload a video (up to 500MB, max 1 hour)
  2. Select content domain (Sports, Music, Vlogs, etc.)
  3. Choose number of clips and duration
  4. (Optional) Upload reference image for person filtering
  5. Click "Extract Highlights"
  6. Download your clips!

Tech Stack

  • Visual: Qwen2-VL-2B (INT4 quantized)
  • Audio: Librosa + Wav2Vec 2.0
  • Face Recognition: InsightFace (SCRFD + ArcFace)
  • Hype Scoring: MLP trained on Mr. HiSum dataset
  • Scene Detection: PySceneDetect

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference