	---
	title: ShortSmith v2
	emoji: 🎬
	colorFrom: purple
	colorTo: blue
	sdk: gradio
	sdk_version: "4.44.1"
	app_file: app.py
	pinned: false
	license: mit
	tags:
	- video
	- highlight-detection
	- ai
	- qwen
	- computer-vision
	- audio-analysis
	short_description: AI-Powered Video Highlight Extractor
	---

	# ShortSmith v2

	Extract the most engaging highlight clips from your videos automatically using AI.

	## Features
	- Multi-modal analysis (visual + audio + motion)
	- Domain-optimized presets (Sports, Music, Vlogs, etc.)
	- Person-specific filtering
	- Scene-aware clip cutting
	- Hype scoring trained on Mr. HiSum "Most Replayed" data
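
To illustrate what scene-aware clip cutting can mean in practice, here is a minimal sketch that snaps a proposed clip's start and end to the nearest detected scene cut when one is close enough. This is an assumption about the general technique, not the code in `app.py`; the function names (`snap_to_cut`, `snap_clip`) and the 1.5-second tolerance are hypothetical.

```python
from bisect import bisect_left


def snap_to_cut(t: float, cuts: list[float], tol: float) -> float:
    """Return the scene cut nearest to t if it lies within tol seconds, else t.

    `cuts` must be sorted in ascending order (timestamps in seconds).
    """
    if not cuts:
        return t
    i = bisect_left(cuts, t)
    # Only the neighbours on either side of the insertion point can be nearest.
    candidates = cuts[max(0, i - 1): i + 1]
    nearest = min(candidates, key=lambda c: abs(c - t))
    return nearest if abs(nearest - t) <= tol else t


def snap_clip(start: float, end: float, cuts: list[float], tol: float = 1.5):
    """Snap both clip boundaries; fall back to the original clip if it collapses."""
    s, e = snap_to_cut(start, cuts, tol), snap_to_cut(end, cuts, tol)
    return (s, e) if e > s else (start, end)


# Hypothetical scene cuts (seconds), e.g. as produced by a scene detector.
cuts = [0.0, 12.4, 30.1, 47.8]
print(snap_clip(13.0, 29.0, cuts))  # both ends are near a cut, so both move
print(snap_clip(20.0, 25.0, cuts))  # no cut within tolerance, clip unchanged
```

Snapping only within a tolerance keeps the clip close to the requested duration while avoiding cuts that land a fraction of a second into the next shot.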

	## Usage
	1. Upload a video (up to 500 MB, max 1 hour)
	2. Select the content domain (Sports, Music, Vlogs, etc.)
	3. Choose the number of clips and the clip duration
	4. (Optional) Upload a reference image for person filtering
	5. Click "Extract Highlights"
	6. Download your clips!
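
The upload limits in step 1 can be checked before any heavy processing starts. The sketch below is one way to do that; it is not taken from `app.py`. The names `check_upload`, `MAX_SIZE_BYTES`, and `MAX_DURATION_S` are hypothetical, and it assumes "500MB" means 500 × 1024² bytes.

```python
# Limits from the Usage section. Interpreting "500MB" as binary megabytes
# is an assumption; the README does not say which unit is meant.
MAX_SIZE_BYTES = 500 * 1024 * 1024
MAX_DURATION_S = 60 * 60  # "max 1 hour"


def check_upload(size_bytes: int, duration_s: float) -> list[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if size_bytes > MAX_SIZE_BYTES:
        size_mb = size_bytes / (1024 * 1024)
        problems.append(f"File is {size_mb:.0f} MB; the limit is 500 MB.")
    if duration_s > MAX_DURATION_S:
        minutes = duration_s / 60
        problems.append(f"Video is {minutes:.0f} min long; the limit is 60 min.")
    return problems


# Example: a 620 MB, 45-minute file fails only the size check.
for problem in check_upload(620 * 1024 * 1024, 45 * 60):
    print(problem)
```

Returning every problem at once, rather than raising on the first, lets the UI show the user all the issues in a single message.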

	## Tech Stack
	- Visual: Qwen2-VL-2B (INT4 quantized)
	- Audio: Librosa + Wav2Vec 2.0
	- Face Recognition: InsightFace (SCRFD + ArcFace)
	- Hype Scoring: MLP trained on Mr. HiSum dataset
	- Scene Detection: PySceneDetect
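
This README does not describe how the hype scores become clips. As an illustration only, the sketch below assumes one score per second of video and greedily picks the highest-scoring non-overlapping windows of the requested length. The function name `top_clips` and the greedy strategy are assumptions, not the Space's actual selection logic.

```python
def top_clips(scores: list[float], clip_len: int, n_clips: int) -> list[tuple[int, int]]:
    """Pick up to n_clips non-overlapping windows with the highest mean score.

    scores   -- one (hypothetical) hype score per second of video
    clip_len -- clip duration in seconds
    Returns (start, end) pairs in seconds, sorted by start time.
    """
    if clip_len <= 0 or clip_len > len(scores):
        return []

    # Prefix sums make every window mean an O(1) lookup.
    prefix = [0.0]
    for s in scores:
        prefix.append(prefix[-1] + s)

    windows = [
        ((prefix[i + clip_len] - prefix[i]) / clip_len, i)
        for i in range(len(scores) - clip_len + 1)
    ]
    # Best mean first; ties broken by the earlier start.
    windows.sort(key=lambda w: (-w[0], w[1]))

    chosen: list[int] = []
    for _, start in windows:
        if len(chosen) == n_clips:
            break
        # Keep the window only if it overlaps none of the chosen ones.
        if all(start + clip_len <= c or c + clip_len <= start for c in chosen):
            chosen.append(start)
    return sorted((c, c + clip_len) for c in chosen)


scores = [0, 0, 5, 5, 0, 0, 0, 9, 9, 0]
print(top_clips(scores, clip_len=2, n_clips=2))  # [(2, 4), (7, 9)]
```

The resulting windows could then be passed through a scene-boundary adjustment before cutting, so that clips do not start or end mid-shot.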

	For the Space metadata fields at the top of this file, see the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference