Spaces:
Sleeping
Sleeping
commit 3
Browse files
README.md
CHANGED
|
@@ -1,6 +1,6 @@
|
|
| 1 |
-
---
|
| 2 |
title: SmolVLM2 Video Highlights
|
| 3 |
-
emoji: "
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: purple
|
| 6 |
sdk: docker
|
|
@@ -14,16 +14,16 @@ Generate intelligent video highlights using HuggingFace's segment-based approach
|
|
| 14 |
|
| 15 |
This is a FastAPI service that uses HuggingFace's proven segment-based classification method with SmolVLM2-256M-Video-Instruct for reliable, consistent highlight generation.
|
| 16 |
|
| 17 |
-
|
| 18 |
Segment-Based Analysis: Processes videos in fixed 5-second segments for consistent AI classification
|
| 19 |
Dual Criteria Generation: Creates two different highlight criteria sets and selects the most selective one
|
| 20 |
SmolVLM2-256M-Video-Instruct: Faster processing with specialized video understanding
|
| 21 |
Visual Effects: Optional fade transitions between segments for professional-quality output
|
| 22 |
REST API: Upload videos and get generated video description + analysis file path
|
| 23 |
-
|
| 24 |
POST /upload-video - Upload video and receive analysis response
|
| 25 |
GET /health - Health check
|
| 26 |
-
|
| 27 |
Via API
|
| 28 |
# Upload video with optional parameters
|
| 29 |
curl -X POST \
|
|
@@ -43,27 +43,28 @@ Example response:
|
|
| 43 |
Via Android App
|
| 44 |
Use the provided Android client code to integrate with your mobile app.
|
| 45 |
|
| 46 |
-
|
| 47 |
Default settings:
|
| 48 |
|
| 49 |
Segment Length: 5 seconds (fixed segments for consistent classification)
|
| 50 |
Model: SmolVLM2-256M-Video-Instruct (faster processing)
|
| 51 |
Effects: Enabled (fade transitions between segments)
|
| 52 |
Dual Criteria: Two prompt variations for robust selection
|
| 53 |
-
|
| 54 |
SmolVLM2-256M-Video-Instruct: Efficient vision-language model optimized for video understanding
|
| 55 |
HuggingFace Transformers: Latest transformer models and inference
|
| 56 |
FastAPI: Modern web framework for APIs
|
| 57 |
FFmpeg: Video processing with advanced filter support
|
| 58 |
PyTorch: Deep learning framework with device optimization
|
| 59 |
-
|
| 60 |
Social media content creators
|
| 61 |
Educational video processing
|
| 62 |
Meeting/lecture summarization
|
| 63 |
Sports highlight generation
|
| 64 |
Entertainment content curation
|
| 65 |
-
|
| 66 |
Apache 2.0 - Free for commercial and personal use
|
| 67 |
|
| 68 |
-
|
| 69 |
-
Built with
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
title: SmolVLM2 Video Highlights
|
| 3 |
+
emoji: "🎬"
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: purple
|
| 6 |
sdk: docker
|
|
|
|
| 14 |
|
| 15 |
This is a FastAPI service that uses HuggingFace's proven segment-based classification method with SmolVLM2-256M-Video-Instruct for reliable, consistent highlight generation.
|
| 16 |
|
| 17 |
+
🚀 Features
|
| 18 |
Segment-Based Analysis: Processes videos in fixed 5-second segments for consistent AI classification
|
| 19 |
Dual Criteria Generation: Creates two different highlight criteria sets and selects the most selective one
|
| 20 |
SmolVLM2-256M-Video-Instruct: Faster processing with specialized video understanding
|
| 21 |
Visual Effects: Optional fade transitions between segments for professional-quality output
|
| 22 |
REST API: Upload videos and get generated video description + analysis file path
|
| 23 |
+
🔗 API Endpoints
|
| 24 |
POST /upload-video - Upload video and receive analysis response
|
| 25 |
GET /health - Health check
|
| 26 |
+
📱 Usage
|
| 27 |
Via API
|
| 28 |
# Upload video with optional parameters
|
| 29 |
curl -X POST \
|
|
|
|
| 43 |
Via Android App
|
| 44 |
Use the provided Android client code to integrate with your mobile app.
|
| 45 |
|
| 46 |
+
âš™ï¸ Configuration
|
| 47 |
Default settings:
|
| 48 |
|
| 49 |
Segment Length: 5 seconds (fixed segments for consistent classification)
|
| 50 |
Model: SmolVLM2-256M-Video-Instruct (faster processing)
|
| 51 |
Effects: Enabled (fade transitions between segments)
|
| 52 |
Dual Criteria: Two prompt variations for robust selection
|
| 53 |
+
ðŸ› ï¸ Technology Stack
|
| 54 |
SmolVLM2-256M-Video-Instruct: Efficient vision-language model optimized for video understanding
|
| 55 |
HuggingFace Transformers: Latest transformer models and inference
|
| 56 |
FastAPI: Modern web framework for APIs
|
| 57 |
FFmpeg: Video processing with advanced filter support
|
| 58 |
PyTorch: Deep learning framework with device optimization
|
| 59 |
+
🎯 Perfect For
|
| 60 |
Social media content creators
|
| 61 |
Educational video processing
|
| 62 |
Meeting/lecture summarization
|
| 63 |
Sports highlight generation
|
| 64 |
Entertainment content curation
|
| 65 |
+
�� License
|
| 66 |
Apache 2.0 - Free for commercial and personal use
|
| 67 |
|
| 68 |
+
🤠Contributing
|
| 69 |
+
Built with â¤ï¸ using Hugging Face Transformers and open-source AI models.
|
| 70 |
+
|