Spaces:

vidhi0405
/

VideoToText

Sleeping

App Files Files Community

vidhi0405 commited on 8 days ago

Commit

a4edb01

1 Parent(s): 23c0589

commit 3

Browse files

Files changed (1) hide show

README.md +12 -11

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
----
 title: SmolVLM2 Video Highlights
-emoji: ":movie_camera:"
 colorFrom: blue
 colorTo: purple
 sdk: docker
@@ -14,16 +14,16 @@ Generate intelligent video highlights using HuggingFace's segment-based approach
 This is a FastAPI service that uses HuggingFace's proven segment-based classification method with SmolVLM2-256M-Video-Instruct for reliable, consistent highlight generation.
-🚀 Features
 Segment-Based Analysis: Processes videos in fixed 5-second segments for consistent AI classification
 Dual Criteria Generation: Creates two different highlight criteria sets and selects the most selective one
 SmolVLM2-256M-Video-Instruct: Faster processing with specialized video understanding
 Visual Effects: Optional fade transitions between segments for professional-quality output
 REST API: Upload videos and get generated video description + analysis file path
-🔗 API Endpoints
 POST /upload-video - Upload video and receive analysis response
 GET /health - Health check
-📱 Usage
 Via API
 # Upload video with optional parameters
 curl -X POST \
@@ -43,27 +43,28 @@ Example response:
 Via Android App
 Use the provided Android client code to integrate with your mobile app.
-⚙️ Configuration
 Default settings:
 Segment Length: 5 seconds (fixed segments for consistent classification)
 Model: SmolVLM2-256M-Video-Instruct (faster processing)
 Effects: Enabled (fade transitions between segments)
 Dual Criteria: Two prompt variations for robust selection
-🛠️ Technology Stack
 SmolVLM2-256M-Video-Instruct: Efficient vision-language model optimized for video understanding
 HuggingFace Transformers: Latest transformer models and inference
 FastAPI: Modern web framework for APIs
 FFmpeg: Video processing with advanced filter support
 PyTorch: Deep learning framework with device optimization
-🎯 Perfect For
 Social media content creators
 Educational video processing
 Meeting/lecture summarization
 Sports highlight generation
 Entertainment content curation
-�� License
 Apache 2.0 - Free for commercial and personal use
-🤝 Contributing
-Built with ❤️ using Hugging Face Transformers and open-source AI models.

+---
 title: SmolVLM2 Video Highlights
+emoji: "🎬"
 colorFrom: blue
 colorTo: purple
 sdk: docker
 This is a FastAPI service that uses HuggingFace's proven segment-based classification method with SmolVLM2-256M-Video-Instruct for reliable, consistent highlight generation.
+ðŸš€ Features
 Segment-Based Analysis: Processes videos in fixed 5-second segments for consistent AI classification
 Dual Criteria Generation: Creates two different highlight criteria sets and selects the most selective one
 SmolVLM2-256M-Video-Instruct: Faster processing with specialized video understanding
 Visual Effects: Optional fade transitions between segments for professional-quality output
 REST API: Upload videos and get generated video description + analysis file path
+ðŸ”— API Endpoints
 POST /upload-video - Upload video and receive analysis response
 GET /health - Health check
+ðŸ“± Usage
 Via API
 # Upload video with optional parameters
 curl -X POST \
 Via Android App
 Use the provided Android client code to integrate with your mobile app.
+âš™ï¸ Configuration
 Default settings:
 Segment Length: 5 seconds (fixed segments for consistent classification)
 Model: SmolVLM2-256M-Video-Instruct (faster processing)
 Effects: Enabled (fade transitions between segments)
 Dual Criteria: Two prompt variations for robust selection
+ðŸ› ï¸ Technology Stack
 SmolVLM2-256M-Video-Instruct: Efficient vision-language model optimized for video understanding
 HuggingFace Transformers: Latest transformer models and inference
 FastAPI: Modern web framework for APIs
 FFmpeg: Video processing with advanced filter support
 PyTorch: Deep learning framework with device optimization
+ðŸŽ¯ Perfect For
 Social media content creators
 Educational video processing
 Meeting/lecture summarization
 Sports highlight generation
 Entertainment content curation
+ï¿½ï¿½ License
 Apache 2.0 - Free for commercial and personal use
+ðŸ¤ Contributing
+Built with â¤ï¸ using Hugging Face Transformers and open-source AI models.