Spaces:

MogensR
/

VideoBackgroundReplacer

Paused

App Files Files Community

MogensR commited on Aug 15, 2025

Commit

76b6368

1 Parent(s): a226dc7

Update README.md

Browse files

Files changed (1) hide show

README.md +69 -173

README.md CHANGED Viewed

@@ -1,200 +1,96 @@
 ---
-title: BackgroundFX Fast
-emoji: 🚀
 colorFrom: blue
-colorTo: green
-sdk: docker
 pinned: false
 license: mit
-models:
-  - rembg/u2net_human_seg
-hardware: T4 medium
 ---
-# 🚀 BackgroundFX - Lightning-Fast Video Background Replacement
-**Professional-quality background replacement in seconds, not minutes!** Powered by specialized AI models optimized for T4 GPU performance.
-[![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/yourusername/backgroundfx)
-[![GPU Optimized](https://img.shields.io/badge/GPU-T4%20Optimized-green)](https://www.nvidia.com/en-us/data-center/tesla-t4/)
-## ⚡ Performance Benchmarks
-| Video Length | Ultra Fast | Fast | Balanced | Quality |
-|-------------|------------|------|----------|---------|
-| 10 seconds | 5 sec | 10 sec | 15 sec | 20 sec |
-| 30 seconds | 15 sec | 30 sec | 45 sec | 60 sec |
-| 60 seconds | 30 sec | 60 sec | 90 sec | 120 sec |
-*Benchmarks on T4 GPU with 1080p video*
-## 🎯 Key Features
-### **🏃‍♂️ Speed-First Design**
-- **5-10x faster** than SAM2-based solutions
-- Optimized for T4 GPU on Hugging Face Spaces
-- Real-time preview of first frame
-- Batch processing for maximum efficiency
-### **🎨 Intelligent Segmentation**
-- **Rembg U2NET**: Purpose-built for human segmentation (92-95% accuracy)
-- **MatAnyone Integration**: Optional edge refinement for hair and clothing
-- **Automatic fallback**: Works even without GPU
-### **🎬 Flexible Processing Modes**
-- **Ultra Fast**: Every 3rd frame, direct compositing (3x speed)
-- **Fast**: Every 2nd frame (2x speed)
-- **Balanced**: All frames, optimized pipeline
-- **Quality**: Full processing with green screen workflow
-### **🖼️ Background Options**
-- **Gradient backgrounds**: Instant generation
-- **Solid colors**: Simple and clean
-- **Image URL**: Direct from web
-- **Upload**: Your own images
-## 🔧 Technology Stack
-```
-Pipeline: Rembg → MatAnyone (optional) → Compositing → Output
-```
-| Component | Purpose | Performance Impact |
-|-----------|---------|-------------------|
-| **Rembg** | Person extraction | Base speed |
-| **U2NET_human_seg** | Specialized human model | Optimized for people |
-| **MatAnyone** | Edge refinement | +20% time, better edges |
-| **OpenCV** | Video processing | Hardware accelerated |
-| **Torch** | GPU acceleration | 5-10x speedup |
-## 📦 Installation
-### Quick Deploy to Hugging Face Spaces
-1. **Clone this repository**
-2. **Create new Space** on Hugging Face
-3. **Select T4 GPU** (medium or small)
-4. **Push code** and wait for build
-### Requirements
-```txt
-streamlit==1.48.0
-opencv-python-headless
-numpy
-Pillow
-rembg
-torch
-torchvision
-onnxruntime-gpu
-matanyone  # Optional: for edge refinement
-```
-## 🚀 Usage
-### Simple 3-Step Process
-1. **Upload Video** 📹
-   - Supports MP4, AVI, MOV, MKV
-   - Recommended: Under 30 seconds for fastest processing
-2. **Choose Background** 🎨
-   - Gradient: Instant custom gradients
-   - Color: Solid color backgrounds
-   - Image: URL or upload
-3. **Select Speed & Process** ⚡
-   - Pick your speed/quality tradeoff
-   - Optional MatAnyone refinement
-   - Download result
-## 🎯 Use Cases
-- **Content Creation**: YouTube, TikTok, Instagram videos
-- **Professional**: Video calls, presentations, demos
-- **Education**: Online courses, tutorials
-- **Marketing**: Product videos, advertisements
-- **Personal**: Fun videos, memes, creative content
-## 🏗️ Architecture Decisions
-### Why Rembg over SAM2?
-| Aspect | Rembg | SAM2 |
-|--------|-------|------|
-| **Human Segmentation** | 92-95% accuracy | 85-90% accuracy |
-| **Speed** | 15-20 FPS | 2-3 FPS |
-| **Memory** | 500MB-1GB | 2-4GB |
-| **Setup** | Simple | Complex |
-| **Purpose** | Specialized for humans | General purpose |
-### Why MatAnyone?
-- Refines edges around hair and clothing
-- Minimal performance impact (20%)
-- Optional - can disable for speed
-- Professional-quality output
-## 📊 Performance Optimization Tips
-1. **For fastest processing**:
-   - Use "Ultra Fast" mode
-   - Disable MatAnyone
-   - Use gradient backgrounds
-   - Keep videos under 30 seconds
-2. **For best quality**:
-   - Use "Quality" mode
-   - Enable MatAnyone
-   - Use green screen workflow
-   - Process at full resolution
-3. **For best balance**:
-   - Use "Fast" mode
-   - Enable MatAnyone for important videos
-   - Gradient or simple backgrounds
-## 🐛 Troubleshooting
-| Issue | Solution |
-|-------|----------|
-| **Slow processing** | Switch to "Fast" or "Ultra Fast" mode |
-| **GPU not detected** | Ensure T4 GPU is enabled in Space settings |
-| **Out of memory** | Use "Ultra Fast" mode or shorter videos |
-| **Poor edges** | Enable MatAnyone refinement |
-| **Video won't play** | Check video codec compatibility |
-## 📈 Roadmap
-- [ ] Batch video processing
-- [ ] Custom model fine-tuning
-- [ ] Real-time preview
-- [ ] Mobile app
-- [ ] API endpoint
-- [ ] More background effects
-## 🤝 Contributing
-Contributions welcome! Please check our guidelines.
-## 📄 License
-MIT License - feel free to use in your projects!
-## 🙏 Acknowledgments
-- **Rembg** team for the excellent segmentation models
-- **MatAnyone** for edge refinement technology
-- **Hugging Face** for GPU infrastructure
-- **Streamlit** for the amazing framework
-## 💬 Support
-- [GitHub Issues](https://github.com/yourusername/backgroundfx/issues)
-- [Hugging Face Discussion](https://huggingface.co/spaces/yourusername/backgroundfx/discussions)
----
-**Built for speed, designed for quality.** 🚀
-*Optimized for T4 GPU on Hugging Face Spaces*

 ---
+title: BackgroundFX Pro - SAM2 Powered
+emoji: 🎥
 colorFrom: blue
+colorTo: purple
+sdk: gradio
+sdk_version: 4.0.0
+app_file: app.py
 pinned: false
 license: mit
+suggested_hardware: t4-small
+suggested_storage: small
 ---
+# 🎥 BackgroundFX Pro - SAM2 Powered
+**Professional AI video background replacement with advanced segmentation**
+Upload your video and let SAM2 AI automatically detect and replace the background with precision. Optimized for Hugging Face Spaces with smart memory management and lazy loading.
+## ✨ Features
+- **🤖 SAM2 Integration**: State-of-the-art segmentation with Meta's SAM2
+- **⚡ Smart Loading**: True lazy loading - models download only when needed
+- **🎨 Background Options**: 8 built-in presets + custom image upload
+- **🔧 Advanced Settings**: Model size selection and edge smoothing
+- **💾 Memory Optimized**: Automatic cleanup and CUDA cache management
+- **📱 Professional UI**: Clean, intuitive interface with real-time progress
+## 🚀 Quick Start
+1. Upload a video (MP4, AVI, MOV, MKV, WebM - max 5 minutes)
+2. Choose a background preset or upload custom image
+3. Select AI model size (Tiny/Small/Base)
+4. Click "Replace Background" and wait for processing
+5. Download your professional video!
+## 💡 Pro Tips
+- **Best results**: Clear subject separation from background
+- **Lighting**: Even lighting works best for accurate segmentation
+- **Movement**: Minimal camera shake recommended
+- **Processing time**: ~30-60 seconds per minute of video
+- **GPU acceleration**: Automatically uses available GPU for faster processing
+## 🔧 Technical Details
+- **Models**: SAM2 Tiny (38MB), Small (185MB), Base (320MB)
+- **Formats**: Supports all major video formats
+- **Resolution**: Up to 1920x1080 (Full HD)
+- **Duration**: Max 5 minutes on free tier
+- **Memory**: True lazy loading with automatic cleanup
+## 🎬 Use Cases
+- **Content Creation**: Remove messy backgrounds for professional videos
+- **Virtual Meetings**: Create custom backgrounds for video calls
+- **Education**: Clean backgrounds for instructional videos
+- **Social Media**: Eye-catching backgrounds for posts and stories
+## 🏗️ Built With
+- [SAM2](https://github.com/facebookresearch/segment-anything-2) - Meta's Segment Anything Model 2
+- [Gradio](https://gradio.app/) - Machine learning web interface framework
+- [OpenCV](https://opencv.org/) - Computer vision library
+- [PyTorch](https://pytorch.org/) - Deep learning framework
+## 📋 System Requirements
+- **Recommended**: GPU-enabled Space (T4-small or better)
+- **Minimum**: CPU-only mode supported but slower
+- **Memory**: Automatic management with CUDA optimization
+- **Storage**: No persistent storage needed (lazy loading)
+## 🎯 Background Presets
+Choose from 8 beautiful presets:
+- **Ocean Blue** - Professional gradient
+- **Sunset Orange** - Warm and vibrant
+- **Forest Green** - Natural and calm
+- **Purple Haze** - Creative and modern
+- **Pure White** - Clean and minimal
+- **Pure Black** - Dramatic effect
+- **Chroma Green** - For further editing
+- **Chroma Blue** - Alternative chroma key
+## ⚡ Performance Guide
+| Hardware | Processing Speed | Best Model | Concurrent Users |
+|----------|------------------|------------|------------------|
+| CPU | 2-3 min/video min | Tiny | 1 |
+| T4-small | 30-60s/video min | Small | 1-2 |
+| T4-medium | 20-40s/video min | Base | 2-3 |
+| A10G+ | 15-30s/video min | Base | 3-5 |
+Check out the [configuration reference](https://huggingface.co/docs/hub/spaces-config-reference) for more details.