Spaces:

MogensR
/

VideoBackgroundReplacer2

Paused

File size: 3,860 Bytes

c41ab7b
3afce52
c7f8992
 
 
70ed446
4d84cf4
 
 
9aa4b4e
c7f8992
c41ab7b
 
 
 
 
 
 
76580be
c41ab7b
76580be
84ac472
c41ab7b
 
6391dff
53cb751
6391dff
c41ab7b
76580be
c41ab7b
76580be
53cb751
c41ab7b
53cb751
c41ab7b
 
 
 
8e32139
76580be
c41ab7b
6391dff
c41ab7b
6391dff
c41ab7b
84ac472
76580be
c41ab7b
84ac472
c41ab7b
6391dff
53cb751
 
 
 
 
6391dff
53cb751
84ac472
c41ab7b
6391dff
c41ab7b
84ac472
53cb751
 
 
 
 
 
 
 
8e32139
 
 
6391dff
c41ab7b
84ac472
c41ab7b
84ac472
53cb751
 
 
 
 
84ac472
c41ab7b
84ac472
c41ab7b
84ac472
c41ab7b
84ac472
c41ab7b
 
 
6391dff
c41ab7b
 
6391dff
c41ab7b
 
53cb751

---
title: 🎬 BackgroundFX Pro - SAM2 + MatAnyone
emoji: 🎥
colorFrom: indigo
colorTo: purple
sdk: streamlit
sdk_version: 1.32.0
app_file: streamlit_app.py
pinned: false
license: mit
tags:
  - video
  - background-removal
  - segmentation
  - matting
  - SAM2
  - MatAnyone
---

# 🎬 BackgroundFX Pro — Professional Video Background Replacement

BackgroundFX Pro is a GPU-accelerated app for Hugging Face Spaces (Docker) that replaces video backgrounds using:
- **SAM2** — high-quality object segmentation  
- **MatAnyone** — temporal video matting for stable alpha over time

Built on: **CUDA 12.1.1**, **PyTorch 2.5.1 (cu121)**, **torchvision 0.20.1**, **Streamlit 1.49.1**.

---

## ✨ Features

- Replace backgrounds with: **solid color**, **AI-generated** image (procedural), **custom uploaded image**, or **professional backgrounds**  
- Optimized for **T4 GPUs** on Hugging Face  
- Two-stage pipeline: SAM2 segmentation → MatAnyone refinement → compositing
- Caching & logs stored in the repo volume:
  - HF cache → `./.hf`  
  - Torch cache → `./.torch`  
  - App data & logs → `./data` (see `data/run.log`)
- **FFmpeg** — video format conversion and frame extraction

---

## 🚀 Try It

Open the Space in your browser (GPU required):  
https://huggingface.co/spaces/MogensR/VideoBackgroundReplacer2

---

## 🖱️ How to Use

1. **Upload a video** (`.mp4`, `.mov`, `.avi`, `.mkv`).  
2. Choose a **Background Type**: Image, Color, Blur, Professional Backgrounds, or AI Generated.  
3. If using custom background, upload your image or select from professional options.  
4. Click **🚀 Process Video**.  
5. Preview and **💾 Download Result**.

> Tip: Start with 720p/1080p on T4; 4K can exceed memory limits.

---

## 🗂️ Project Structure (key files)

- `Dockerfile` — CUDA 12.1.1 + PyTorch 2.5.1 container
- `requirements.txt` — Python dependencies
- `app.py` — Main Streamlit application
- `integrated_pipeline.py` — Two-stage processing pipeline
- `models/sam2_loader.py` — SAM2 model loader with HF Hub integration
- `models/matanyone_loader.py` — MatAnyone model loader
- `utils/` — Utility functions
- `data/` — Created at runtime for logs/outputs  
- `tmp/` — Created at runtime for processing jobs - `video_pipeline.py` — Core video processing logic (SAM2 + MatAnyone integration)
- `video_pipeline.py` — Core video processing logic (SAM2 + MatAnyone integration)


---

## ⚙️ Runtime Notes

- Binds to `PORT` / `STREAMLIT_SERVER_PORT` (defaults to **7860**)
- File upload limit: 200MB via `--server.maxUploadSize=200`
- CORS disabled for Docker compatibility: `--server.enableCORS=false`
- Memory management with automatic cleanup between stages
- If processing fails, check Space logs for detailed error information

---

## 🧪 Local Development (Docker)

Requires an NVIDIA GPU with CUDA drivers.

```bash
git clone https://huggingface.co/spaces/MogensR/VideoBackgroundReplacer2
cd VideoBackgroundReplacer2

# Build (Ubuntu 22.04, CUDA 12.1.1; installs Torch 2.5.1+cu121)
docker build -t backgroundfx-pro .

# Run
docker run --gpus all -p 7860:7860 backgroundfx-pro
```

Access at: http://localhost:7860

---

## 🔧 Technical Details

### Pipeline Architecture
1. **Stage 1**: SAM2 generates object masks using click points
2. **Stage 2**: MatAnyone refines masks for temporal consistency  
3. **Stage 3**: Composite foreground with new background

### Model Loading
- SAM2 models downloaded from Hugging Face Hub automatically
- Supports small/base/large variants (small recommended for T4)
- MatAnyone loaded from official repository

### Performance Optimizations
- T4-specific optimizations (fp16, channels_last)
- Memory pruning during long video processing
- Automatic model unloading between stages

---

## 📝 License

MIT License - See LICENSE file for details.