File size: 2,259 Bytes
0518863
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
---
license: apache-2.0
base_model: depth-anything/Video-Depth-Anything-Base
tags:
  - video
  - depth-estimation
  - monocular-depth
  - safetensors
---

# Video Depth Anything — Base

Mirror of [depth-anything/Video-Depth-Anything-Base](https://huggingface.co/depth-anything/Video-Depth-Anything-Base) for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA).

## What is Video Depth Anything?

Video Depth Anything is a state-of-the-art model for **temporally consistent monocular depth estimation** in videos. It extends [Depth Anything V2](https://github.com/DepthAnything/Depth-Anything-V2) with temporal modules for smooth, flicker-free depth maps across video frames.

Key features:
- **Temporal consistency** — smooth depth maps without frame-to-frame flickering
- **Multiple encoder sizes** — Small (97.5M), Base, and Large variants
- **Apache 2.0 license** — fully open source
- **Colormap output** — supports multiple colormap visualizations (inferno, magma, plasma, etc.)

## Files

```
model.safetensors
config.json
```

## Usage

### With ComfyUI-FFMPEGA (recommended)

1. Set `no_llm_mode` to `video_depth` on the FFMPEG Agent node
2. Select encoder size (`small`, `base`, `large`) under Advanced Options
3. Choose colormap for visualization
4. The model auto-downloads on first use

### Manual download

```bash
huggingface-cli download AEmotionStudio/Video-Depth-Anything-Base --local-dir ./video_depth_anything
```

## Available Sizes

| Variant | Parameters | Size | Speed |
| :--- | :--- | :--- | :--- |
| **Small** | 24.8M | ~102 MB | Fastest |
| **Base** | 97.5M | ~390 MB | Balanced |
| **Large** | 335.3M | ~670 MB | Best quality |

## License

Apache 2.0 — see the upstream repository for full license terms.

## Credits

- Original model by: [Depth Anything team](https://github.com/DepthAnything/Video-Depth-Anything)
- Paper: *"Video Depth Anything: Consistent Depth Estimation for Super-Long Videos"*
- Upstream HuggingFace: [depth-anything/Video-Depth-Anything-Base](https://huggingface.co/depth-anything/Video-Depth-Anything-Base)
- Redistributed by: [Æmotion Studio](https://huggingface.co/AEmotionStudio) for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA)