	---
	license: apache-2.0
	base_model: depth-anything/Video-Depth-Anything-Small
	tags:
	- video
	- depth-estimation
	- monocular-depth
	- safetensors
	---

	# Video Depth Anything — Small

	Mirror of [depth-anything/Video-Depth-Anything-Small](https://huggingface.co/depth-anything/Video-Depth-Anything-Small) for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA).

	## What is Video Depth Anything?

	Video Depth Anything is a state-of-the-art model for temporally consistent monocular depth estimation in videos. It extends [Depth Anything V2](https://github.com/DepthAnything/Depth-Anything-V2) with temporal modules for smooth, flicker-free depth maps across video frames.

	Key features:
	- Temporal consistency: smooth depth maps without frame-to-frame flickering
	- Multiple encoder sizes: Small (24.8M parameters), Base, and Large variants
	- Apache 2.0 license: this Small checkpoint is distributed under permissive open-source terms
	- Colormap output: depth maps can be rendered with several colormaps (inferno, magma, plasma, etc.)

	## Files

	```
	model.safetensors
	config.json
	```
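To check what `model.safetensors` contains without loading any weights, the file header can be read directly. The following is a minimal sketch using only the Python standard library. It relies on the published safetensors layout (an 8-byte little-endian header length followed by a JSON header). The function names `read_safetensors_header` and `summarize` are illustrative, and the path assumes the `--local-dir` used in the manual download command.

```python
import json
import struct
from math import prod
from pathlib import Path


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file without reading tensor data."""
    with Path(path).open("rb") as f:
        # The first 8 bytes hold the header length as an unsigned little-endian 64-bit integer.
        (header_len,) = struct.unpack("<Q", f.read(8))
        # The next header_len bytes are a UTF-8 JSON object describing each tensor.
        return json.loads(f.read(header_len).decode("utf-8"))


def summarize(header):
    """Count parameters and collect the dtypes listed in the header."""
    # "__metadata__" is an optional free-form entry, not a tensor.
    tensors = {k: v for k, v in header.items() if k != "__metadata__"}
    n_params = sum(prod(t["shape"]) for t in tensors.values())
    dtypes = sorted({t["dtype"] for t in tensors.values()})
    return n_params, dtypes


if __name__ == "__main__":
    # Assumed location: adjust to wherever model.safetensors was downloaded.
    path = Path("./video_depth_anything/model.safetensors")
    if path.exists():
        n_params, dtypes = summarize(read_safetensors_header(path))
        print(f"{n_params / 1e6:.1f}M parameters, dtypes: {dtypes}")
```

The reported parameter count and dtypes can then be compared against the figures listed in this card.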

	## Usage

	### With ComfyUI-FFMPEGA (recommended)

	1. Set `no_llm_mode` to `video_depth` on the FFMPEG Agent node.
	2. Select the encoder size (`small`, `base`, or `large`) under Advanced Options.
	3. Choose a colormap for the visualization.
	4. The model downloads automatically on first use.

	### Manual download

	```bash
	huggingface-cli download AEmotionStudio/Video-Depth-Anything-Small --local-dir ./video_depth_anything
	```

	## Available Sizes

	| Variant | Parameters | Size | Speed / quality |
	| :--- | :--- | :--- | :--- |
	| Small | 24.8M | ~102 MB | Fastest |
	| Base | 97.5M | ~390 MB | Balanced |
	| Large | 335.3M | ~670 MB | Best quality |
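As a rough sanity check on the sizes in the table, the weight footprint can be estimated from the parameter count and the bytes stored per parameter. The sketch below is plain arithmetic. It assumes 1 MB = 10^6 bytes and ignores file overhead, and the helper name `weight_size_mb` is illustrative. Which precision each checkpoint actually uses is not stated in this card.

```python
# Parameter counts from the table above, in millions.
PARAMS_M = {"Small": 24.8, "Base": 97.5, "Large": 335.3}

# Bytes per weight for two common storage precisions.
BYTES_PER_PARAM = {"float32": 4, "float16": 2}


def weight_size_mb(params_millions, dtype):
    """Approximate weight size in MB (1 MB = 10**6 bytes), ignoring file overhead."""
    # (params_millions * 1e6 params) * (bytes per param) / (1e6 bytes per MB)
    return params_millions * BYTES_PER_PARAM[dtype]


if __name__ == "__main__":
    for name, params in PARAMS_M.items():
        sizes = ", ".join(
            f"{dtype}: ~{weight_size_mb(params, dtype):.0f} MB"
            for dtype in BYTES_PER_PARAM
        )
        print(f"{name:<5} {params:>6.1f}M  {sizes}")
```

By this estimate, the Small and Base sizes are close to 32-bit storage (about 99 MB and 390 MB), while the Large figure of ~670 MB is closer to 16-bit storage (about 671 MB) than to 32-bit (about 1341 MB).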

	## License

	Apache 2.0 — see the upstream repository for full license terms.

	## Credits

	- Original model by: [Depth Anything team](https://github.com/DepthAnything/Video-Depth-Anything)
	- Paper: "Video Depth Anything: Consistent Depth Estimation for Super-Long Videos"
	- Upstream HuggingFace: [depth-anything/Video-Depth-Anything-Small](https://huggingface.co/depth-anything/Video-Depth-Anything-Small)
	- Redistributed by: [Æmotion Studio](https://huggingface.co/AEmotionStudio) for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA)