Spaces:
Running
on
Zero
Running
on
Zero
A newer version of the Gradio SDK is available:
6.1.0
Depth Anything 3 AnySize
π Key Modifications from the Original Repo
- Native-Resolution Inputs: Images are now processed at their original resolution by default. During inference, inputs are padded to the ViT patch size, and outputs (depth/confidence/sky maps and processed images) are cropped back to the source height and width. Using larger inputs now will increase memory and compute requirements.
- Updated Defaults: The CLI defaults to
--process-res None --process-res-method keep, and the API usesprocess_res=None, process_res_method="keep". Seedocs/CLI.mdanddocs/API.mdfor details. - Optional Downscaling: For faster inference and lower memory usage, set
process_res(e.g.,720) with a resize strategy like--process-res-method upper_bound_resize. - Original Baseline: Previously, images were resized to 504 px on the long side.
- Implementation Details: Input padding is handled in
src/depth_anything_3/utils/io/input_processor.py, and output cropping is managed insrc/depth_anything_3/api.py.