nielsr (HF Staff) committed
Commit ea17644 (verified)
Parent(s): e08cab6

Update pipeline tag to `image-to-3d` and correct paper links


This PR improves the model card by:
- Updating the `pipeline_tag` from `depth-estimation` to `image-to-3d`. This better reflects the model's full capabilities, including 3D reconstruction, pose estimation, and spatially consistent geometry prediction, and improves discoverability on the Hugging Face Hub.
- Replacing the placeholder arXiv links throughout the model card (the paper badge, the "Performance" section, the BibTeX citation, and the "Links" section) with `https://arxiv.org/abs/2511.10647`.
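For reference, a sketch of the model card's YAML front matter after this PR, assembled from the changes in the diff (key order as it appears there):

```yaml
---
library_name: depth-anything-3
license: apache-2.0
pipeline_tag: image-to-3d
tags:
- depth-estimation
- computer-vision
- monocular-depth
- multi-view-geometry
- pose-estimation
---
```

The `pipeline_tag` value determines which task page lists the model on the Hub, which is why the switch to `image-to-3d` affects discoverability.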

Files changed (1): README.md (+7 −7)
README.md CHANGED

@@ -1,13 +1,13 @@
 ---
+library_name: depth-anything-3
 license: apache-2.0
+pipeline_tag: image-to-3d
 tags:
 - depth-estimation
 - computer-vision
 - monocular-depth
 - multi-view-geometry
 - pose-estimation
-library_name: depth-anything-3
-pipeline_tag: depth-estimation
 ---
 
 # Depth Anything 3: DA3-SMALL
@@ -15,7 +15,7 @@ pipeline_tag: depth-estimation
 <div align="center">
 
 [![Project Page](https://img.shields.io/badge/Project_Page-Depth_Anything_3-green)](https://depth-anything-3.github.io)
-[![Paper](https://img.shields.io/badge/arXiv-Depth_Anything_3-red)](https://arxiv.org/abs/)
+[![Paper](https://img.shields.io/badge/arXiv-Depth_Anything_3-red)](https://arxiv.org/abs/2511.10647)
 [![Demo](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Demo-blue)](https://huggingface.co/spaces/depth-anything/Depth-Anything-3) # noqa: E501
 <!-- Benchmark badge removed as per request -->
 
@@ -108,7 +108,7 @@ da3 auto path/to/images --export-format glb --use-backend
 - **Depth Anything 2** for monocular depth estimation
 - **VGGT** for multi-view depth estimation and pose estimation
 
-For detailed benchmarks, please refer to our [paper](https://depth-anything-3.github.io). # noqa: E501
+For detailed benchmarks, please refer to our [paper](https://arxiv.org/abs/2511.10647). # noqa: E501
 
 ## Limitations
 
@@ -124,7 +124,7 @@ If you find Depth Anything 3 useful in your research or projects, please cite:
 @article{depthanything3,
   title={Depth Anything 3: Recovering the visual space from any views},
   author={Haotong Lin and Sili Chen and Jun Hao Liew and Donny Y. Chen and Zhenyu Li and Guang Shi and Jiashi Feng and Bingyi Kang}, # noqa: E501
-  journal={arXiv preprint arXiv:XXXX.XXXXX},
+  journal={arXiv preprint arXiv:2511.10647},
   year={2025}
 }
 ```
@@ -132,11 +132,11 @@ If you find Depth Anything 3 useful in your research or projects, please cite:
 ## Links
 
 - 🏠 [Project Page](https://depth-anything-3.github.io)
-- 📄 [Paper](https://arxiv.org/abs/)
+- 📄 [Paper](https://arxiv.org/abs/2511.10647)
 - 💻 [GitHub Repository](https://github.com/ByteDance-Seed/depth-anything-3)
 - 🤗 [Hugging Face Demo](https://huggingface.co/spaces/depth-anything/Depth-Anything-3)
 - 📚 [Documentation](https://github.com/ByteDance-Seed/depth-anything-3#-useful-documentation)
 
 ## Authors
 
-[Haotong Lin](https://haotongl.github.io/) · [Sili Chen](https://github.com/SiliChen321) · [Junhao Liew](https://liewjunhao.github.io/) · [Donny Y. Chen](https://donydchen.github.io) · [Zhenyu Li](https://zhyever.github.io/) · [Guang Shi](https://scholar.google.com/citations?user=MjXxWbUAAAAJ&hl=en) · [Jiashi Feng](https://scholar.google.com.sg/citations?user=Q8iay0gAAAAJ&hl=en) · [Bingyi Kang](https://bingykang.github.io/) # noqa: E501
+[Haotong Lin](https://haotongl.github.io/) · [Sili Chen](https://github.com/SiliChen321) · [Junhao Liew](https://liewjunhao.github.io/) · [Donny Y. Chen](https://donydchen.github.io) · [Zhenyu Li](https://zhyever.github.io/) · [Guang Shi](https://scholar.google.com/citations?user=MjXxWbUAAAAJ&hl=en) · [Jiashi Feng](https://scholar.google.com.sg/citations?user=Q8iay0gAAAAJ&hl=en) · [Bingyi Kang](https://bingykang.github.io/) # noqa: E501