Spaces:
Running
on
Zero
Running
on
Zero
Delete DA-2K.md
Browse files
DA-2K.md
DELETED
|
@@ -1,51 +0,0 @@
|
|
| 1 |
-
# DA-2K Evaluation Benchmark
|
| 2 |
-
|
| 3 |
-
## Introduction
|
| 4 |
-
|
| 5 |
-

|
| 6 |
-
|
| 7 |
-
DA-2K is proposed in [Depth Anything V2](https://depth-anything-v2.github.io) to evaluate the relative depth estimation capability. It encompasses eight representative scenarios of `indoor`, `outdoor`, `non_real`, `transparent_reflective`, `adverse_style`, `aerial`, `underwater`, and `object`. It consists of 1K diverse high-quality images and 2K precise pair-wise relative depth annotations.
|
| 8 |
-
|
| 9 |
-
Please refer to our [paper](https://arxiv.org/abs/2406.09414) for details in constructing this benchmark.
|
| 10 |
-
|
| 11 |
-
|
| 12 |
-
## Usage
|
| 13 |
-
|
| 14 |
-
Please first [download the benchmark](https://huggingface.co/datasets/depth-anything/DA-2K/tree/main).
|
| 15 |
-
|
| 16 |
-
All annotations are stored in `annotations.json`. The annotation file is a JSON object where each key is the path to an image file, and the value is a list of annotations associated with that image. Each annotation describes two points and identifies which point is closer to the camera. The structure is detailed below:
|
| 17 |
-
|
| 18 |
-
```
|
| 19 |
-
{
|
| 20 |
-
"image_path": [
|
| 21 |
-
{
|
| 22 |
-
"point1": [h1, w1], # (vertical position, horizontal position)
|
| 23 |
-
"point2": [h2, w2], # (vertical position, horizontal position)
|
| 24 |
-
"closer_point": "point1" # we always set "point1" as the closer one
|
| 25 |
-
},
|
| 26 |
-
...
|
| 27 |
-
],
|
| 28 |
-
...
|
| 29 |
-
}
|
| 30 |
-
```
|
| 31 |
-
|
| 32 |
-
To visualize the annotations:
|
| 33 |
-
```bash
|
| 34 |
-
python visualize.py [--scene-type <type>]
|
| 35 |
-
```
|
| 36 |
-
|
| 37 |
-
**Options**
|
| 38 |
-
- `--scene-type <type>` (optional): Specify the scene type (`indoor`, `outdoor`, `non_real`, `transparent_reflective`, `adverse_style`, `aerial`, `underwater`, and `object`). Skip this argument or set <type> as `""` to include all scene types.
|
| 39 |
-
|
| 40 |
-
## Citation
|
| 41 |
-
|
| 42 |
-
If you find this benchmark useful, please consider citing:
|
| 43 |
-
|
| 44 |
-
```bibtex
|
| 45 |
-
@article{depth_anything_v2,
|
| 46 |
-
title={Depth Anything V2},
|
| 47 |
-
author={Yang, Lihe and Kang, Bingyi and Huang, Zilong and Zhao, Zhen and Xu, Xiaogang and Feng, Jiashi and Zhao, Hengshuang},
|
| 48 |
-
journal={arXiv:2406.09414},
|
| 49 |
-
year={2024}
|
| 50 |
-
}
|
| 51 |
-
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|