Depth Estimation
PyTorch
deltatok
cvpr2026-highlight

Add pipeline tag and links to paper/code

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +18 -7
README.md CHANGED
@@ -1,19 +1,30 @@
  ---
+ datasets:
+ - kitti
  library_name: pytorch
- tags:
- - deltatok
  license: apache-2.0
- datasets:
- - kitti
+ pipeline_tag: depth-estimation
+ tags:
+ - deltatok
  ---

  # Depth Head — KITTI

- Monocular depth estimation head trained on KITTI. Requires a frozen [DINOv3](https://github.com/facebookresearch/dinov3) ViT-B backbone (not included).
+ Monocular depth estimation head trained on KITTI, as presented in [A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens](https://huggingface.co/papers/2604.04913).
+
+ Requires a frozen [DINOv3](https://github.com/facebookresearch/dinov3) ViT-B backbone (not included).
+
+ - **Project Page:** [https://deltatok.github.io](https://deltatok.github.io)
+ - **Code:** [https://github.com/amazon-far/deltatok](https://github.com/amazon-far/deltatok)

  ## Usage

- See the [DeltaTok GitHub repository](https://github.com/amazon-far/deltatok) for training and evaluation code.
+ See the [DeltaTok GitHub repository](https://github.com/amazon-far/deltatok) for training and evaluation code. Evaluation typically involves using the `main.py` script provided in the repository:
+
+ ```bash
+ python main.py validate -c configs/deltatok_vitb_dinov3_vitb_kinetics.yaml \
+ --model.ckpt_path=path/to/deltatok-kinetics/pytorch_model.bin
+ ```

  ## Acknowledgements

@@ -29,4 +40,4 @@ See the [DeltaTok GitHub repository](https://github.com/amazon-far/deltatok) for
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2026}
  }
- ```
+ ```
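
To check the metadata this change introduces, a reviewer could run something like the sketch below. It is illustrative only: `read_front_matter` is a hypothetical helper, not part of the DeltaTok repository or of any Hugging Face library, and it handles just the flat `key: value` pairs and simple `- item` lists seen in this diff, not general YAML.

```python
def read_front_matter(readme_text):
    """Parse flat `key: value` pairs and `- item` lists between the `---` lines.

    Hypothetical helper for illustration; not a general YAML parser.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file

    meta = {}
    current_list_key = None  # key whose list items we are currently collecting
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the block
            break
        if not stripped:
            continue
        if stripped.startswith("- ") and current_list_key is not None:
            meta[current_list_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                meta[key] = value
                current_list_key = None
            else:
                meta[key] = []  # a list follows on the next lines
                current_list_key = key
    return meta


# Front matter copied from the "+" side of this diff.
NEW_README = """---
datasets:
- kitti
library_name: pytorch
license: apache-2.0
pipeline_tag: depth-estimation
tags:
- deltatok
---
"""

meta = read_front_matter(NEW_README)
print(meta["pipeline_tag"])
print(meta["datasets"], meta["tags"])
```

For real model cards, a full YAML parser is the safer choice; the hand-rolled parser here only keeps the sketch free of third-party dependencies.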