Gabriele committed on
Commit · 64789f3
1 Parent(s): 1241b68
Update README
README.md
CHANGED
@@ -12,7 +12,7 @@ pipeline_tag: depth-estimation
 
 # TIPSv2 — L/14 DPT Heads
 
-DPT (Dense Prediction Transformer) heads for depth estimation, surface normal prediction, and semantic segmentation
+DPT (Dense Prediction Transformer) heads for depth estimation, surface normal prediction, and semantic segmentation on top of the frozen [TIPSv2 L/14](https://huggingface.co/google/tipsv2-l14) backbone. The backbone is loaded automatically. The depth and normals heads are trained on the NYU Depth V2 dataset and segmentation is trained on the ADE20K dataset (150 classes).
 
 ## Usage
 
@@ -47,6 +47,7 @@ seg = model.predict_segmentation(pixel_values)
 
 - **Backbone**: [TIPSv2 L/14](google/tipsv2-l14) (loaded automatically)
 - **Heads**: ~102M total params (depth + normals + segmentation)
+- **Depth & normals**: NYU Depth V2
 - **Segmentation**: ADE20K, 150 classes
 - **Input**: images in `[0, 1]` range, any resolution (multiples of 14 recommended)
 
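The **Input** line in the README asks for pixel values in `[0, 1]` and recommends side lengths that are multiples of 14. The sketch below is one possible way to meet both conditions with NumPy alone. The helper names (`crop_to_patch_multiple`, `to_unit_range`, `prepare`) are hypothetical and not part of the model's API. Center-cropping instead of resizing, and the `1 x C x H x W` batch layout, are assumptions made for illustration, since the diff does not state what layout `pixel_values` should have.

```python
import numpy as np

PATCH = 14  # patch size suggested by the "L/14" backbone name


def crop_to_patch_multiple(image: np.ndarray, patch: int = PATCH) -> np.ndarray:
    """Center-crop an H x W x C array so that H and W are multiples of `patch`."""
    h, w = image.shape[:2]
    new_h, new_w = (h // patch) * patch, (w // patch) * patch
    if new_h == 0 or new_w == 0:
        raise ValueError(f"image of size {h}x{w} is smaller than one {patch}px patch")
    top, left = (h - new_h) // 2, (w - new_w) // 2
    return image[top:top + new_h, left:left + new_w]


def to_unit_range(image_uint8: np.ndarray) -> np.ndarray:
    """Scale 8-bit pixel values to float32 in the [0, 1] range."""
    return image_uint8.astype(np.float32) / 255.0


def prepare(image_uint8: np.ndarray) -> np.ndarray:
    """Crop, rescale, and reorder to an assumed 1 x C x H x W batch layout."""
    cropped = crop_to_patch_multiple(image_uint8)
    chw = to_unit_range(cropped).transpose(2, 0, 1)  # H x W x C -> C x H x W
    return chw[None]  # add a leading batch dimension


if __name__ == "__main__":
    # Synthetic 500 x 375 RGB image standing in for a real photo.
    rng = np.random.default_rng(0)
    fake_image = rng.integers(0, 256, size=(500, 375, 3), dtype=np.uint8)
    batch = prepare(fake_image)
    print(batch.shape, batch.dtype, float(batch.min()), float(batch.max()))
```

The resulting array would still need to be converted to whatever tensor type the model expects before being passed as `pixel_values`.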