Spongebobbbbbbbb committed on
Commit 2a1e4a9 · verified · 1 Parent(s): a834971

Update README.md

  1. README.md +1 -39
README.md CHANGED
@@ -2,11 +2,8 @@
  license: apache-2.0
  tags:
  - multi-object-tracking
- - MOT
  - DETR
- - object-detection
  - computer-vision
- - pytorch
  - CVPR2026
  datasets:
  - DanceTrack
@@ -14,7 +11,6 @@ datasets:
  - BFT
  language:
  - en
- pipeline_tag: object-detection
  ---
 
  # FDTA: From Detection to Association
@@ -26,13 +22,6 @@ Official model weights for the paper **"From Detection to Association: Learning
 
  > **TL;DR.** We reveal that DETR-based end-to-end MOT suffers from overly similar object embeddings. FDTA explicitly enhances discriminativeness in this paradigm.
 
- ## Model Description
-
- FDTA is built upon Deformable DETR with a ResNet-50 backbone. It introduces:
-
- - **Spatial Adapter**: A depth-aware module that incorporates monocular depth estimation to enrich spatial understanding.
- - **Temporal Adapter**: Trajectory-level temporal modeling for robust identity association across frames.
- - **ID Decoder**: A dedicated decoder with learnable ID vocabulary to produce discriminative object embeddings for multi-object tracking.
 
  ## Available Checkpoints
 
@@ -78,36 +67,9 @@ ckpt_path = hf_hub_download(
  )
  ```
 
- Or manually download from the **Files** tab and place under `./checkpoints/`.
-
- ### 2. Inference
-
- ```shell
- accelerate launch --num_processes=4 submit_and_evaluate.py \
- --data-root /path/to/your/datasets/ \
- --inference-mode evaluate \
- --config-path ./configs/dancetrack.yaml \
- --inference-model ./checkpoints/dancetrack.pth \
- --outputs-dir ./outputs/ \
- --inference-dataset DanceTrack \
- --inference-split val
- ```
-
- > Add `--inference-dtype FP16` for faster inference with minimal performance loss.
-
  For full training and evaluation instructions, please refer to the [GitHub repository](https://github.com/Spongebobbbbbbbb/FDTA).
 
- ## Architecture Details
-
- | Component | Details |
- |-----------|---------|
- | Backbone | ResNet-50 |
- | Detector | Deformable DETR (6 encoder + 6 decoder layers) |
- | Queries | 300 |
- | Feature Dim | 256 |
- | ID Decoder Layers | 6 |
- | ID Vocabulary Size | 50 |
- | Depth Estimation | LID mode, 150 bins |
+
 
  ## Citation
 
 