gpue commited on
Commit
42d49e0
·
verified ·
1 Parent(s): 281e21d

Upload folder using huggingface_hub

Browse files
2023-10-28-18-33-37/config.yml ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ lr: 0.0001
2
+ c_in: 6
3
+ zfar: 'Infinity'
4
+ debug: null
5
+ w_rot: 0.1
6
+ n_view: 1
7
+ run_id: null
8
+ use_BN: true
9
+ rot_rep: axis_angle
10
+ ckpt_dir: null
11
+ exp_name: 2023-10-28-18-33-37
12
+ save_dir: /tmp/2023-10-28-18-33-37/
13
+ loss_type: l2
14
+ optimizer: adam
15
+ trans_rep: tracknet
16
+ batch_size: 64
17
+ crop_ratio: 1.2
18
+ use_normal: false
19
+ BN_momentum: 0.1
20
+ max_num_key: null
21
+ warmup_step: -1
22
+ input_resize:
23
+ - 160
24
+ - 160
25
+ max_step_val: 1000
26
+ normal_uint8: false
27
+ vis_interval: 1000
28
+ weight_decay: 0
29
+ n_max_objects: null
30
+ normalize_xyz: true
31
+ clip_grad_norm: 'Infinity'
32
+ rot_normalizer: 0.3490658503988659
33
+ trans_normalizer:
34
+ - 0.019999999552965164
35
+ - 0.019999999552965164
36
+ - 0.05000000074505806
37
+ max_step_per_epoch: 25000
38
+ val_epoch_interval: 10
39
+ n_dataloader_workers: 60
2023-10-28-18-33-37/model_best.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:774700586ddc435d408fc01c9809c43e151232936369dfbea0f0f964ba471d60
3
+ size 68220109
2024-01-11-20-02-45/config.yml ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ lr: 0.0001
2
+ c_in: 6
3
+ zfar: 'Infinity'
4
+ debug: null
5
+ n_view: 1
6
+ run_id: 3wy8qqex
7
+ use_BN: true
8
+ exp_name: 2024-01-11-20-02-45
9
+ n_epochs: 62
10
+ save_dir: /home/bowenw/debug/2024-01-11-20-02-45/
11
+ use_mask: false
12
+ loss_type: pairwise_valid
13
+ optimizer: adam
14
+ batch_size: 64
15
+ crop_ratio: 1.1
16
+ enable_amp: true
17
+ use_normal: false
18
+ max_num_key: null
19
+ warmup_step: -1
20
+ input_resize:
21
+ - 160
22
+ - 160
23
+ max_step_val: 1000
24
+ vis_interval: 1000
25
+ weight_decay: 0
26
+ normalize_xyz: true
27
+ resume_run_id: null
28
+ clip_grad_norm: 'Infinity'
29
+ lr_epoch_decay: 500
30
+ render_backend: nvdiffrast
31
+ train_num_pair: 5
32
+ lr_decay_epochs:
33
+ - 50
34
+ n_epochs_warmup: 1
35
+ make_pair_online: false
36
+ gradient_max_norm: 'Infinity'
37
+ max_step_per_epoch: 10000
38
+ n_rendering_workers: 1
39
+ save_epoch_interval: 100
40
+ n_dataloader_workers: 100
41
+ split_objects_across_gpus: true
2024-01-11-20-02-45/model_best.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:81924d384bf5c26c646ee4783104982ae3d1e049c181c36641b6a7aeae494c26
3
+ size 190229389
README.md ADDED
@@ -0,0 +1,124 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ tags:
4
+ - computer-vision
5
+ - 6d-pose-estimation
6
+ - object-detection
7
+ - robotics
8
+ - foundationpose
9
+ library_name: foundationpose
10
+ ---
11
+
12
+ # FoundationPose Model Weights
13
+
14
+ Pre-trained weights for [FoundationPose](https://github.com/NVlabs/FoundationPose) 6D object pose estimation model.
15
+
16
+ ## Model Details
17
+
18
+ - **Refiner weights:** `2023-10-28-18-33-37/model_best.pth`
19
+ - **Scorer weights:** `2024-01-11-20-02-45/model_best.pth`
20
+ - **Source:** [Official FoundationPose release](https://github.com/NVlabs/FoundationPose)
21
+ - **Paper:** [FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects (CVPR 2024)](https://arxiv.org/abs/2312.08344)
22
+
23
+ ## Model Architecture
24
+
25
+ FoundationPose is a unified foundation model for 6D object pose estimation and tracking, supporting both:
26
+ - **Model-based setup**: Using CAD models
27
+ - **Model-free setup**: Using reference images (16-20 views)
28
+
29
+ ## Files
30
+
31
+ ```
32
+ .
33
+ ├── 2023-10-28-18-33-37/
34
+ │ ├── config.yml
35
+ │ └── model_best.pth (refiner model)
36
+ └── 2024-01-11-20-02-45/
37
+ ├── config.yml
38
+ └── model_best.pth (scorer model)
39
+ ```
40
+
41
+ ## Usage
42
+
43
+ ### Download Weights
44
+
45
+ ```python
46
+ from huggingface_hub import snapshot_download
47
+
48
+ # Download all weights
49
+ weights_path = snapshot_download(
50
+ repo_id="gpue/foundationpose-weights",
51
+ local_dir="./weights"
52
+ )
53
+ ```
54
+
55
+ ### Use with FoundationPose Space
56
+
57
+ This model repository is designed to work with the [gpue/foundationpose](https://huggingface.co/spaces/gpue/foundationpose) Space.
58
+
59
+ Set environment variables:
60
+ ```bash
61
+ FOUNDATIONPOSE_MODEL_REPO=gpue/foundationpose-weights
62
+ USE_HF_WEIGHTS=true
63
+ USE_REAL_MODEL=true
64
+ ```
65
+
66
+ ### Local Usage
67
+
68
+ ```python
69
+ import torch
70
+ from pathlib import Path
71
+
72
+ # Load refiner
73
+ refiner_weights = torch.load("weights/2023-10-28-18-33-37/model_best.pth")
74
+
75
+ # Load scorer
76
+ scorer_weights = torch.load("weights/2024-01-11-20-02-45/model_best.pth")
77
+ ```
78
+
79
+ ## Performance
80
+
81
+ - **Accuracy**: State-of-the-art on BOP benchmark (as of 2024/03)
82
+ - **Speed**: Real-time capable with GPU acceleration
83
+ - **Generalization**: Works on novel objects without fine-tuning
84
+
85
+ ## Citation
86
+
87
+ If you use these weights, please cite:
88
+
89
+ ```bibtex
90
+ @inproceedings{wen2023foundationpose,
91
+ title={FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects},
92
+ author={Wen, Bowen and Yang, Wei and Kautz, Jan and Birchfield, Stan},
93
+ booktitle={CVPR},
94
+ year={2024}
95
+ }
96
+ ```
97
+
98
+ ## License
99
+
100
+ These weights are from the official FoundationPose release and are subject to NVIDIA's [Source Code License](https://github.com/NVlabs/FoundationPose/blob/main/LICENSE.txt).
101
+
102
+ **Key restrictions:**
103
+ - Non-commercial use only
104
+ - No redistribution of derivative works
105
+ - Academic and research purposes
106
+
107
+ ## Related Resources
108
+
109
+ - **Paper**: https://arxiv.org/abs/2312.08344
110
+ - **Code**: https://github.com/NVlabs/FoundationPose
111
+ - **Project Page**: https://nvlabs.github.io/FoundationPose/
112
+ - **Inference Space**: https://huggingface.co/spaces/gpue/foundationpose
113
+
114
+ ## Model Card
115
+
116
+ **Developed by:** NVIDIA Research (Bowen Wen, Wei Yang, Jan Kautz, Stan Birchfield)
117
+
118
+ **Model type:** Transformer-based 6D pose estimator
119
+
120
+ **Training data:** Large-scale synthetic dataset
121
+
122
+ **Intended use:** 6D object pose estimation and tracking for robotics and AR/VR applications
123
+
124
+ **Out-of-scope:** Commercial deployment (due to license restrictions)