File size: 3,337 Bytes
42d49e0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
---
license: cc-by-nc-4.0
tags:
  - computer-vision
  - 6d-pose-estimation
  - object-detection
  - robotics
  - foundationpose
library_name: foundationpose
---

# FoundationPose Model Weights

Pre-trained weights for [FoundationPose](https://github.com/NVlabs/FoundationPose) 6D object pose estimation model.

## Model Details

- **Refiner weights:** `2023-10-28-18-33-37/model_best.pth`
- **Scorer weights:** `2024-01-11-20-02-45/model_best.pth`
- **Source:** [Official FoundationPose release](https://github.com/NVlabs/FoundationPose)
- **Paper:** [FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects (CVPR 2024)](https://arxiv.org/abs/2312.08344)

## Model Architecture

FoundationPose is a unified foundation model for 6D object pose estimation and tracking, supporting both:
- **Model-based setup**: Using CAD models
- **Model-free setup**: Using reference images (16-20 views)

## Files

```
.
├── 2023-10-28-18-33-37/
│   ├── config.yml
│   └── model_best.pth (refiner model)
└── 2024-01-11-20-02-45/
    ├── config.yml
    └── model_best.pth (scorer model)
```

## Usage

### Download Weights

```python
from huggingface_hub import snapshot_download

# Download all weights
weights_path = snapshot_download(
    repo_id="gpue/foundationpose-weights",
    local_dir="./weights"
)
```

### Use with FoundationPose Space

This model repository is designed to work with the [gpue/foundationpose](https://huggingface.co/spaces/gpue/foundationpose) Space.

Set environment variables:
```bash
FOUNDATIONPOSE_MODEL_REPO=gpue/foundationpose-weights
USE_HF_WEIGHTS=true
USE_REAL_MODEL=true
```

### Local Usage

```python
import torch
from pathlib import Path

# Load refiner
refiner_weights = torch.load("weights/2023-10-28-18-33-37/model_best.pth")

# Load scorer
scorer_weights = torch.load("weights/2024-01-11-20-02-45/model_best.pth")
```

## Performance

- **Accuracy**: State-of-the-art on BOP benchmark (as of 2024/03)
- **Speed**: Real-time capable with GPU acceleration
- **Generalization**: Works on novel objects without fine-tuning

## Citation

If you use these weights, please cite:

```bibtex
@inproceedings{wen2023foundationpose,
  title={FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects},
  author={Wen, Bowen and Yang, Wei and Kautz, Jan and Birchfield, Stan},
  booktitle={CVPR},
  year={2024}
}
```

## License

These weights are from the official FoundationPose release and are subject to NVIDIA's [Source Code License](https://github.com/NVlabs/FoundationPose/blob/main/LICENSE.txt).

**Key restrictions:**
- Non-commercial use only
- No redistribution of derivative works
- Academic and research purposes

## Related Resources

- **Paper**: https://arxiv.org/abs/2312.08344
- **Code**: https://github.com/NVlabs/FoundationPose
- **Project Page**: https://nvlabs.github.io/FoundationPose/
- **Inference Space**: https://huggingface.co/spaces/gpue/foundationpose

## Model Card

**Developed by:** NVIDIA Research (Bowen Wen, Wei Yang, Jan Kautz, Stan Birchfield)

**Model type:** Transformer-based 6D pose estimator

**Training data:** Large-scale synthetic dataset

**Intended use:** 6D object pose estimation and tracking for robotics and AR/VR applications

**Out-of-scope:** Commercial deployment (due to license restrictions)