Add model card and pipeline tag
Browse filesHi! I'm Niels, part of the community science team at Hugging Face.
This pull request improves the model card by adding relevant metadata and links to the research paper, project page, and code repository. It also adds installation instructions and a sample usage section for inference, along with the `image-to-3d` pipeline tag to improve the model's discoverability on the Hub.
README.md
CHANGED
|
@@ -1,3 +1,54 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
--
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
pipeline_tag: image-to-3d
|
| 4 |
+
---
|
| 5 |
+
|
| 6 |
+
# Lite3R: A Model-Agnostic Framework for Efficient Feed-Forward 3D Reconstruction
|
| 7 |
+
|
| 8 |
+
Official implementation of **Lite3R**, a model-agnostic framework for efficient feed-forward 3D reconstruction from multi-view images.
|
| 9 |
+
|
| 10 |
+
Lite3R introduces a systematic approach to compress large-scale 3D reconstruction models while maintaining reconstruction quality. The framework combines Sparse Linear Attention (SLA), FP8-Aware Quantization-Aware Training (QAT), and Partial Attention Distillation.
|
| 11 |
+
|
| 12 |
+
[**Paper**](https://huggingface.co/papers/2605.11354) | [**Website**](https://aigeeksgroup.github.io/Lite3R/) | [**Code**](https://github.com/AIGeeksGroup/Lite3R)
|
| 13 |
+
|
| 14 |
+
## Installation
|
| 15 |
+
|
| 16 |
+
```bash
|
| 17 |
+
# Clone the repository
|
| 18 |
+
git clone https://github.com/AIGeeksGroup/Lite3R.git
|
| 19 |
+
cd Lite3R
|
| 20 |
+
|
| 21 |
+
# Create conda environment
|
| 22 |
+
conda create -n lite3r python=3.10
|
| 23 |
+
conda activate lite3r
|
| 24 |
+
|
| 25 |
+
# Install dependencies
|
| 26 |
+
pip install -r requirements.txt
|
| 27 |
+
```
|
| 28 |
+
|
| 29 |
+
## Sample Usage
|
| 30 |
+
|
| 31 |
+
### Inference
|
| 32 |
+
|
| 33 |
+
After downloading the model checkpoints from this repository, you can run inference using the following command:
|
| 34 |
+
|
| 35 |
+
```bash
|
| 36 |
+
python inference.py \
|
| 37 |
+
--model vggt \
|
| 38 |
+
--checkpoint checkpoints/fp8_qat_1ep/vggt/vggt_fp8_qat_1ep.pt \
|
| 39 |
+
--input_dir examples/input \
|
| 40 |
+
--output_dir examples/output
|
| 41 |
+
```
|
| 42 |
+
|
| 43 |
+
## Citation
|
| 44 |
+
|
| 45 |
+
If you find this work useful, please cite:
|
| 46 |
+
|
| 47 |
+
```bibtex
|
| 48 |
+
@article{zhang2026lite3r,
|
| 49 |
+
title={Lite3R: A Model-Agnostic Framework for Efficient Feed-Forward 3D Reconstruction},
|
| 50 |
+
author={Zhang, Haoyu and Zhang, Zeyu and Zhou, Zedong and Zhao, Yang and Tang, Hao},
|
| 51 |
+
journal={arXiv preprint arXiv:2605.11354},
|
| 52 |
+
year={2026}
|
| 53 |
+
}
|
| 54 |
+
```
|