Add pipeline tag, paper/code links and citation
#1
by nielsr HF Staff - opened
README.md
CHANGED
````diff
@@ -1,13 +1,17 @@
 ---
 license: cc-by-sa-4.0
+pipeline_tag: graph-ml
 ---
 
 <h1 align="center">CAPruner</h1>
 
 <p align="center">
-<
+<a href='https://arxiv.org/abs/2606.07529'>
 <img src='https://img.shields.io/badge/Paper-arXiv-red?style=plastic&logo=arXiv&logoColor=red' alt='Paper arXiv'>
-</a>
+</a>
+<a href='https://github.com/fz-zsl/CAPruner'>
+<img src='https://img.shields.io/badge/Code-GitHub-blue?style=plastic&logo=github&logoColor=white' alt='Code GitHub'>
+</a>
 <a href='https://huggingface.co/fzzsl/CAPruner/tree/main'>
 <img src='https://img.shields.io/badge/Checkpoints-HF-yellow?style=plastic&logo=huggingface&logoColor=yellow' alt='Checkpoints'>
 </a>
@@ -20,11 +24,13 @@ This repository contains the official checkpoints and inference results for the
 
 ## Overview
 
-Large language models (LLMs) have recently been applied to 3D vision-language (3D-VL) tasks, in which spatial reasoning is required to identify target objects based on their positions relative to others (i.e., anchors). To facilitate effective scene layout understanding, scene graphs are commonly used to represent such spatial relations. However, reasoning over full graphs incurs high token costs and computational inefficiencies, motivating the use of scene graph pruning.
+Large language models (LLMs) have recently been applied to 3D vision-language (3D-VL) tasks, in which spatial reasoning is required to identify target objects based on their positions relative to others (i.e., anchors). To facilitate effective scene layout understanding, scene graphs are commonly used to represent such spatial relations. However, reasoning over full graphs incurs high token costs and computational inefficiencies, motivating the use of scene graph pruning.
+
+Conceptual-Adjacent Scene Graph Pruner (**CAPruner**) integrates fuzzy semantic relevance with spatial proximity to estimate relation importance, enabling the selection of critical relations in a task-specific context. Moreover, to avoid costly relation-level annotations, CAPruner is trained by supervising the aggregated scores of each node's incident edges. Extensive experiments demonstrate that CAPruner effectively preserves relations essential for spatial reasoning, leading to substantial performance improvements of LLMs on 3D-VL tasks.
 
 ## Usage
 
-Please refer to the official [GitHub repository](https://github.com/fz-zsl/CAPruner) for installation and inference instructions.
+Please refer to the official [GitHub repository](https://github.com/fz-zsl/CAPruner) for environment installation, data preparation, training, and inference instructions.
 
 ## Acknowledgement
 
@@ -35,5 +41,13 @@ We would like to thank the anonymous reviewers for their constructive feedback.
 If you find this project useful in your research, please consider citing:
 
 ```bibtex
-
-
+@misc{zhou2026caprunerconceptualadjacentscenegraph,
+title={CAPruner: Conceptual-Adjacent Scene Graph Pruner for Enhancing 3D Spatial Reasoning of Large Language Models},
+author={Shengli Zhou and Xiangchen Wang and Guanhua Chen and Feng Zheng},
+year={2026},
+eprint={2606.07529},
+archivePrefix={arXiv},
+primaryClass={cs.CL},
+url={https://arxiv.org/abs/2606.07529},
+}
+```
````
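The Overview text in this change describes CAPruner as blending semantic relevance with spatial proximity to score relations, and as training on aggregated scores of each node's incident edges instead of per-relation labels. The sketch below is only a toy illustration of that idea. The scoring formula, the `alpha` and `tau` parameters, the squared-error loss, and all function names are assumptions made for this example; none of them come from the paper or the repository.

```python
import math
from collections import defaultdict


def edge_score(semantic, distance, alpha=0.5, tau=1.0):
    """Blend a semantic relevance value in [0, 1] with a proximity term.

    Hypothetical formula: proximity decays exponentially with distance.
    """
    proximity = math.exp(-distance / tau)
    return alpha * semantic + (1.0 - alpha) * proximity


def prune_top_k(edges, k, **kwargs):
    """Keep the k highest-scoring edges; each edge is (u, v, semantic, distance)."""
    scored = [(edge_score(s, d, **kwargs), u, v) for u, v, s, d in edges]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(u, v) for _, u, v in scored[:k]]


def node_aggregates(edges, **kwargs):
    """Sum edge scores over each node's incident edges."""
    totals = defaultdict(float)
    for u, v, s, d in edges:
        score = edge_score(s, d, **kwargs)
        totals[u] += score
        totals[v] += score
    return dict(totals)


def node_level_loss(edges, node_targets, **kwargs):
    """Mean squared error between aggregated node scores and node-level targets."""
    totals = node_aggregates(edges, **kwargs)
    errors = [(totals.get(n, 0.0) - t) ** 2 for n, t in node_targets.items()]
    return sum(errors) / len(errors)


# Toy scene graph: (object, object, semantic relevance, distance in metres).
edges = [
    ("chair", "table", 0.9, 0.5),
    ("chair", "lamp", 0.2, 3.0),
    ("table", "window", 0.4, 4.0),
]
kept = prune_top_k(edges, k=1)
```

Here `kept` holds the single relation that is both semantically relevant and spatially close, while `node_level_loss` shows how supervision could be attached to nodes only, so no individual relation needs a label. For the actual model and training procedure, see the linked GitHub repository.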