fzzsl
/

CAPruner

Graph Machine Learning

Model card Files Files and versions

xet

Community

Add pipeline tag, paper/code links and citation

by nielsr HF Staff - opened 18 days ago

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+20

-6

Files changed (1) hide show

README.md +20 -6

README.md CHANGED Viewed

@@ -1,13 +1,17 @@
 ---
 license: cc-by-sa-4.0
 ---
 <h1 align="center">CAPruner</h1>
 <p align="center">
-    <!-- <a href='http://arxiv.org/abs/'>
       <img src='https://img.shields.io/badge/Paper-arXiv-red?style=plastic&logo=arXiv&logoColor=red' alt='Paper arXiv'>
-    </a> -->
     <a href='https://huggingface.co/fzzsl/CAPruner/tree/main'>
       <img src='https://img.shields.io/badge/Checkpoints-HF-yellow?style=plastic&logo=huggingface&logoColor=yellow' alt='Checkpoints'>
     </a>
@@ -20,11 +24,13 @@ This repository contains the official checkpoints and inference results for the
 ## Overview
-Large language models (LLMs) have recently been applied to 3D vision-language (3D-VL) tasks, in which spatial reasoning is required to identify target objects based on their positions relative to others (i.e., anchors). To facilitate effective scene layout understanding, scene graphs are commonly used to represent such spatial relations. However, reasoning over full graphs incurs high token costs and computational inefficiencies, motivating the use of scene graph pruning. Existing pruning methods predominantly rely on spatial proximity and often remove task-relevant relations, thereby undermining reliable spatial reasoning. To address these limitations, we derive a key requirement for scene graph pruning: preserving the spatial relations that are most relevant to the specific 3D-VL task. Guided by this insight, we propose the Conceptual-Adjacent Scene Graph Pruner (CAPruner). CAPruner integrates fuzzy semantic relevance with spatial proximity to estimate relation importance, enabling the selection of critical relations in a task-specific context. Moreover, to avoid costly relation-level annotations, CAPruner is trained by supervising the aggregated scores of each node's incident edges. Extensive experiments demonstrate that CAPruner effectively preserves relations essential for spatial reasoning, leading to substantial performance improvements of LLMs on 3D-VL tasks.
 ## Usage
-Please refer to the official [GitHub repository](https://github.com/fz-zsl/CAPruner) for installation and inference instructions.
 ## Acknowledgement
@@ -35,5 +41,13 @@ We would like to thank the anonymous reviewers for their constructive feedback.
 If you find this project useful in your research, please consider citing:
 ```bibtex
-```

 ---
 license: cc-by-sa-4.0
+pipeline_tag: graph-ml
 ---
 <h1 align="center">CAPruner</h1>
 <p align="center">
+    <a href='https://arxiv.org/abs/2606.07529'>
       <img src='https://img.shields.io/badge/Paper-arXiv-red?style=plastic&logo=arXiv&logoColor=red' alt='Paper arXiv'>
+    </a>
+    <a href='https://github.com/fz-zsl/CAPruner'>
+      <img src='https://img.shields.io/badge/Code-GitHub-blue?style=plastic&logo=github&logoColor=white' alt='Code GitHub'>
+    </a>
     <a href='https://huggingface.co/fzzsl/CAPruner/tree/main'>
       <img src='https://img.shields.io/badge/Checkpoints-HF-yellow?style=plastic&logo=huggingface&logoColor=yellow' alt='Checkpoints'>
     </a>
 ## Overview
+Large language models (LLMs) have recently been applied to 3D vision-language (3D-VL) tasks, in which spatial reasoning is required to identify target objects based on their positions relative to others (i.e., anchors). To facilitate effective scene layout understanding, scene graphs are commonly used to represent such spatial relations. However, reasoning over full graphs incurs high token costs and computational inefficiencies, motivating the use of scene graph pruning.
+Conceptual-Adjacent Scene Graph Pruner (**CAPruner**) integrates fuzzy semantic relevance with spatial proximity to estimate relation importance, enabling the selection of critical relations in a task-specific context. Moreover, to avoid costly relation-level annotations, CAPruner is trained by supervising the aggregated scores of each node's incident edges. Extensive experiments demonstrate that CAPruner effectively preserves relations essential for spatial reasoning, leading to substantial performance improvements of LLMs on 3D-VL tasks.
 ## Usage
+Please refer to the official [GitHub repository](https://github.com/fz-zsl/CAPruner) for environment installation, data preparation, training, and inference instructions.
 ## Acknowledgement
 If you find this project useful in your research, please consider citing:
 ```bibtex
+@misc{zhou2026caprunerconceptualadjacentscenegraph,
+      title={CAPruner: Conceptual-Adjacent Scene Graph Pruner for Enhancing 3D Spatial Reasoning of Large Language Models},
+      author={Shengli Zhou and Xiangchen Wang and Guanhua Chen and Feng Zheng},
+      year={2026},
+      eprint={2606.07529},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2606.07529},
+}
+```