nielsr HF Staff commited on
Commit
131be7e
·
verified ·
1 Parent(s): 5b6f493

Add model card

Browse files

This PR adds a comprehensive model card for the `3D-R1` model, which is presented in the paper [3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding](https://huggingface.co/papers/2507.23478).

The model card includes:
- A link to the paper.
- A link to the project page (`https://aigeeksgroup.github.io/3D-R1`).
- A link to the code repository (`https://github.com/AIGeeksGroup/3D-R1`).
- Relevant metadata: `pipeline_tag: image-text-to-text`, `library_name: transformers`, and `license: apache-2.0`.

This will help users discover and understand the model better on the Hugging Face Hub.

Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -1 +1,16 @@
 
 
 
 
 
1
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: image-text-to-text
3
+ library_name: transformers
4
+ license: apache-2.0
5
+ ---
6
 
7
+ # 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
8
+
9
+ **3D-R1** is a foundation model designed to enhance the reasoning capabilities of 3D Vision-Language Models (VLMs) for unified scene understanding. It addresses limitations in existing 3D VLMs by leveraging a high-quality synthetic dataset (Scene-30K), incorporating RLHF policies with novel reward functions (perception, semantic similarity, format), and introducing a dynamic view selection strategy. This approach aims to improve robust reasoning and generalization in 3D scene understanding.
10
+
11
+ The model was presented in the paper:
12
+ - **Paper**: [3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding](https://huggingface.co/papers/2507.23478)
13
+
14
+ For more details, visit the project page and code repository:
15
+ - **Project Page**: [https://aigeeksgroup.github.io/3D-R1](https://aigeeksgroup.github.io/3D-R1)
16
+ - **Code**: [https://github.com/AIGeeksGroup/3D-R1](https://github.com/AIGeeksGroup/3D-R1)