kyujinpy commited on
Commit
21b45c6
·
verified ·
1 Parent(s): 000e251

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -35
README.md CHANGED
@@ -1,36 +1,36 @@
1
- ---
2
- license: apache-2.0
3
- ---
4
-
5
- ![img1](./img/overview_paper.jpg)
6
- # GeoCAD-LLM: CAD Sequence Generation via Multimodal LLMs with Equivariant Geometric Features🛠️
7
- - [Github](https://github.com/kshsh0405/GeoCADLLM_Inference)
8
- - [Paper](comming_soon)
9
-
10
- ## GeoCAD-LLM_4B🛠️
11
- - Base_model: [Qwen3-4B-Instruct](Qwen/Qwen3-4B-Instruct-2507)
12
- - Max sequence length: 8,192
13
- - Epoch: 2
14
- - Learning rate: 1e-4
15
- - Batch size: 128
16
- - This model specialized for text-to-CAD. However, it also supports multi-modality.
17
-
18
- ## GeoCAD-LLM Contributions🔥
19
- ![img2](./img/model_structure.jpg)
20
-
21
- - **State-of-the-art Performace🏆** in [Text2CAD](https://huggingface.co/datasets/SadilKhan/Text2CAD) datasets. (as shown in below Tables)
22
- - **Multimodal CAD Generation🌐**: Both text-to-CAD and pc-text-to-CAD.
23
- - GeoCAD-LLM directly generate CAD vector sequence as **natural language**.
24
- - **Novel Two Stage Training Pipeline🧭**: In stage1, training semantic geometry alignment. In stage2, training fine-grained geometry. Especially, we **direct levearge E(3)-equivariant features** for geomtry-consistent supervision, inherently ensuring geometric feature consistency regardless of input orientation.
25
- - **Apply Point Cloud Dropout (PCD) technique🧶**: PCD mitigates over-reliance on geometric inputs and improves multimodal generalization. Also, it is a critical training technique for multimodal CAD generation.
26
-
27
- ## Performace (text-to-CAD)🔥
28
- ![img3](./img/performance.jpg)
29
-
30
- ## Qualitative Results
31
- Please check our paper and supplementary materials.🤗
32
-
33
- ## Bibtex🤗
34
- ```
35
- (TODO)
36
  ```
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ ![img1](./img/overview_paper.jpg)
6
+ # GeoCAD-LLM: CAD Sequence Generation via Multimodal LLMs with Equivariant Geometric Features🛠️
7
+ - [Github](https://github.com/kshsh0405/GeoCADLLM_Inference)
8
+ - [Paper](comming_soon)
9
+
10
+ ## GeoCAD-LLM_4B🛠️
11
+ - Base_model: [Qwen3-4B-Instruct](Qwen/Qwen3-4B-Instruct-2507)
12
+ - Max sequence length: 8,192
13
+ - Epoch: 2
14
+ - Learning rate: 1e-4
15
+ - Batch size: 128
16
+ - This model specialized for text-to-CAD. However, it also supports multi-modality.
17
+
18
+ ## GeoCAD-LLM Contributions🔥
19
+ ![img2](./img/model_structure.jpg)
20
+
21
+ - **State-of-the-art Performace🏆** in [Text2CAD](https://huggingface.co/datasets/SadilKhan/Text2CAD) datasets. (as shown in below Tables)
22
+ - **Multimodal CAD Generation🌐**: Both text-to-CAD and pc-text-to-CAD.
23
+ - GeoCAD-LLM directly generate CAD vector sequence as **natural language**.
24
+ - **Novel Two Stage Training Pipeline🧭**: In stage1, training semantic geometry alignment. In stage2, training fine-grained geometry. Especially, we **direct levearge E(3)-equivariant features** for geomtry-consistent supervision, inherently ensuring geometric feature consistency regardless of input orientation.
25
+ - **Apply Point Cloud Dropout (PCD) technique🧶**: PCD mitigates over-reliance on geometric inputs and improves multimodal generalization. Also, it is a critical training technique for multimodal CAD generation.
26
+
27
+ ## Performace (text-to-CAD & pc-text-to-CAD)🔥
28
+ ![img3](./img/performance.jpg)
29
+
30
+ ## Qualitative Results
31
+ Please check our paper and supplementary materials.🤗
32
+
33
+ ## Bibtex🤗
34
+ ```
35
+ (TODO)
36
  ```