Update README.md
Browse files
README.md
CHANGED
|
@@ -1,36 +1,36 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
---
|
| 4 |
-
|
| 5 |
-

|
| 6 |
-
# GeoCAD-LLM: CAD Sequence Generation via Multimodal LLMs with Equivariant Geometric Features🛠️
|
| 7 |
-
- [Github](https://github.com/kshsh0405/GeoCADLLM_Inference)
|
| 8 |
-
- [Paper](comming_soon)
|
| 9 |
-
|
| 10 |
-
## GeoCAD-LLM_4B🛠️
|
| 11 |
-
- Base_model: [Qwen3-4B-Instruct](Qwen/Qwen3-4B-Instruct-2507)
|
| 12 |
-
- Max sequence length: 8,192
|
| 13 |
-
- Epoch: 2
|
| 14 |
-
- Learning rate: 1e-4
|
| 15 |
-
- Batch size: 128
|
| 16 |
-
- This model specialized for text-to-CAD. However, it also supports multi-modality.
|
| 17 |
-
|
| 18 |
-
## GeoCAD-LLM Contributions🔥
|
| 19 |
-

|
| 20 |
-
|
| 21 |
-
- **State-of-the-art Performace🏆** in [Text2CAD](https://huggingface.co/datasets/SadilKhan/Text2CAD) datasets. (as shown in below Tables)
|
| 22 |
-
- **Multimodal CAD Generation🌐**: Both text-to-CAD and pc-text-to-CAD.
|
| 23 |
-
- GeoCAD-LLM directly generate CAD vector sequence as **natural language**.
|
| 24 |
-
- **Novel Two Stage Training Pipeline🧭**: In stage1, training semantic geometry alignment. In stage2, training fine-grained geometry. Especially, we **direct levearge E(3)-equivariant features** for geomtry-consistent supervision, inherently ensuring geometric feature consistency regardless of input orientation.
|
| 25 |
-
- **Apply Point Cloud Dropout (PCD) technique🧶**: PCD mitigates over-reliance on geometric inputs and improves multimodal generalization. Also, it is a critical training technique for multimodal CAD generation.
|
| 26 |
-
|
| 27 |
-
## Performace (text-to-CAD)🔥
|
| 28 |
-

|
| 29 |
-
|
| 30 |
-
## Qualitative Results
|
| 31 |
-
Please check our paper and supplementary materials.🤗
|
| 32 |
-
|
| 33 |
-
## Bibtex🤗
|
| 34 |
-
```
|
| 35 |
-
(TODO)
|
| 36 |
```
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+

|
| 6 |
+
# GeoCAD-LLM: CAD Sequence Generation via Multimodal LLMs with Equivariant Geometric Features🛠️
|
| 7 |
+
- [Github](https://github.com/kshsh0405/GeoCADLLM_Inference)
|
| 8 |
+
- [Paper](comming_soon)
|
| 9 |
+
|
| 10 |
+
## GeoCAD-LLM_4B🛠️
|
| 11 |
+
- Base_model: [Qwen3-4B-Instruct](Qwen/Qwen3-4B-Instruct-2507)
|
| 12 |
+
- Max sequence length: 8,192
|
| 13 |
+
- Epoch: 2
|
| 14 |
+
- Learning rate: 1e-4
|
| 15 |
+
- Batch size: 128
|
| 16 |
+
- This model specialized for text-to-CAD. However, it also supports multi-modality.
|
| 17 |
+
|
| 18 |
+
## GeoCAD-LLM Contributions🔥
|
| 19 |
+

|
| 20 |
+
|
| 21 |
+
- **State-of-the-art Performace🏆** in [Text2CAD](https://huggingface.co/datasets/SadilKhan/Text2CAD) datasets. (as shown in below Tables)
|
| 22 |
+
- **Multimodal CAD Generation🌐**: Both text-to-CAD and pc-text-to-CAD.
|
| 23 |
+
- GeoCAD-LLM directly generate CAD vector sequence as **natural language**.
|
| 24 |
+
- **Novel Two Stage Training Pipeline🧭**: In stage1, training semantic geometry alignment. In stage2, training fine-grained geometry. Especially, we **direct levearge E(3)-equivariant features** for geomtry-consistent supervision, inherently ensuring geometric feature consistency regardless of input orientation.
|
| 25 |
+
- **Apply Point Cloud Dropout (PCD) technique🧶**: PCD mitigates over-reliance on geometric inputs and improves multimodal generalization. Also, it is a critical training technique for multimodal CAD generation.
|
| 26 |
+
|
| 27 |
+
## Performace (text-to-CAD & pc-text-to-CAD)🔥
|
| 28 |
+

|
| 29 |
+
|
| 30 |
+
## Qualitative Results
|
| 31 |
+
Please check our paper and supplementary materials.🤗
|
| 32 |
+
|
| 33 |
+
## Bibtex🤗
|
| 34 |
+
```
|
| 35 |
+
(TODO)
|
| 36 |
```
|