Smile8 commited on
Commit
d801255
·
verified ·
1 Parent(s): 20eae61

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +114 -3
README.md CHANGED
@@ -1,3 +1,114 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+ # Geometric Reasoning Network (GRN)
5
+
6
+ ![GRN model architecture](https://github.com/Smail8/geometric_reasoning_networks/blob/main/assets/model.png?raw=true)
7
+
8
+ ## Model Details
9
+
10
+ - **Model name:** Geometric Reasoning Network (GRN)
11
+ - **Model type:** Graph Neural Network for robot manipulation feasibility prediction
12
+ - **Framework:** PyTorch, PyTorch Geometric
13
+ - **Associated paper:**
14
+ *Learning Geometric Reasoning Networks for Robot Task and Motion Planning*, ICLR 2025
15
+ - **Authors:** Smail Ait Bouhsain, Rachid Alami, Thierry Siméon
16
+ - **License:** See repository LICENSE
17
+ - **Paper:** https://openreview.net/pdf?id=ajxAJ8GUX4
18
+ - **Repository:** https://github.com/smail8/geometric_reasoning_networks
19
+
20
+ GRN is a learned geometric reasoning model designed to augment **robot Task and Motion Planning (TAMP)** by predicting the feasibility of manipulation actions in cluttered 3D environments.
21
+
22
+ ---
23
+
24
+ ## Model Description
25
+
26
+ The Geometric Reasoning Network (GRN) operates on **graph-structured representations of manipulation scenes**, encoding objects, candidate grasps, actions, and their geometric relations. It predicts action and grasp feasibility by reasoning jointly over learned geometric constraints.
27
+
28
+ GRN integrates predictions from auxiliary learned modules:
29
+ - **Inverse Kinematics (IK) feasibility**
30
+ - **Grasp Obstruction (GO)**
31
+ - **Action and Grasp Feasibility (AGF)**
32
+
33
+ ---
34
+
35
+ ## Intended Use
36
+
37
+ ### Primary Use Cases
38
+ - Augmenting robot Task and Motion Planning (TAMP)
39
+ - Learned feasibility prediction for manipulation actions
40
+ - Research in robotic manipulation and geometric reasoning
41
+ - Benchmarking learned planning heuristics
42
+
43
+ ### Out-of-Scope Use
44
+ - Low-level control or trajectory optimization
45
+ - Safety-critical deployment without validation
46
+ - Non-robotic domains
47
+
48
+ ---
49
+
50
+ ## Model Architecture
51
+
52
+ - Graph Attention Network with message passing
53
+ - Nodes represent objects
54
+ - Edges encode geometric and relational constraints
55
+ - Supervised training on large-scale simulated datasets
56
+
57
+ ---
58
+
59
+ ## Training Details
60
+
61
+ - **Training data:** GRN simulated manipulation datasets
62
+ - **Loss:** Binary cross-entropy + Mean-squared error
63
+ - **Optimizer:** Adam
64
+ - **Hyperparameters:**
65
+ | Module | Batch size | Learning Rate | Epochs |
66
+ | --------- | ---------- | ------------- | ------ |
67
+ | IK | 8192 | 1e-3 | 100 |
68
+ | GO | 8192 | 1e-3 | 100 |
69
+ | AGF | 2048 | 1e-4 | 100 |
70
+ | GRN | 2048 | 1e-4 | 100 |
71
+
72
+ - **Hardware:** NVIDIA RTX A5000 GPU
73
+ - **Training Time:** 15 hours
74
+
75
+ ---
76
+
77
+ ## Evaluation
78
+
79
+ Evaluation is performed on both in-distribution and out-of-distribution datasets with increasing scene complexity.
80
+
81
+ Metrics include:
82
+ - Feasibility prediction accuracy
83
+ - Downstream task and motion planning success rate
84
+ - Generalization to unseen clutter levels and object sizes
85
+
86
+ GRN outperforms MLP, CNN-based, and standard GNN baselines. Please refer to the paper for detailed results.
87
+
88
+ ![GRN results visualization](https://github.com/Smail8/geometric_reasoning_networks/blob/main/assets/viz.png?raw=true)
89
+
90
+ ---
91
+
92
+ ## Limitations
93
+
94
+ - Trained exclusively in simulation
95
+ - Performance degrades for extreme clutter or unseen geometries
96
+ - Assumes accurate scene geometry and object poses
97
+
98
+ ---
99
+
100
+ ## Ethical Considerations
101
+
102
+ This model does not involve human data. Users are responsible for ensuring safe deployment when integrated into physical robotic systems.
103
+
104
+ ---
105
+
106
+ ## Citation
107
+
108
+ ```bibtex
109
+ @inproceedings{ait2025learning,
110
+ title={Learning Geometric Reasoning Networks for Robot Task and Motion Planning},
111
+ author={Ait Bouhsain, Smail and Alami, Rachid and Simeon, Thierry},
112
+ booktitle={The Thirteenth International Conference on Learning Representations},
113
+ year={2025}
114
+ }