Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,114 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
---
|
| 4 |
+
# Geometric Reasoning Network (GRN)
|
| 5 |
+
|
| 6 |
+

|
| 7 |
+
|
| 8 |
+
## Model Details
|
| 9 |
+
|
| 10 |
+
- **Model name:** Geometric Reasoning Network (GRN)
|
| 11 |
+
- **Model type:** Graph Neural Network for robot manipulation feasibility prediction
|
| 12 |
+
- **Framework:** PyTorch, PyTorch Geometric
|
| 13 |
+
- **Associated paper:**
|
| 14 |
+
*Learning Geometric Reasoning Networks for Robot Task and Motion Planning*, ICLR 2025
|
| 15 |
+
- **Authors:** Smail Ait Bouhsain, Rachid Alami, Thierry Siméon
|
| 16 |
+
- **License:** See repository LICENSE
|
| 17 |
+
- **Paper:** https://openreview.net/pdf?id=ajxAJ8GUX4
|
| 18 |
+
- **Repository:** https://github.com/smail8/geometric_reasoning_networks
|
| 19 |
+
|
| 20 |
+
GRN is a learned geometric reasoning model designed to augment **robot Task and Motion Planning (TAMP)** by predicting the feasibility of manipulation actions in cluttered 3D environments.
|
| 21 |
+
|
| 22 |
+
---
|
| 23 |
+
|
| 24 |
+
## Model Description
|
| 25 |
+
|
| 26 |
+
The Geometric Reasoning Network (GRN) operates on **graph-structured representations of manipulation scenes**, encoding objects, candidate grasps, actions, and their geometric relations. It predicts action and grasp feasibility by reasoning jointly over learned geometric constraints.
|
| 27 |
+
|
| 28 |
+
GRN integrates predictions from auxiliary learned modules:
|
| 29 |
+
- **Inverse Kinematics (IK) feasibility**
|
| 30 |
+
- **Grasp Obstruction (GO)**
|
| 31 |
+
- **Action and Grasp Feasibility (AGF)**
|
| 32 |
+
|
| 33 |
+
---
|
| 34 |
+
|
| 35 |
+
## Intended Use
|
| 36 |
+
|
| 37 |
+
### Primary Use Cases
|
| 38 |
+
- Augmenting robot Task and Motion Planning (TAMP)
|
| 39 |
+
- Learned feasibility prediction for manipulation actions
|
| 40 |
+
- Research in robotic manipulation and geometric reasoning
|
| 41 |
+
- Benchmarking learned planning heuristics
|
| 42 |
+
|
| 43 |
+
### Out-of-Scope Use
|
| 44 |
+
- Low-level control or trajectory optimization
|
| 45 |
+
- Safety-critical deployment without validation
|
| 46 |
+
- Non-robotic domains
|
| 47 |
+
|
| 48 |
+
---
|
| 49 |
+
|
| 50 |
+
## Model Architecture
|
| 51 |
+
|
| 52 |
+
- Graph Attention Network with message passing
|
| 53 |
+
- Nodes represent objects
|
| 54 |
+
- Edges encode geometric and relational constraints
|
| 55 |
+
- Supervised training on large-scale simulated datasets
|
| 56 |
+
|
| 57 |
+
---
|
| 58 |
+
|
| 59 |
+
## Training Details
|
| 60 |
+
|
| 61 |
+
- **Training data:** GRN simulated manipulation datasets
|
| 62 |
+
- **Loss:** Binary cross-entropy + Mean-squared error
|
| 63 |
+
- **Optimizer:** Adam
|
| 64 |
+
- **Hyperparameters:**
|
| 65 |
+
| Module | Batch size | Learning Rate | Epochs |
|
| 66 |
+
| --------- | ---------- | ------------- | ------ |
|
| 67 |
+
| IK | 8192 | 1e-3 | 100 |
|
| 68 |
+
| GO | 8192 | 1e-3 | 100 |
|
| 69 |
+
| AGF | 2048 | 1e-4 | 100 |
|
| 70 |
+
| GRN | 2048 | 1e-4 | 100 |
|
| 71 |
+
|
| 72 |
+
- **Hardware:** NVIDIA RTX A5000 GPU
|
| 73 |
+
- **Training Time:** 15 hours
|
| 74 |
+
|
| 75 |
+
---
|
| 76 |
+
|
| 77 |
+
## Evaluation
|
| 78 |
+
|
| 79 |
+
Evaluation is performed on both in-distribution and out-of-distribution datasets with increasing scene complexity.
|
| 80 |
+
|
| 81 |
+
Metrics include:
|
| 82 |
+
- Feasibility prediction accuracy
|
| 83 |
+
- Downstream task and motion planning success rate
|
| 84 |
+
- Generalization to unseen clutter levels and object sizes
|
| 85 |
+
|
| 86 |
+
GRN outperforms MLP, CNN-based, and standard GNN baselines. Please refer to the paper for detailed results.
|
| 87 |
+
|
| 88 |
+

|
| 89 |
+
|
| 90 |
+
---
|
| 91 |
+
|
| 92 |
+
## Limitations
|
| 93 |
+
|
| 94 |
+
- Trained exclusively in simulation
|
| 95 |
+
- Performance degrades for extreme clutter or unseen geometries
|
| 96 |
+
- Assumes accurate scene geometry and object poses
|
| 97 |
+
|
| 98 |
+
---
|
| 99 |
+
|
| 100 |
+
## Ethical Considerations
|
| 101 |
+
|
| 102 |
+
This model does not involve human data. Users are responsible for ensuring safe deployment when integrated into physical robotic systems.
|
| 103 |
+
|
| 104 |
+
---
|
| 105 |
+
|
| 106 |
+
## Citation
|
| 107 |
+
|
| 108 |
+
```bibtex
|
| 109 |
+
@inproceedings{ait2025learning,
|
| 110 |
+
title={Learning Geometric Reasoning Networks for Robot Task and Motion Planning},
|
| 111 |
+
author={Ait Bouhsain, Smail and Alami, Rachid and Simeon, Thierry},
|
| 112 |
+
booktitle={The Thirteenth International Conference on Learning Representations},
|
| 113 |
+
year={2025}
|
| 114 |
+
}
|