GRN_models / README.md

Update README.md

ce28928 verified about 1 month ago

3.76 kB

	---
	license: mit
	---
	# Geometric Reasoning Networks (GRN)

	![GRN model architecture](https://github.com/Smail8/geometric_reasoning_networks/blob/main/assets/model.png?raw=true)

	## Model Details

	- Model name: Geometric Reasoning Networks (GRN)
	- Model type: Graph Neural Network for robot manipulation feasibility prediction
	- Framework: PyTorch, PyTorch Geometric
	- Associated paper:
	Learning Geometric Reasoning Networks for Robot Task and Motion Planning, ICLR 2025
	- Authors: Smail Ait Bouhsain, Rachid Alami, Thierry Siméon
	- License: See repository LICENSE
	- Paper: https://openreview.net/pdf?id=ajxAJ8GUX4
	- Repository: https://github.com/smail8/geometric_reasoning_networks

	GRN is a learned geometric reasoning model designed to augment robot Task and Motion Planning (TAMP) by predicting the feasibility of manipulation actions in cluttered 3D environments.

	---

	## Model Description

	The Geometric Reasoning Network (GRN) operates on graph-structured representations of manipulation scenes, encoding objects, candidate grasps, actions, and their geometric relations. It predicts action and grasp feasibility by reasoning jointly over learned geometric constraints.

	GRN integrates predictions from auxiliary learned modules:
	- Inverse Kinematics (IK) feasibility
	- Grasp Obstruction (GO)
	- Action and Grasp Feasibility (AGF)

	---

	## Intended Use

	### Primary Use Cases
	- Augmenting robot Task and Motion Planning (TAMP)
	- Learned feasibility prediction for manipulation actions
	- Research in robotic manipulation and geometric reasoning
	- Benchmarking learned planning heuristics

	### Out-of-Scope Use
	- Low-level control or trajectory optimization
	- Safety-critical deployment without validation
	- Non-robotic domains

	---

	## Model Architecture

	- Graph Attention Network with message passing
	- Nodes represent objects
	- Edges encode geometric and relational constraints
	- Supervised training on large-scale simulated datasets

	---

	## Training Details

	- Training data: GRN simulated manipulation datasets
	- Loss: Binary cross-entropy + Mean-squared error
	- Optimizer: Adam
	- Hyperparameters:
	\| Module \| Batch size \| Learning Rate \| Epochs \|
	\| --------- \| ---------- \| ------------- \| ------ \|
	\| IK \| 8192 \| 1e-3 \| 100 \|
	\| GO \| 8192 \| 1e-3 \| 100 \|
	\| AGF \| 2048 \| 1e-4 \| 100 \|
	\| GRN \| 2048 \| 1e-4 \| 100 \|

	- Hardware: NVIDIA RTX A5000 GPU
	- Training Time: 15 hours

	---

	## Evaluation

	Evaluation is performed on both in-distribution and out-of-distribution datasets with increasing scene complexity.

	Metrics include:
	- Feasibility prediction accuracy
	- Downstream task and motion planning success rate
	- Generalization to unseen clutter levels and object sizes

	GRN outperforms MLP, CNN-based, and standard GNN baselines. Please refer to the paper for detailed results.

	![GRN results visualization](https://github.com/Smail8/geometric_reasoning_networks/blob/main/assets/viz.png?raw=true)

	---

	## Limitations

	- Trained exclusively in simulation
	- Performance degrades for extreme clutter or unseen geometries
	- Assumes accurate scene geometry and object poses

	---

	## Ethical Considerations

	This model does not involve human data. Users are responsible for ensuring safe deployment when integrated into physical robotic systems.

	---

	## Citation

	```bibtex
	@inproceedings{ait2025learning,
	title={Learning Geometric Reasoning Networks for Robot Task and Motion Planning},
	author={Ait Bouhsain, Smail and Alami, Rachid and Simeon, Thierry},
	booktitle={The Thirteenth International Conference on Learning Representations},
	year={2025}
	}