File size: 3,764 Bytes

---
license: mit
---
# Geometric Reasoning Networks (GRN)

![GRN model architecture](https://github.com/Smail8/geometric_reasoning_networks/blob/main/assets/model.png?raw=true)

## Model Details

- **Model name:** Geometric Reasoning Networks (GRN)
- **Model type:** Graph Neural Network for robot manipulation feasibility prediction
- **Framework:** PyTorch, PyTorch Geometric
- **Associated paper:**  
  *Learning Geometric Reasoning Networks for Robot Task and Motion Planning*, ICLR 2025
- **Authors:** Smail Ait Bouhsain, Rachid Alami, Thierry Siméon
- **License:** See repository LICENSE
- **Paper:** https://openreview.net/pdf?id=ajxAJ8GUX4
- **Repository:** https://github.com/smail8/geometric_reasoning_networks

GRN is a learned geometric reasoning model designed to augment **robot Task and Motion Planning (TAMP)** by predicting the feasibility of manipulation actions in cluttered 3D environments.

---

## Model Description

The Geometric Reasoning Network (GRN) operates on **graph-structured representations of manipulation scenes**, encoding objects, candidate grasps, actions, and their geometric relations. It predicts action and grasp feasibility by reasoning jointly over learned geometric constraints.

GRN integrates predictions from auxiliary learned modules:
- **Inverse Kinematics (IK) feasibility**
- **Grasp Obstruction (GO)**
- **Action and Grasp Feasibility (AGF)**

---

## Intended Use

### Primary Use Cases
- Augmenting robot Task and Motion Planning (TAMP)
- Learned feasibility prediction for manipulation actions
- Research in robotic manipulation and geometric reasoning
- Benchmarking learned planning heuristics

### Out-of-Scope Use
- Low-level control or trajectory optimization
- Safety-critical deployment without validation
- Non-robotic domains

---

## Model Architecture

- Graph Attention Network with message passing
- Nodes represent objects
- Edges encode geometric and relational constraints
- Supervised training on large-scale simulated datasets

---

## Training Details

- **Training data:** GRN simulated manipulation datasets
- **Loss:** Binary cross-entropy + Mean-squared error
- **Optimizer:** Adam
- **Hyperparameters:**
  | Module    | Batch size | Learning Rate | Epochs |
  | --------- | ---------- | ------------- | ------ |
  | IK        | 8192       | 1e-3          | 100    |
  | GO        | 8192       | 1e-3          | 100    |
  | AGF       | 2048       | 1e-4          | 100    |
  | GRN       | 2048       | 1e-4          | 100    |
 
- **Hardware:**  NVIDIA RTX A5000 GPU
- **Training Time:** 15 hours

---

## Evaluation

Evaluation is performed on both in-distribution and out-of-distribution datasets with increasing scene complexity.

Metrics include:
- Feasibility prediction accuracy
- Downstream task and motion planning success rate
- Generalization to unseen clutter levels and object sizes

GRN outperforms MLP, CNN-based, and standard GNN baselines. Please refer to the paper for detailed results.

![GRN results visualization](https://github.com/Smail8/geometric_reasoning_networks/blob/main/assets/viz.png?raw=true)

---

## Limitations

- Trained exclusively in simulation
- Performance degrades for extreme clutter or unseen geometries
- Assumes accurate scene geometry and object poses

---

## Ethical Considerations

This model does not involve human data. Users are responsible for ensuring safe deployment when integrated into physical robotic systems.

---

## Citation

```bibtex
@inproceedings{ait2025learning,
  title={Learning Geometric Reasoning Networks for Robot Task and Motion Planning},
  author={Ait Bouhsain, Smail and Alami, Rachid and Simeon, Thierry},
  booktitle={The Thirteenth International Conference on Learning Representations},
  year={2025}
}