Update README.md
README.md
@@ -7,17 +7,31 @@ pipeline_tag: image-text-to-text

 # HOI-R1: Exploring the Potential of Multimodal Large Language Models for Human-Object Interaction Detection

-[
+[](https://arxiv.org/abs/2510.05609)
+
+This repository contains the official resources for **HOI-R1**, a research project that explores the potential of **Multimodal Large Language Models (MLLMs)** for **Human-Object Interaction (HOI) Detection**.
+
+HOI-R1 is inspired by recent advances in reinforcement learning for large language models and investigates how vision-language models can reason about and detect human-object interactions more effectively.
+
+---
+
+## 🔍 Overview
+
+- **Task**: Human-Object Interaction Detection (HOID)
+- **Our Motivation**:
+Leverage the reasoning capability of Multimodal LLMs and reinforcement learning–style optimization to explore HOI detection performance.
+---



-##
+## 📌 Citation
+
+If you find this work useful, please consider citing:

-```
+```bibtex
 @article{chen2025hoi,
 title={HOI-R1: Exploring the Potential of Multimodal Large Language Models for Human-Object Interaction Detection},
 author={Chen, Junwen and Xiong, Peilin and Yanai, Keiji},
 journal={arXiv preprint arXiv:2510.05609},
 year={2025}
 }
-```