thxplz committed on
Commit 012204e · verified · 1 Parent(s): 5a3a1dc

Update README.md

Files changed (1)
  1. README.md +18 -4
README.md CHANGED
@@ -7,17 +7,31 @@ pipeline_tag: image-text-to-text
 
  # HOI-R1: Exploring the Potential of Multimodal Large Language Models for Human-Object Interaction Detection
 
- [paper](https://arxiv.org/abs/2510.05609)
+ [![arXiv](https://img.shields.io/badge/arXiv-2510.05609-b31b1b.svg)](https://arxiv.org/abs/2510.05609)
+
+ This repository contains the official resources for **HOI-R1**, a research project that explores the potential of **Multimodal Large Language Models (MLLMs)** for **Human-Object Interaction (HOI) Detection**.
+
+ HOI-R1 is inspired by recent advances in reinforcement learning for large language models and investigates how vision-language models can reason about and detect human-object interactions more effectively.
+
+ ---
+
+ ## 🔍 Overview
+
+ - **Task**: Human-Object Interaction Detection (HOID)
+ - **Our Motivation**:
+ Leverage the reasoning capability of Multimodal LLMs and reinforcement learning–style optimization to explore HOI detection performance.
+ ---
 
  ![hoi-r1-arch](https://cdn-uploads.huggingface.co/production/uploads/63119ce2fb65b9a3e2f75e3c/tHYWwrnqBAHsoo8lIOtnM.jpeg)
 
- ## Reference
+ ## 📌 Citation
+
+ If you find this work useful, please consider citing:
 
- ```text
+ ```bibtex
  @article{chen2025hoi,
  title={HOI-R1: Exploring the Potential of Multimodal Large Language Models for Human-Object Interaction Detection},
  author={Chen, Junwen and Xiong, Peilin and Yanai, Keiji},
  journal={arXiv preprint arXiv:2510.05609},
  year={2025}
  }
- ```