File size: 361 Bytes
d8918eb |
1 2 3 4 5 6 7 8 |
---
library_name: transformers
pipeline_tag: image-text-to-text
---
This repository contains the HandsOnVLM model presented in the paper [HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction](https://huggingface.co/papers/2412.13187).
Project page: https://www.chenbao.tech/handsonvlm/
Code: https://github.com/Kami-code/HandsOnVLM-release |