| library_name: transformers | |
| pipeline_tag: image-text-to-text | |
| This repository contains the HandsOnVLM model presented in the paper [HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction](https://huggingface.co/papers/2412.13187). | |
| Project page: https://www.chenbao.tech/handsonvlm/ | |
| Code: https://github.com/Kami-code/HandsOnVLM-release |