TalkingPoints / README.md
nielsr's picture
nielsr HF Staff
Add pipeline tag, paper, and code links to model card
97cd4f7 verified
|
raw
history blame
1.06 kB
metadata
license: apache-2.0
pipeline_tag: image-to-text

Talking Points: Describing and Localizing Pixels

This repository contains the Point Descriptor and Point Localizer models presented in the paper Talking Points: Describing and Localizing Pixels.

Talking Points introduces a novel framework for pixel-level grounding. It consists of two complementary components: a Point Descriptor that generates rich, contextual descriptions of individual keypoints, and a Point Localizer that regresses precise pixel coordinates from these descriptions. Unlike prior work, our approach produces free-form, coarse-to-fine descriptions that situate keypoints within their visual context.

Talking Points Teaser Image

For detailed information on installation, dataset creation, training, and evaluation scripts, please refer to the official GitHub repository.