Robotics
Transformers
Safetensors
qwen2_5_vl
image-text-to-text
text-generation-inference
jan-hq commited on
Commit
6cb08fc
·
verified ·
1 Parent(s): 80afa0b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -9,7 +9,7 @@ tags: []
9
 
10
  ## Introduction
11
 
12
- "PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM," introduces a novel framework for robot hand control that eliminates the need for explicit pose estimation by directly mapping 2D images to joint angles using projected representations. Our approach leverages synthetic training data generated through randomized joint configurations, enabling zero-shot generalization to real-world scenarios and cross-morphology transfer from robotic to human hands. By projecting visual inputs and employing a transformer-based decoder, PoseLess achieves robust, low-latency control while addressing challenges such as depth ambiguity and data scarcity. Experimental results demonstrate competitive performance in joint angle prediction accuracy without relying on any human-labelled dataset
13
 
14
  Our key contributions are as follows:
15
 
@@ -28,7 +28,7 @@ Our key contributions are as follows:
28
 
29
 
30
  ## Citation
31
- BibTeX: [More Information Needed]
32
 
33
  ## More Information
34
  * Contact the authors at alan@menlo.ai, bach@menlo.ai, charles@menlo.ai, yuuki@menlo.ai for further details.
 
9
 
10
  ## Introduction
11
 
12
+ **"PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM"** introduces a novel framework for robot hand control that eliminates the need for explicit pose estimation by directly mapping 2D images to joint angles using projected representations. Our approach leverages synthetic training data generated through randomized joint configurations, enabling zero-shot generalization to real-world scenarios and cross-morphology transfer from robotic to human hands. By projecting visual inputs and employing a transformer-based decoder, PoseLess achieves robust, low-latency control while addressing challenges such as depth ambiguity and data scarcity. Experimental results demonstrate competitive performance in joint angle prediction accuracy without relying on any human-labelled dataset
13
 
14
  Our key contributions are as follows:
15
 
 
28
 
29
 
30
  ## Citation
31
+ BibTeX: []
32
 
33
  ## More Information
34
  * Contact the authors at alan@menlo.ai, bach@menlo.ai, charles@menlo.ai, yuuki@menlo.ai for further details.