Add library_name, pipeline tag, license and links to project page and Github repository

by nielsr HF Staff - opened Jun 8, 2025

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

+71

-1

Files changed (1) hide show

README.md +71 -1

README.md CHANGED Viewed

	@@ -1 +1,71 @@
1	- ~~Paper: https://arxiv.org/abs/2505.12448~~

+---
+language:
+- en
+license: apache-2.0
+size_categories:
+- 1K<n<10K
+task_categories:
+- visual-question-answering
+pretty_name: Robo2VLM-Reasoning
+dataset_info:
+  features:
+  - name: id
+    dtype: string
+  - name: question
+    dtype: string
+  - name: choices
+    dtype: string
+  - name: correct_answer
+    dtype: int64
+  - name: image
+    struct:
+    - name: bytes
+      dtype: binary
+    - name: path
+      dtype: 'null'
+  - name: reasoning
+    dtype: string
+  - name: orig_idx
+    dtype: int64
+  - name: images
+    sequence: image
+  splits:
+  - name: train
+    num_bytes: 1783797796.625
+    num_examples: 4635
+  - name: test
+    num_bytes: 201450157.0
+    num_examples: 515
+  download_size: 1971201459
+  dataset_size: 1985247953.625
+configs:
+- config_name: default
+  data_files:
+  - split: train
+    path: data/train-*
+  - split: test
+    path: data/test-*
+tags:
+- robotics
+- vision-language
+---
+# Robo2VLM-Reasoning
+Samples from the dataset: [Robo2VLM-1](https://huggingface.co/datasets/keplerccc/Robo2VLM-1), prompting `gemini-2.5-pro` to generate reasoning traces supporting the correct choice.
+Paper: [Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets](https://huggingface.co/papers/2505.15517)
+```
+@misc{chen2025robo2vlmvisualquestionanswering,
+      title={Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets},
+      author={Kaiyuan Chen and Shuangyu Xie and Zehan Ma and Ken Goldberg},
+      year={2025},
+      eprint={2505.15517},
+      archivePrefix={arXiv},
+      primaryClass={cs.RO},
+      url={https://arxiv.org/abs/2505.15517},
+}
+```