Upload README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,52 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Instruction for downloading data from the sft-data repository.
|
| 2 |
+
|
| 3 |
+
First, you would want to log in and access the huggingface data through using
|
| 4 |
+
|
| 5 |
+
```py
|
| 6 |
+
from huggingface_hub import login
|
| 7 |
+
login()
|
| 8 |
+
```
|
| 9 |
+
|
| 10 |
+
Then, you could either download the zip file of the all the sft data folders, which would look like
|
| 11 |
+
|
| 12 |
+
```py
|
| 13 |
+
from huggingface_hub import hf_hub_download
|
| 14 |
+
hf_hub_download(repo_id="LEVI-Project/sft-data", filename="sft-data.zip")
|
| 15 |
+
```
|
| 16 |
+
|
| 17 |
+
Notice that the `sft-data.zip` file above has the following structure:
|
| 18 |
+
|
| 19 |
+
```
|
| 20 |
+
sft-data
|
| 21 |
+
│ README.md This README file.
|
| 22 |
+
└───alf Folder for ALFWORLD.
|
| 23 |
+
│ │ alfworld.json The JSON file for ALFWORLD.
|
| 24 |
+
│ └───alf_data_folder Folder for the ALFWORLD environment.
|
| 25 |
+
│ │ alf_image_id_0 Folder 0 for ALFWORLD image data
|
| 26 |
+
│ │ alf_image_id_1 Folder 1 for ALFWORLD image data
|
| 27 |
+
│ │ alf_image_id_2 Folder 3 for ALFWORLD image data
|
| 28 |
+
│ │ alf_image_id_3 Folder 3 for ALFWORLD image data
|
| 29 |
+
│ │ alf_image_id_4 Folder 4 for ALFWORLD image data
|
| 30 |
+
└───blackjack Folder for blackjack environment in the `gym_cards`
|
| 31 |
+
│ blackjack_data_folder Folder for blackjack image data.
|
| 32 |
+
│ blackjack.json The JSON file for blackjack.
|
| 33 |
+
└───ezpoints Folder for ezpoints environment in the `gym_cards`.
|
| 34 |
+
│ ezpoints_data_folder Folder for ezpoints image data.
|
| 35 |
+
│ ezpoints.json The JSON file for ezpoints.
|
| 36 |
+
└───points24 Folder for points24 environment in the `gym_cards`.
|
| 37 |
+
│ points24_data_folder Folder for points24 image data.
|
| 38 |
+
│ points24.json The JSON file for points24.
|
| 39 |
+
└───numberline Folder for numberline environment in the `gym_cards`
|
| 40 |
+
│ numberline_data_folder Folder for numberline image data.
|
| 41 |
+
│ numberline.json The JSON file for numberline.
|
| 42 |
+
```
|
| 43 |
+
|
| 44 |
+
Also, you could choose to download the files for any environment out of the five ones. For example, you should be using the following code for downloading data from blackjack.
|
| 45 |
+
|
| 46 |
+
```py
|
| 47 |
+
from huggingface_hub import hf_hub_download
|
| 48 |
+
hf_hub_download(repo_id="LEVI-Project/llava-data", filename="blackjack.zip") # zip folder for image data folder
|
| 49 |
+
hf_hub_download(repo_id="LEVI-Project/llava-data", filename="blackjack.json") # JSON file
|
| 50 |
+
```
|
| 51 |
+
|
| 52 |
+
For ALFWORLD, notice that the zip file for the image data folder is `alf_data_folder.zip`.
|