update for how to git clone
Browse files
sections/03_create_dataset.md
CHANGED
|
@@ -8,6 +8,12 @@ See the help page on how to set up [security tokens](https://huggingface.co/docs
|
|
| 8 |
* Click Create New Token → fill out information
|
| 9 |
* Save the token, e.g. in a password manager
|
| 10 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 11 |
#### Data processing workflow overview
|
| 12 |
|
| 13 |
1. Create pilot datasets in personal space and then once ready transfer to the Rosetta Data Bazaar collection
|
|
@@ -19,8 +25,8 @@ See the help page on how to set up [security tokens](https://huggingface.co/docs
|
|
| 19 |
1. Edit READ.md directly in the browser
|
| 20 |
2. upload/delete other files directly
|
| 21 |
3. Add any data processing scripts/workflows for reproducibility
|
| 22 |
-
1. `git clone https://
|
| 23 |
-
2. create analysis folder structure
|
| 24 |
|
| 25 |
src/ \# scripts for data curation
|
| 26 |
|
|
|
|
| 8 |
* Click Create New Token → fill out information
|
| 9 |
* Save the token, e.g. in a password manager
|
| 10 |
|
| 11 |
+
After retriving your personal access token, you can set up git with HuggingFace via command-line. Briefly, this looks like:
|
| 12 |
+
```
|
| 13 |
+
pip install huggingface_hub
|
| 14 |
+
huggingface-cli login
|
| 15 |
+
```
|
| 16 |
+
|
| 17 |
#### Data processing workflow overview
|
| 18 |
|
| 19 |
1. Create pilot datasets in personal space and then once ready transfer to the Rosetta Data Bazaar collection
|
|
|
|
| 25 |
1. Edit READ.md directly in the browser
|
| 26 |
2. upload/delete other files directly
|
| 27 |
3. Add any data processing scripts/workflows for reproducibility
|
| 28 |
+
1. `git clone https://huggingface.co/datasets/<username>/<repo-name>`
|
| 29 |
+
2. create analysis folder structure, such as:
|
| 30 |
|
| 31 |
src/ \# scripts for data curation
|
| 32 |
|