Spaces:

midreal
/

README

Running

anchen1011 commited on Jul 22, 2024

Commit

2ff18ec

verified ·

1 Parent(s): bb8b382

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -13,8 +13,6 @@ We follow an extremely simple format to organize and manage our models aand data
 # Model
-### Format
 Repo should be named as `midreal/{model_function}_{train_method}_{train_technique}_{base_model}_{date}`
 Model card should include ([example](https://huggingface.co/midreal/passage_sft_qlora_llama3_70b_0704)):
@@ -45,27 +43,8 @@ Tracking of the training procedure
 Person in charge
 ```
-### Usage
-Models could be uploaded with:
-```
-upload
-```
-and downloaded with:
-```
-download
-```
-and updated with:
-```
-update
-```
 # Dataset
-### Format
 Repo should be named as `midreal/{model_function}_{train_method}_{status}_{date}`
 Dataset card should include ([example](https://huggingface.co/datasets/midreal/passage_sft_lmflow_0704)):
@@ -80,9 +59,7 @@ The schema that the data elements should follow
 Person in charge
 ```
-Other information about the dataset is also welcome.
-### Status
 The production of dataset should somehow follow a pipeline from `raw_data` to `story_data` to `openai` or `lmflow` format.
@@ -94,19 +71,25 @@ The production of dataset should somehow follow a pipeline from `raw_data` to `s
 `lmflow` refers to [LMFlow data format](https://optimalscale.github.io/LMFlow/examples/DATASETS.html#data-format).
-### Usage
-Datasets could be uploaded with:
 ```
-upload
 ```
-and downloaded with:
 ```
-download
 ```
-and updated with:
 ```
-update
-```

 # Model
 Repo should be named as `midreal/{model_function}_{train_method}_{train_technique}_{base_model}_{date}`
 Model card should include ([example](https://huggingface.co/midreal/passage_sft_qlora_llama3_70b_0704)):
 Person in charge
 ```
 # Dataset
 Repo should be named as `midreal/{model_function}_{train_method}_{status}_{date}`
 Dataset card should include ([example](https://huggingface.co/datasets/midreal/passage_sft_lmflow_0704)):
 Person in charge
 ```
+Other information about the dataset could also be commented on dataset cards.
 The production of dataset should somehow follow a pipeline from `raw_data` to `story_data` to `openai` or `lmflow` format.
 `lmflow` refers to [LMFlow data format](https://optimalscale.github.io/LMFlow/examples/DATASETS.html#data-format).
+# Usage
+We suggest using Huggingface CLI ([docs](https://huggingface.co/docs/huggingface_hub/main/en/guides/cli)).
+Once you have installed huggingface-cli and login, models/datasets could be uploaded with:
 ```
+huggingface-cli upload [midreal/repo_id] [local_path] ([path_in_repo]) (--repo-type=dataset)
 ```
+e.g.
 ```
+huggingface-cli upload midreal/model ./path/to/curated/data /data/train
+huggingface-cli upload midreal/dataset . . --repo-type=dataset
 ```
+and downloaded with:
 ```
+huggingface-cli download midreal/model
+huggingface-cli download midreal/dataset --repo-type dataset
+```
+If the repo doesn’t exist yet, it will be created automatically.