Update README.md
README.md
CHANGED
@@ -9,12 +9,15 @@ design to provide assistance to developers who are using [gradio](https://www.gr
 # Dataset
 The dataset is multi-source. Its content comes from the following sources:
 - The Stack
+
 More precisely, we looked into [the-stack-dedup](https://huggingface.co/datasets/bigcode/the-stack-dedup), which contains code under permissive licenses. We shortlisted the files whose
 content incorporated the keyword `gradio`.
 - GitHub Issues
+
 We scraped all the issues of the official repository [gradio-app/gradio](https://github.com/gradio-app/gradio) and added them to our training dataset.
 - Spaces on Hugging Face Hub
-
+
+We used the [HuggingFace_Hub API](https://huggingface.co/docs/huggingface_hub/package_reference/hf_api) to scrape the data from the Spaces that are built with gradio. We kept track of those
 with permissive licenses, namely MIT and Apache 2.0. This set of code was further deduplicated.
 
 # Training setting and hyperparameters
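The Dataset section in this diff describes three filtering ideas: keeping files that contain the keyword `gradio`, keeping only MIT and Apache 2.0 content, and deduplicating the result. The README does not say how any of these were implemented, so the following is only a minimal sketch under stated assumptions: records are plain dictionaries with hypothetical `path`, `license`, and `content` keys, the keyword match is a case-sensitive substring test, and deduplication is exact-match hashing of file content. All function names are illustrative and not taken from the repository.

```python
import hashlib

# The two licenses named in the README; the lowercase tag spelling is an assumption.
PERMISSIVE_LICENSES = {"mit", "apache-2.0"}


def mentions_gradio(content):
    """Keep a file only if its text contains the keyword `gradio`."""
    return "gradio" in content


def is_permissive(license_id):
    """True when the (hypothetical) license tag is MIT or Apache 2.0."""
    return license_id is not None and license_id.lower() in PERMISSIVE_LICENSES


def deduplicate(files):
    """Drop exact duplicates by hashing content (an assumed strategy)."""
    seen = set()
    unique = []
    for record in files:
        digest = hashlib.sha256(record["content"].encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(record)
    return unique


def build_subset(files):
    """Apply the license filter, then the keyword filter, then deduplication."""
    kept = [
        f for f in files
        if is_permissive(f.get("license")) and mentions_gradio(f["content"])
    ]
    return deduplicate(kept)


# Hypothetical in-memory records standing in for scraped files.
sample = [
    {"path": "a/app.py", "license": "mit", "content": "import gradio as gr\n"},
    {"path": "b/app.py", "license": "Apache-2.0", "content": "import gradio as gr\n"},
    {"path": "c/app.py", "license": "gpl-3.0", "content": "import gradio as gr\n"},
    {"path": "d/util.py", "license": "mit", "content": "import numpy as np\n"},
]
print([f["path"] for f in build_subset(sample)])  # ['a/app.py']
```

In the sample, `b/app.py` is dropped as an exact duplicate of `a/app.py`, `c/app.py` fails the license check, and `d/util.py` does not mention `gradio`. A near-duplicate detector would behave differently; exact hashing is used here only because it is the simplest option to show.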