Update README.md
Browse files
README.md
CHANGED
|
@@ -1,4 +1,27 @@
|
|
| 1 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
Gradio - ready 50 steps
|
| 3 |
- Fine-tuning
|
| 4 |
Oasst Guanaco 100 steps
|
|
|
|
| 1 |
+
---
|
| 2 |
+
task_categories:
|
| 3 |
+
- text-generation
|
| 4 |
+
---
|
| 5 |
+
# Description
|
| 6 |
+
This language model is the version 0.0 of a Gradio Coding Assistant. It is an instruction fine-tuned version of [StarCoder](https://huggingface.co/bigcode/starcoder) that is
|
| 7 |
+
design to provide assistance to developers who are using [gradio](https://www.gradio.app).
|
| 8 |
+
|
| 9 |
+
# Dataset
|
| 10 |
+
The dataset is multi-source. Its content comes from the following sources
|
| 11 |
+
- The stack
|
| 12 |
+
More precisely, we looked into [the-stack-dedup](https://huggingface.co/datasets/bigcode/the-stack-dedup) which contain codes permissive licenses. We shortlisted the files whose
|
| 13 |
+
content incorporated the keyword `gradio`.
|
| 14 |
+
- GitHub Issues
|
| 15 |
+
We scrapped all the issues of the official repository [the-gradio-app/gradio](https://github.com/gradio-app/gradio) and added them to our training dataset.
|
| 16 |
+
- Spaces on Hugging Face Hub
|
| 17 |
+
We used the [huggingface_hub api](https://huggingface.co/docs/huggingface_hub/package_reference/hf_api) to scrape the data from the spaces which are designed with gradio. We kept track of those
|
| 18 |
+
with permissive licenses, namely MIT and Apache 2.0. This set of code was further deduplicated.
|
| 19 |
+
|
| 20 |
+
# Training setting and hyperparameters
|
| 21 |
+
For our fine-tuning, we decided to follow a 2-step strategy.
|
| 22 |
+
- Pretraining (Fine-tuning) with next token prediction on the previously built gradio dataset (this step should familiarize the model with the gradio syntax.)
|
| 23 |
+
- Instruction fine-tuning on an instruction fine-tuning (this step should make the model conversational)
|
| 24 |
+
## Pretraining
|
| 25 |
Gradio - ready 50 steps
|
| 26 |
- Fine-tuning
|
| 27 |
Oasst Guanaco 100 steps
|