Update README.md
Browse files
README.md
CHANGED
|
@@ -5,7 +5,7 @@ license: apache-2.0
|
|
| 5 |
# DataSmith
|
| 6 |
|
| 7 |
## Introduction
|
| 8 |
-
DataSmith is a large model
|
| 9 |
|
| 10 |
## Models Available
|
| 11 |
- DataSmith-6B
|
|
@@ -73,7 +73,7 @@ print(response)
|
|
| 73 |
|
| 74 |
```
|
| 75 |
|
| 76 |
-
##
|
| 77 |
We use gpt-4 to generate training corpus by constructing prompt. If you need it, please contact us by email.
|
| 78 |
|
| 79 |
## Contributing
|
|
|
|
| 5 |
# DataSmith
|
| 6 |
|
| 7 |
## Introduction
|
| 8 |
+
DataSmith is a large model designed to generate JSON-format data from textual content. The DataSmith-6B version, equipped with 6 billion parameters, is fine-tuned using a comprehensive selection of data sources, including news, encyclopedias, legal documents, medical records, advertising, academic papers, books, novels, and various public announcements. This model serves as the foundation for a series of task-specific adaptations.
|
| 9 |
|
| 10 |
## Models Available
|
| 11 |
- DataSmith-6B
|
|
|
|
| 73 |
|
| 74 |
```
|
| 75 |
|
| 76 |
+
## Datasets
|
| 77 |
We use gpt-4 to generate training corpus by constructing prompt. If you need it, please contact us by email.
|
| 78 |
|
| 79 |
## Contributing
|