Spaces:
Running
Running
Commit Β·
c3d811a
1
Parent(s): 1fe831f
Update README.md
Browse files
README.md
CHANGED
|
@@ -7,4 +7,28 @@ sdk: static
|
|
| 7 |
pinned: false
|
| 8 |
---
|
| 9 |
|
| 10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
pinned: false
|
| 8 |
---
|
| 9 |
|
| 10 |
+
# Medieval Data π°
|
| 11 |
+
|
| 12 |
+
Welcome to the **Medieval Data** organization, a dedicated platform for offering datasets specifically curated for training machine learning models on medieval-specific tasks.
|
| 13 |
+
|
| 14 |
+
These datasets and models are maintained by [William J.B. Mattingly](https://wjbmattingly.com/)
|
| 15 |
+
|
| 16 |
+
## Datasets π
|
| 17 |
+
|
| 18 |
+
Here's a quick overview of our available datasets:
|
| 19 |
+
|
| 20 |
+
1. **MGH Critical Edition Dataset**: 100 annotated pages of an MGH critical edition to parse out the main body text and titles from marginalia and footers.
|
| 21 |
+
|
| 22 |
+
## Models π‘οΈ
|
| 23 |
+
|
| 24 |
+
1. **MGH Object Detection YOLOv8**: Annotate an MGH critical edition to extract the main body text and titles automatically. This helps in downstream OCR with Tesseract.
|
| 25 |
+
|
| 26 |
+
Replace `dataset_name` with the specific name of the dataset you're interested in.
|
| 27 |
+
|
| 28 |
+
## Contribute π€
|
| 29 |
+
|
| 30 |
+
We welcome contributions! If you have a medieval-specific dataset or have annotations that can be added, please reach out.
|
| 31 |
+
|
| 32 |
+
## License π
|
| 33 |
+
|
| 34 |
+
All datasets in this organization are released under the [CC BY 4.0 License](https://creativecommons.org/licenses/by/4.0/) unless specified otherwise. Please ensure to cite the original sources and the Medieval Data organization when using the datasets.
|