Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -17,7 +17,7 @@ pinned: false
|
|
| 17 |
|
| 18 |
<div style="border: 2px solid #cce7ff; background-color: #f0f8ff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
|
| 19 |
|
| 20 |
-
# π¬ **1.0 Research Focus**
|
| 21 |
|
| 22 |
## **1.1 Fine-tuning Small LLMs**
|
| 23 |
|
|
@@ -62,11 +62,19 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
|
|
| 62 |
# π **2.0 Datasets**
|
| 63 |
|
| 64 |
## **2.1 Published Datasets**
|
|
|
|
|
|
|
|
|
|
| 65 |
1. [MarineLives/English-Expansions](https://huggingface.co/datasets/MarineLives/English-Expansions)
|
| 66 |
2. [MarineLives/Latin-Expansions](https://huggingface.co/datasets/MarineLives/Latin-Expansions)
|
| 67 |
3. [MarineLives/Line-Insertions](https://huggingface.co/datasets/MarineLives/Line-Insertions)
|
| 68 |
4. [MarineLives/HCA-1358-Errors-In-Phrases](https://huggingface.co/datasets/MarineLives/HCA-1358-Errors-In-Phrases)
|
| 69 |
5. [MarineLives/HCA-13-58-TEXT](https://huggingface.co/datasets/MarineLives/HCA-13-58-TEXT)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 70 |
|
| 71 |
## **2.2 Unpublished Datasets**
|
| 72 |
- **Dataset 1**: 420K tokens, full diplomatic transcription (1627β1660)
|
|
@@ -80,6 +88,6 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
|
|
| 80 |
# π **Explore MarineLives**
|
| 81 |
Join us in unlocking Early Modern history by exploring our [Hugging Face organization](https://huggingface.co/MarineLives) and datasets!
|
| 82 |
You can follow us on BlueSky at [@marinelives.bsky.social](https://bsky.app/profile/marinelives.bsky.social)
|
| 83 |
-
You can explore our content on our [MarineLives wiki](http://www.marinelives.org/wiki/MarineLives) and on our [
|
| 84 |
|
| 85 |
</div>
|
|
|
|
| 17 |
|
| 18 |
<div style="border: 2px solid #cce7ff; background-color: #f0f8ff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
|
| 19 |
|
| 20 |
+
# π¬ **1.0 Research Focus on Hugging Face**
|
| 21 |
|
| 22 |
## **1.1 Fine-tuning Small LLMs**
|
| 23 |
|
|
|
|
| 62 |
# π **2.0 Datasets**
|
| 63 |
|
| 64 |
## **2.1 Published Datasets**
|
| 65 |
+
|
| 66 |
+
### **ENGLISH HIGH COURT OF ADMIRALTY DEPOSITIONS**
|
| 67 |
+
|
| 68 |
1. [MarineLives/English-Expansions](https://huggingface.co/datasets/MarineLives/English-Expansions)
|
| 69 |
2. [MarineLives/Latin-Expansions](https://huggingface.co/datasets/MarineLives/Latin-Expansions)
|
| 70 |
3. [MarineLives/Line-Insertions](https://huggingface.co/datasets/MarineLives/Line-Insertions)
|
| 71 |
4. [MarineLives/HCA-1358-Errors-In-Phrases](https://huggingface.co/datasets/MarineLives/HCA-1358-Errors-In-Phrases)
|
| 72 |
5. [MarineLives/HCA-13-58-TEXT](https://huggingface.co/datasets/MarineLives/HCA-13-58-TEXT)
|
| 73 |
+
|
| 74 |
+
### **YIDDISH LETTERS**
|
| 75 |
+
|
| 76 |
+
1. [MarineLives/Gavin-yiddish-raw-HTR-and-groundtruth-lines](https://huggingface.co/datasets/MarineLives/Gavin_yiddish_raw_HT_and_groundtruth_lines)
|
| 77 |
+
2. [MarineLives/Gavin-yiddish-raw-HTR-and-groundtruth-paragraphs](https://huggingface.co/datasets/MarineLives/Gavin_yiddish_raw_HTR_and_groundtruth_paragraphs)
|
| 78 |
|
| 79 |
## **2.2 Unpublished Datasets**
|
| 80 |
- **Dataset 1**: 420K tokens, full diplomatic transcription (1627β1660)
|
|
|
|
| 88 |
# π **Explore MarineLives**
|
| 89 |
Join us in unlocking Early Modern history by exploring our [Hugging Face organization](https://huggingface.co/MarineLives) and datasets!
|
| 90 |
You can follow us on BlueSky at [@marinelives.bsky.social](https://bsky.app/profile/marinelives.bsky.social)
|
| 91 |
+
You can explore our content on our [MarineLives wiki](http://www.marinelives.org/wiki/MarineLives) and on our [ai-and-history-collaboratory GitHub repository](https://github.com/Addaci/marinelives-collaboratory/wiki).
|
| 92 |
|
| 93 |
</div>
|