Update README.md
Browse files
README.md
CHANGED
|
@@ -16,10 +16,10 @@ We first pretrained the model with 2 million sentences from Dutch historical nov
|
|
| 16 |
Afterward, we finetuned the model with a 10k dataset consisting of 19th-century Dutch sentences;
|
| 17 |
these sentences were automatically annotated by a rule-based system built for 19th-century Dutch spelling normalization (van Cranenburgh and van Noord, 2022).
|
| 18 |
|
| 19 |
-
The model is only available in the TensorFlow format but can be converted to a Pytroch environment.
|
| 20 |
-
The pretrained only weights are
|
| 21 |
-
The pretrained only weights are available in the directory
|
| 22 |
-
The train and validation sets used for finetuning are available in the repository.
|
| 23 |
For further information about the model, please see the [GitHub](https://github.com/Awolters123/Master-Thesis) repository.
|
| 24 |
|
| 25 |
|
|
@@ -50,5 +50,4 @@ _Batch size_: 32
|
|
| 50 |
_Optimizer_: AdamW
|
| 51 |
_Epochs_: 30, with earlystopping
|
| 52 |
|
| 53 |
-
To further finetune the model, use the
|
| 54 |
-
If you want to finetune the pretrained weights from scratch, you have to first convert the Flax file into a Pytorch or TensorFlow environment.
|
|
|
|
| 16 |
Afterward, we finetuned the model with a 10k dataset consisting of 19th-century Dutch sentences;
|
| 17 |
these sentences were automatically annotated by a rule-based system built for 19th-century Dutch spelling normalization (van Cranenburgh and van Noord, 2022).
|
| 18 |
|
| 19 |
+
The finetuned model is only available in the TensorFlow format but can be converted to a Pytroch environment.
|
| 20 |
+
The pretrained only weights are available in the Pytorch environment; note that this model has to be finetuned first.
|
| 21 |
+
The pretrained only weights are available in the directory __Pretrained_ByT5__.
|
| 22 |
+
The train and validation sets used for finetuning are available in the main repository.
|
| 23 |
For further information about the model, please see the [GitHub](https://github.com/Awolters123/Master-Thesis) repository.
|
| 24 |
|
| 25 |
|
|
|
|
| 50 |
_Optimizer_: AdamW
|
| 51 |
_Epochs_: 30, with earlystopping
|
| 52 |
|
| 53 |
+
To further finetune or pretrain the model, use the __T5Trainer.py__ script.
|
|
|