GottBERT
/

GottBERT_base_last

Model card Files Files and versions

Raphael Scheible commited on Nov 5, 2024

Commit

4e1f078

·

verified ·

1 Parent(s): 1ba3dea

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -94,6 +94,9 @@ Details:
 - **Filtered vs Unfiltered Data**: Minor improvements seen with filtered data, but not significant enough to justify filtering in every case.
 - **Computation Limitations**: Fixed memory allocation on TPUs required processing data as a single stream, unlike GPU training which preserves document boundaries. Training was performed in 32-bit mode due to framework limitations, increasing memory usage.
 ## Citations
 If you use GottBERT in your research, please cite the following paper:
 ```bibtex

 - **Filtered vs Unfiltered Data**: Minor improvements seen with filtered data, but not significant enough to justify filtering in every case.
 - **Computation Limitations**: Fixed memory allocation on TPUs required processing data as a single stream, unlike GPU training which preserves document boundaries. Training was performed in 32-bit mode due to framework limitations, increasing memory usage.
+## Fairseq Checkpoints
+Get the fairseq checkpoints [here](https://drive.proton.me/urls/CFSGE8ZK9R#1F1G727lv77k).
 ## Citations
 If you use GottBERT in your research, please cite the following paper:
 ```bibtex