DanielGallagherIRE
/

obfuscated-bert-fineweb-1B-original

Model card Files Files and versions

DanielGallagherIRE commited on Jan 20

Commit

f56e3a1

·

verified ·

1 Parent(s): 3155e74

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -8,9 +8,9 @@ base_model:
 - google-bert/bert-base-cased
 ---
 # Model Card
-This model was trained for the purposes of analysing model utility when trained on various [Derived Text Formats](https://text-plus.org/en/themen-dokumentation/atf/).<br>
-These are versions of the same text that are adjusted to reduce the chances that the original text can ever be extracted from the model, with applications in privacy and copyright infringement protection.
-In this case, the model was trained on the original dataset without any obfuscation to be used as a baseline.
 ## Training Configuration

 - google-bert/bert-base-cased
 ---
 # Model Card
+This model was trained for the purposes of analysing model utility when trained on various [Derived Text Formats](https://text-plus.org/en/themen-dokumentation/atf/). These are versions of the same text that are adjusted to reduce the chances that the original text can ever be extracted from the model, with applications in privacy and copyright infringement protection.
+<br><br>
+The dataset used for these experiments is [codelion/fineweb-edu-1B](https://huggingface.co/datasets/codelion/fineweb-edu-1B), with all obfuscated formats found [here](https://huggingface.co/datasets/DanielGallagherIRE/fineweb-edu-1B-obfuscated). In this case, the model was trained on the original dataset without any obfuscation to be used as a baseline.
 ## Training Configuration