Ramon Gougis commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -45,11 +45,11 @@ from huggingface_hub import hf_hub_download
|
|
| 45 |
ckpt_path = hf_hub_download(repo_id="ragou19/WaveletLM", filename="best_model.pt")
|
| 46 |
|
| 47 |
# Then follow the instructions in the GitHub repo to load and run:
|
| 48 |
-
# https://github.com/ramongougis/WaveletLM
|
| 49 |
```
|
|
|
|
| 50 |
|
| 51 |
## Architecture
|
| 52 |
-
|
| 53 |
|
| 54 |
## Training
|
| 55 |
Trained on a single RTX 5090 for 5 epochs on WikiText-103 (best of 3 seeds: 1337, 42, 7). Best validation loss: 3.16. PG-19 weights also included (1-epoch run; longer training planned post-release).
|
|
|
|
| 45 |
ckpt_path = hf_hub_download(repo_id="ragou19/WaveletLM", filename="best_model.pt")
|
| 46 |
|
| 47 |
# Then follow the instructions in the GitHub repo to load and run:
|
|
|
|
| 48 |
```
|
| 49 |
+
https://github.com/ramongougis/WaveletLM
|
| 50 |
|
| 51 |
## Architecture
|
| 52 |
+

|
| 53 |
|
| 54 |
## Training
|
| 55 |
Trained on a single RTX 5090 for 5 epochs on WikiText-103 (best of 3 seeds: 1337, 42, 7). Best validation loss: 3.16. PG-19 weights also included (1-epoch run; longer training planned post-release).
|