KotshinZ/gpt2-RMT-8-mem512

Generated from Trainer


KotshinZ committed on Mar 17, 2025

Commit eabe392 · verified · 1 Parent(s): 2840f89

Update README.md

Files changed (1)

README.md +2 -0


@@ -15,6 +15,7 @@ licence: license
 This model is a fine-tuned version of [KotshinZ/gpt2-RMT-7-mem512](https://huggingface.co/KotshinZ/gpt2-RMT-7-mem512) on the [HuggingFaceFW/fineweb-edu](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu) dataset.
 It has been trained using [TRL](https://github.com/huggingface/trl).
+To use this model, you need to clone [KotShinZ/Recurrent-Memory-Transformer_PreTrained.git](https://github.com/KotShinZ/Recurrent-Memory-Transformer_PreTrained).
 ## Quick start
@@ -33,6 +34,7 @@ print(output["generated_text"])
 This model was trained with SFT.
+This model uses memory_size = 512 and n_backward = 8.
 ### Framework versions
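
The README line added in this commit asks users to clone the companion repository before using the model. A minimal sketch of that step follows. The repository URL comes from the README. The `CLONE_DIR` variable and the `clone_rmt_repo` helper are hypothetical names used only for illustration. This commit does not say how the cloned code is imported or configured afterwards, so that part is left out.

```shell
# Repository URL taken from the README line added in this commit.
REPO_URL="https://github.com/KotShinZ/Recurrent-Memory-Transformer_PreTrained.git"
# Assumption: clone into a directory named after the repository.
CLONE_DIR="Recurrent-Memory-Transformer_PreTrained"

# Hypothetical helper: clone the repository unless the directory already exists.
clone_rmt_repo() {
    if [ -d "$CLONE_DIR" ]; then
        echo "Already cloned: $CLONE_DIR"
    else
        git clone "$REPO_URL" "$CLONE_DIR"
    fi
}
```

Calling `clone_rmt_repo` from the directory where you want the checkout performs the clone. Running it a second time is a no-op.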