Cedille
/

de-anna

@@ -27,7 +27,7 @@ tokenizer = AutoTokenizer.from_pretrained("Cedille/de-anna")
 model = AutoModelForCausalLM.from_pretrained("Cedille/de-anna")
 ```
 ### Lower memory usage
-Loading a model with Huggingface requires two copies of the weights, so 48+ GB of RAM for [GPT_J models](https://huggingface.co/docs/transformers/v4.15.0/model_doc/gptj) in float32 precision.
 The first trick would be to load the model with the specific argument below to load only one copy of the weights.
 ```
 from transformers import AutoTokenizer, AutoModelForCausalLM

 model = AutoModelForCausalLM.from_pretrained("Cedille/de-anna")
 ```
 ### Lower memory usage
+Loading a model with Huggingface requires two copies of the weights, so 48+ GB of RAM for [GPT-J models](https://huggingface.co/docs/transformers/v4.15.0/model_doc/gptj) in float32 precision.
 The first trick would be to load the model with the specific argument below to load only one copy of the weights.
 ```
 from transformers import AutoTokenizer, AutoModelForCausalLM