HuggingFaceFW
/

ablation-model-fineweb-edu

Text Generation

text-generation-inference

Model card Files Files and versions

loubnabnl HF Staff commited on Jun 5, 2024

Commit

ee29d8d

·

verified ·

1 Parent(s): a3db1f2

Update README.md

Files changed (1) hide show

README.md +16 -0

README.md CHANGED Viewed

@@ -41,6 +41,22 @@ inputs = tokenizer.encode("Machine Learning is", return_tensors="pt").to(device)
 outputs = model.generate(inputs)
 print(tokenizer.decode(outputs[0]))
 ```
 ## Training
 ### Model
 - Architecture: Llama model

 outputs = model.generate(inputs)
 print(tokenizer.decode(outputs[0]))
 ```
+## Intermediate checkpoints
+We are releasing intermediate checkpoints for this model at intervals of every 1000 training steps in separate branches. The naming convention is `step-001000-2BT`.
+You can load a specific model revision with `transformers` using the argument `revision`:
+```python
+model = AutoModelForCausalLM.from_pretrained(checkpoint, revision="step-001000-2BT")
+```
+You can access all the revisions for the models via the following code:
+```python
+from huggingface_hub import list_repo_refs
+out = list_repo_refs(checkpoint)
+branches = [b.name for b in out.branches]
+```
 ## Training
 ### Model
 - Architecture: Llama model