Update model card for log-spaced checkpoints
README.md
CHANGED
```diff
@@ -28,14 +28,17 @@ Each branch contains a full training checkpoint at a given step, including:
 
 ## Branches
 
-
+20 log-spaced checkpoints are available as branches:
 
 - `step0` — initialization
 - `step{1,2,4,8,16,32,64,128,256,512}` — log-spaced early checkpoints
-- `
+- `step{1000,2000,4000,8000,16000,32000,64000,128000}` — log-spaced training checkpoints
+- `step143000` — final model
 
 Branch `step143000` corresponds to the final model.
 
+> **Note:** To keep storage requirements manageable, this repository provides a log-spaced subset of 20 checkpoints rather than all 154 training checkpoints. If you need linearly-spaced checkpoints (every 1,000 steps), the HuggingFace Transformers-compatible weights for all 154 checkpoints are available at [`EleutherAI/pythia-2.8b`](https://huggingface.co/EleutherAI/pythia-2.8b).
+
 ## Converting to HuggingFace Format
 
 To convert a checkpoint to HuggingFace Transformers format, use the conversion script from [GPT-NeoX](https://github.com/EleutherAI/gpt-neox):
```
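As a sanity check on the branch naming scheme described in the updated model card, the 20 log-spaced branches can be enumerated with a short Python sketch (the enumeration below is illustrative; the branch names themselves come from the card):

```python
# Enumerate the 20 log-spaced checkpoint branches named in the model card.
early = [2 ** i for i in range(10)]           # 1, 2, 4, ..., 512
training = [1000 * 2 ** i for i in range(8)]  # 1000, 2000, ..., 128000
steps = [0] + early + training + [143000]     # step0 through the final model

branches = [f"step{s}" for s in steps]
assert len(branches) == 20  # matches "20 log-spaced checkpoints"
print(branches[0], branches[-1])  # step0 step143000
```

Any of these branch names can be passed as the `revision` argument of `from_pretrained` in Hugging Face Transformers to load that specific checkpoint.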