Upload README.md with huggingface_hub
README.md
CHANGED
@@ -1,23 +1,19 @@
-# nanochat
+# nanochat EOS-chat (SFT)
 
-
+Nanochat SFT checkpoint and tokenizer for EOS-chat.
 
-##
+## Files
 
--
--
--
+- `chatsft_checkpoints/d26/model_000704.pt` – model state dict
+- `chatsft_checkpoints/d26/meta_000704.json` – config and metadata
+- `tokenizer/` – tokenizer files
 
-##
+## Load with nanochat
 
-
+Clone this repo and set it as the nanochat base directory:
 
 ```bash
-
-
-
-# Or specify phase and tag
-python -m scripts.chat_cli --model-tag d26 --phase sft
+git clone https://huggingface.co/ksjpswaroop/nanochat-eos nanochat-eos
+export NANOCHAT_BASE_DIR=$(pwd)/nanochat-eos
+python -m scripts.chat_cli # use SFT model
 ```
-
-The scripts resolve `get_base_dir()` to `NANOCHAT_BASE_DIR` and load the tokenizer and latest checkpoint from `chatsft_checkpoints/d26` (or `base_checkpoints/d26` for base).
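
The removed README text says the scripts take the base directory from `NANOCHAT_BASE_DIR` and load the latest checkpoint from `chatsft_checkpoints/d26` (or `base_checkpoints/d26` for base). The sketch below illustrates that lookup with the standard library only. It is not nanochat's actual code: the helper names `resolve_checkpoint_dir`, `latest_step`, and `checkpoint_files` are hypothetical, and it assumes that "latest" means the highest step number in `model_<step>.pt`, with a six-digit zero-padded step as in the file list above.

```python
import os
import re
from pathlib import Path

# Phase -> checkpoint directory, taken from the README text (assumed mapping).
PHASE_DIRS = {"sft": "chatsft_checkpoints", "base": "base_checkpoints"}


def resolve_checkpoint_dir(phase="sft", model_tag="d26", base_dir=None):
    """Hypothetical helper: build <base_dir>/<phase directory>/<model_tag>."""
    if base_dir is None:
        # Fall back to the environment variable named in the README.
        base_dir = os.environ["NANOCHAT_BASE_DIR"]
    return Path(base_dir) / PHASE_DIRS[phase] / model_tag


def latest_step(checkpoint_dir):
    """Hypothetical helper: highest step among model_<step>.pt files."""
    steps = []
    for path in Path(checkpoint_dir).glob("model_*.pt"):
        match = re.fullmatch(r"model_(\d+)\.pt", path.name)
        if match:
            steps.append(int(match.group(1)))
    if not steps:
        raise FileNotFoundError(f"no model_*.pt files in {checkpoint_dir}")
    return max(steps)


def checkpoint_files(phase="sft", model_tag="d26", base_dir=None):
    """Return (model path, meta path) for the latest step (assumed naming)."""
    ckpt_dir = resolve_checkpoint_dir(phase, model_tag, base_dir)
    step = latest_step(ckpt_dir)
    return ckpt_dir / f"model_{step:06d}.pt", ckpt_dir / f"meta_{step:06d}.json"
```

Under these assumptions, calling `checkpoint_files(base_dir="nanochat-eos")` on the cloned repo would point at the `model_*.pt` and `meta_*.json` pair with the highest step in `chatsft_checkpoints/d26`.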