ksjpswaroop commited on
Commit
6af6512
·
verified ·
1 Parent(s): e94067e

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +11 -15
README.md CHANGED
@@ -1,23 +1,19 @@
1
- # nanochat-eos (EOS-chat)
2
 
3
- Energy LLM **EOS-chat** model and tokenizer, developed at Safire by the AI team.
4
 
5
- ## Layout
6
 
7
- - **tokenizer/** – BPE tokenizer (`tokenizer.pkl`, `token_bytes.pt`). Used by all checkpoints.
8
- - **base_checkpoints/d26/**Pretrained base model (step 7226): `model_007226.pt`, `meta_007226.json`.
9
- - **chatsft_checkpoints/d26/**SFT (chat) model (step 848): `model_000848.pt`, `meta_000848.json`.
10
 
11
- ## Loading locally
12
 
13
- Set `NANOCHAT_BASE_DIR` to the directory that contains `tokenizer/`, `base_checkpoints/`, and `chatsft_checkpoints/` (e.g. this repo root after cloning), then:
14
 
15
  ```bash
16
- # Chat (SFT) model
17
- python -m scripts.chat_cli
18
-
19
- # Or specify phase and tag
20
- python -m scripts.chat_cli --model-tag d26 --phase sft
21
  ```
22
-
23
- The scripts resolve `get_base_dir()` to `NANOCHAT_BASE_DIR` and load the tokenizer and latest checkpoint from `chatsft_checkpoints/d26` (or `base_checkpoints/d26` for base).
 
1
+ # nanochat EOS-chat (SFT)
2
 
3
+ Nanochat SFT checkpoint and tokenizer for EOS-chat.
4
 
5
+ ## Files
6
 
7
+ - `chatsft_checkpoints/d26/model_000704.pt` model state dict
8
+ - `chatsft_checkpoints/d26/meta_000704.json`config and metadata
9
+ - `tokenizer/`tokenizer files
10
 
11
+ ## Load with nanochat
12
 
13
+ Clone this repo and set it as the nanochat base directory:
14
 
15
  ```bash
16
+ git clone https://huggingface.co/ksjpswaroop/nanochat-eos nanochat-eos
17
+ export NANOCHAT_BASE_DIR=$(pwd)/nanochat-eos
18
+ python -m scripts.chat_cli # use SFT model
 
 
19
  ```