# RecursiveComplete — 18M GPT-2 trained from scratch

Everything you need to **upload**, **train**, or **use** this model is in this folder.

First, once:
```bash
pip install -r requirements.txt
```

---

## A) USE IT (generate text)
```bash
python chat.py "Once upon a time"
python chat.py "The little robot wanted to"
```
Needs: chat.py, gpt2.py, big.pt, tokenizer_bpe/

## B) CONTINUE TRAINING
Resumes automatically from where it stopped (~iter 7000):
```bash
python train_big.py
```
Bump `max_iters` inside train_big.py (currently 12000) to train longer.
Needs: train_big.py, gpt2.py, big.pt, data/train.bin, data/meta.json

## C) UPLOAD TO HUGGING FACE  (repo: Gentraxyz/RecursiveComplete)
Clean inference files are already generated for you (model.safetensors + config.json).

Windows / PowerShell:
```powershell
powershell -ExecutionPolicy ByPass -c "irm https://hf.co/cli/install.ps1 | iex"
hf auth login
copy HF_README.md README.md
hf upload Gentraxyz/RecursiveComplete .
```
Mac/Linux:
```bash
pip install huggingface_hub
hf auth login
cp HF_README.md README.md
hf upload Gentraxyz/RecursiveComplete .
```

### What to upload vs skip
- UPLOAD: model.safetensors, config.json, gpt2.py, tokenizer_bpe/, README.md, chat.py
- SKIP (optional/large): big.pt (training checkpoint, has optimizer state),
  data/train.bin (180MB corpus — only upload if you want others to retrain)

To upload just the essentials instead of everything:
```bash
hf upload Gentraxyz/RecursiveComplete model.safetensors
hf upload Gentraxyz/RecursiveComplete config.json
hf upload Gentraxyz/RecursiveComplete gpt2.py
```

---

## File guide
| File | Used for | Notes |
|---|---|---|
| big.pt | train / use | full checkpoint (weights + optimizer + iter) ~210MB |
| model.safetensors | upload / use | clean weights only, HF-standard |
| config.json | upload / use | model dimensions |
| gpt2.py | all | model architecture |
| chat.py | use | generation script |
| train_big.py | train | auto-resumes from big.pt |
| prep_bpe.py | (optional) | rebuild train.bin from raw text |
| tokenizer_bpe/ | all | BPE vocab.json + merges.txt |
| data/train.bin | train | tokenized 90M-token corpus |
| data/meta.json | train | vocab size + eot id |
| HF_README.md | upload | rename to README.md on HF |
| requirements.txt | all | torch, tokenizers, numpy, safetensors |

Model: 18.3M params, 448d / 7 heads / 6 layers / 256 ctx, BPE 8192 vocab.
Base completion model (not instruction-tuned). Best at short story-style English.