| # LifeStack documentation |
|
|
| All project documentation lives under **`docs/`** (except the repository **[README.md](../README.md)** at the root, which is the HuggingFace Space and GitHub entry point). |
|
|
| **Repository:** [https://github.com/oki-dokii/Meta-R2](https://github.com/oki-dokii/Meta-R2) |
|
|
| --- |
|
|
| ## Quick links |
|
|
| | Doc | Use | |
| |-----|-----| |
| | [training_guide.md](training_guide.md) | Install, train, resume, push to Hub | |
| | [train_trl.md](train_trl.md) | `scripts/train_trl.py` CLI reference | |
| | [reward.md](reward.md) | All 10 GRPO reward functions and anti-hacking design | |
| | [model_card.md](model_card.md) | Hugging Face model card (post to Hub) | |
| | [blog.md](blog.md) | Engineering retrospective / HF blog draft | |
|
|
| --- |
|
|
| ## Core system |
|
|
| | Doc | Code | Description | |
| |-----|------|-------------| |
| | [lifestack_env.md](lifestack_env.md) | `core/lifestack_env.py` | `LifeStackEnv`, `WorldEngine`, `reset/step/rollout`, OpenEnv shim | |
| | [reward.md](reward.md) | `core/reward.py`, `scripts/train_trl.py` | `compute_reward`, `compute_task_reward`, 10 GRPO reward heads | |
| | [task.md](task.md) | `core/task.py` | `Task`, `Route`, `Milestone`, `ExoEvent` dataclasses | |
| | [memory.md](memory.md) | `agent/memory.py` | `LifeStackMemory`, ChromaDB collections, few-shot injection | |
| | [conflict_generator.md](conflict_generator.md) | `agent/conflict_generator.py` | Crisis scenario templates, `TaskGenerator`, 8 domains | |
|
|
| --- |
|
|
| ## Training & configuration |
|
|
| | Doc | Description | |
| |-----|-------------| |
| | [training_guide.md](training_guide.md) | End-to-end guide: Colab, server, resume, common errors | |
| | [train_trl.md](train_trl.md) | `LifeStackGRPOTrainer`, JSON-boundary masking, 5-stage curriculum | |
| | [configuration.md](configuration.md) | `GRPOConfig` via `_make_grpo_config`, reward weights per stage | |
| | [kaggle_train.md](kaggle_train.md) | Notebook-specific setup (T4, session timeouts, OOM tips) | |
|
|
| --- |
|
|
| ## Scripts & evaluation |
|
|
| | Doc | Description | |
| |-----|-------------| |
| | [eval.md](eval.md) | `scripts/eval.py` random-policy baseline | |
| | [scripts.md](scripts.md) | All scripts under `scripts/` with usage examples | |
|
|
| --- |
|
|
| ## Application & deployment |
|
|
| | Doc | Description | |
| |-----|-------------| |
| | [app.md](app.md) | Flask demo `app_flask.py` on port 7860, routes, `MODEL_REGISTRY` | |
| | [DEPLOYMENT.md](DEPLOYMENT.md) | `start.sh` dual-service, ports 7860 + 8000, env vars, Docker | |
|
|
| --- |
|
|
| ## Meta & pitch |
|
|
| | Doc | Description | |
| |-----|-------------| |
| | [implementation_summary.md](implementation_summary.md) | Engineering summary, training timeline, code map | |
| | [mentor_pitch.md](mentor_pitch.md) | 60–90 second pitch with honest numbers | |
| | [CONTRIBUTING.md](CONTRIBUTING.md) | Doc-first contribution rules, commit format | |
|
|
| --- |
|
|
| ## Maintenance rule |
|
|
| When you add a feature or a new doc file, add a one-line entry in the appropriate table above. See [CONTRIBUTING.md](CONTRIBUTING.md) for the full doc-first rule. |
|
|
| *Last updated: 2026-04-26.* |
|
|