meta-r2 / docs /README.md
github-actions[bot]
Deploy Space snapshot
ddbc1ba
# LifeStack documentation
All project documentation lives under **`docs/`** (except the repository **[README.md](../README.md)** at the root, which is the HuggingFace Space and GitHub entry point).
**Repository:** [https://github.com/oki-dokii/Meta-R2](https://github.com/oki-dokii/Meta-R2)
---
## Quick links
| Doc | Use |
|-----|-----|
| [training_guide.md](training_guide.md) | Install, train, resume, push to Hub |
| [train_trl.md](train_trl.md) | `scripts/train_trl.py` CLI reference |
| [reward.md](reward.md) | All 10 GRPO reward functions and anti-hacking design |
| [model_card.md](model_card.md) | Hugging Face model card (post to Hub) |
| [blog.md](blog.md) | Engineering retrospective / HF blog draft |
---
## Core system
| Doc | Code | Description |
|-----|------|-------------|
| [lifestack_env.md](lifestack_env.md) | `core/lifestack_env.py` | `LifeStackEnv`, `WorldEngine`, `reset/step/rollout`, OpenEnv shim |
| [reward.md](reward.md) | `core/reward.py`, `scripts/train_trl.py` | `compute_reward`, `compute_task_reward`, 10 GRPO reward heads |
| [task.md](task.md) | `core/task.py` | `Task`, `Route`, `Milestone`, `ExoEvent` dataclasses |
| [memory.md](memory.md) | `agent/memory.py` | `LifeStackMemory`, ChromaDB collections, few-shot injection |
| [conflict_generator.md](conflict_generator.md) | `agent/conflict_generator.py` | Crisis scenario templates, `TaskGenerator`, 8 domains |
---
## Training & configuration
| Doc | Description |
|-----|-------------|
| [training_guide.md](training_guide.md) | End-to-end guide: Colab, server, resume, common errors |
| [train_trl.md](train_trl.md) | `LifeStackGRPOTrainer`, JSON-boundary masking, 5-stage curriculum |
| [configuration.md](configuration.md) | `GRPOConfig` via `_make_grpo_config`, reward weights per stage |
| [kaggle_train.md](kaggle_train.md) | Notebook-specific setup (T4, session timeouts, OOM tips) |
---
## Scripts & evaluation
| Doc | Description |
|-----|-------------|
| [eval.md](eval.md) | `scripts/eval.py` random-policy baseline |
| [scripts.md](scripts.md) | All scripts under `scripts/` with usage examples |
---
## Application & deployment
| Doc | Description |
|-----|-------------|
| [app.md](app.md) | Flask demo `app_flask.py` on port 7860, routes, `MODEL_REGISTRY` |
| [DEPLOYMENT.md](DEPLOYMENT.md) | `start.sh` dual-service, ports 7860 + 8000, env vars, Docker |
---
## Meta & pitch
| Doc | Description |
|-----|-------------|
| [implementation_summary.md](implementation_summary.md) | Engineering summary, training timeline, code map |
| [mentor_pitch.md](mentor_pitch.md) | 60–90 second pitch with honest numbers |
| [CONTRIBUTING.md](CONTRIBUTING.md) | Doc-first contribution rules, commit format |
---
## Maintenance rule
When you add a feature or a new doc file, add a one-line entry in the appropriate table above. See [CONTRIBUTING.md](CONTRIBUTING.md) for the full doc-first rule.
*Last updated: 2026-04-26.*