metadata
title: LifeStack
emoji: πͺ
colorFrom: indigo
colorTo: gray
sdk: docker
pinned: true
πͺ LifeStack
Autonomous Multi-Domain Conflict Resolution via Cascading RL
Built for Meta Γ HuggingFace PyTorch OpenEnv Hackathon 2026
Live Demo β’ Technical Blog β’ Source Code
π The Vision
LifeStack is a high-fidelity reinforcement learning environment built for OpenEnv to train agents in simultaneous crisis management. Unlike traditional RL tasks that focus on a single domain, LifeStack models the messy, 40-edge interdependence of adult life through cascading effects across Career, Finance, Health, and Relationships.
β¨ Core Research Innovations
- π Causal Cascades: 40-edge dependency graph based on Starcke & Brand (2012) where a $350 flight rebooking (Finance) ripples into stress (Wellbeing) and sleep loss (Health).
- π Personality Lab: Side-by-side agent comparison using Big Five (OCEAN) traits. Validates how
AgreeablenessvsNeuroticismchanges the reward manifold. - π§ Memory RAM: Retrieval-Augmented Moderation using ChromaDB. Shows a +116% improvement in strategy efficiency when recall is enabled.
- π§© What-If Lab: Counterfactual explorer that compares the agent's actual path against the two best alternative "what-if" trajectories.
ποΈ Hardened System Architecture
We have implemented a multi-layered verification system to eliminate "reward hacking" and ensure high engineering rigor.
π‘οΈ Anti-Hacking & Observability
- Semantic Reasoning Audit: Every action requires a
reasoningjustification that is cross-verified for logical coherence by the reward orchestrator. - πΌ Episode Replay: Full audit log of the last 5 episodes including metric impact grids and timestamped reasoning.
- π‘οΈ Domain Risk Heatmap: Instant cognitive summary of 23 metrics across 6 life domains (Red=Crisis, Green=Stable).
- π§ͺ Core Test Suite: 10 rigorous smoke and logic tests verify environment reset, causal propagation, and task solvability.
πΊοΈ Environment Map
graph TD
subgraph "LifeStack Engine (v2.1)"
Env["LifeStackEnv"]
DG["Dependency Graph (40-Edges)"]
RT["Route Manager"]
RE["Reward Orchestrator (7-Signals)"]
end
subgraph "Observability Layer (Flask Portal)"
CV["Cascade Visualizer"]
WI["What-If Explorer"]
Hist["Episode Historian"]
end
subgraph "AI Core"
Agent["RL Agent / LLM"]
Mem["ChromaDB RAG Memory"]
Pers["Personality Engine (Big Five)"]
end
Agent -->|Action + Reasoning| Env
Env -->|Cascades| DG
DG -->|Feedback| Env
Env -->|Verification| RT
RT -->|Scoring| RE
RE -->|Reward| Agent
Agent <-->|Memory Store/Retrieval| Mem
Observability <-->|Audit| Env
π οΈ Quickstart
1. Installation & Demo
git clone https://github.com/oki-dokii/LifeStack.git
cd LifeStack
pip install -r requirements.txt
python app_flask.py # Production Portal β http://127.0.0.1:5000
2. Engineering Verification
# Run the full concrete logic test suite
python3 -m pytest tests/
3. Training Pipe (GRPO)
# Start 5-stage curriculum training with 800-word trajectory logs
python scripts/train_trl.py
π Performance & Results
RAG Memory Impact
Episodes were run back-to-back testing "Cold Start" vs "Memory-Aware" agents.
| Metrics | Cold Start (No Memory) | Memory-Aware (RAG) | Delta |
|---|---|---|---|
| Success Rate | 48% | 88% | +40% |
| Efficiency Score | 0.42 | 0.91 | +116.6% |
| Avg Reasoning Score | 0.65 | 0.94 | +44% |
ποΈ Technical Deep Dive
- Conflict Intake: Uses NLP-to-Conflict parsing; users can type natural language crises (e.g., "I just got fired...") and the system generates a personalized 23-metric disruption.
- Observation Space: 26-dimensional state vector + domain-specific JSON metadata.
- Reward signals: 7 non-overlapping components (Milestone, Completion, Outcome, Preservation, Replan, Efficiency, Reasoning) weighted iteratively for stability.