# Kush (Person D) Task List

Source of truth: `ReplicaLab_Comprehensive_Task_Division.md`

---

## Current status

- All Person D implementation and storytelling tasks are recorded as complete in the source-of-truth backlog.
- The frontend now presents the demo in the intended order:
  - source paper
  - parsed replication brief
  - live negotiation
  - deterministic judge
  - training story
- The dashboard, episode page, training panel, and evaluation bench all build successfully after the latest refinement pass.

---

## Active focus

- No open Person D implementation blockers remain in the backlog.
- The remaining polish concerns demo execution quality:
  - keep the live script aligned with the new paper-to-training UI flow
  - swap packaged training demo data for live artifacts if a final run is ready
  - capture final screenshots or footage from the updated frontend

---

## Notes for demo prep

- Start the live walkthrough from `/episode?template=ml_benchmark&difficulty=medium`.
- Use the left panel to anchor the narrative in the source paper and parsed brief.
- Use the right-side training callout at episode end to connect the judged reward to the minimal Colab notebook.
- Use `/compare` as the seeded evaluation bench, not as the primary baseline-vs-trained story.