Spaces:

openenv-community
/

replicalab

Running

App Files Files Community

replicalab / docs /kush /task_list.md

maxxie114's picture

Initial HF Spaces deployment

80d8c84 2 days ago

|

history blame contribute delete

1.28 kB

Kush (Person D) Task List

Source of truth: ReplicaLab_Comprehensive_Task_Division.md

Current status

All Person D implementation and storytelling tasks are recorded complete in the source-of-truth backlog.
The frontend now presents the demo in the intended order:
- source paper
- parsed replication brief
- live negotiation
- deterministic judge
- training story
The dashboard, episode page, training panel, and evaluation bench all build successfully after the latest refinement pass.

Active focus

No open Person D implementation blockers remain in the backlog.
Remaining polish is demo execution quality:
- keep the live script aligned with the new paper-to-training UI flow
- swap packaged training demo data for live artifacts if a final run is ready
- capture final screenshots or footage from the updated frontend

Notes for demo prep

Start the live walkthrough from /episode?template=ml_benchmark&difficulty=medium.
Use the left panel to anchor the narrative in the source paper and parsed brief.
Use the right-side training callout at episode end to connect the judged reward to the minimal Colab notebook.
Use /compare as the seeded evaluation bench, not as the primary baseline-vs-trained story.