# Kian (Person A) Task Breakdown

Source of truth: `ReplicaLab_Comprehensive_Task_Division.md`

---

## Current status

- `FND 04`, `FND 08`, `FND 09`, `MOD 01` to `MOD 05`, `MOD 11`, `MOD 12` are complete
- Shared `AGT 05` is now complete, so the deterministic feasibility layer exists for both the Lab Manager path and the judge feasibility score
- `SCN 01` to `SCN 10` are complete, so the deterministic scenario layer exists in code
- `ENV 01` to `ENV 08` are all complete: the full environment lifecycle (reset, step, validate, Lab Manager response, termination, judge scoring, state snapshot, close) works end-to-end
- `JDG 01` to `JDG 06` plus `JDG 08` are complete: the deterministic reward pipeline is wired, the plain-English explanation layer exists, and the reward stack now has stronger regression coverage for ordering, substitution behavior, partial feasibility credit, and breakdown determinism
- `TST 01` to `TST 05` are complete with 36 env tests and 40 reward tests passing
- `MOD 06`, `SCN 13`, `AGT 09`, `JDG 11`, `ENV 11`, `ENV 10`, and `OBS 04` are now complete, so the only remaining Kian work is the previously blocked schema follow-on, `MOD 08`

Bounded-tool scope note:

1. Kian-owned scenario, judge, and environment tasks now need to support
   bounded `search`, `code_check`, and `image_inspection` traces without
   changing the outer action contract.
2. Training reward must remain deterministic and must not depend on live web.
3. Frozen evidence packs are the default training-time source of tool inputs.
4. Audio remains out of scope.
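
A minimal sketch of how points 2 to 4 could fit together, assuming a hypothetical `FrozenEvidencePack` class and `ToolTrace` record (neither name comes from the ReplicaLab codebase). Tool calls are answered from a pre-recorded mapping, so the same call always yields the same trace and nothing touches the live web; any tool outside the three bounded ones, such as audio, is rejected.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Mapping

# The three bounded tools named in the scope note; audio is deliberately absent.
ALLOWED_TOOLS = frozenset({"search", "code_check", "image_inspection"})


@dataclass(frozen=True)
class ToolTrace:
    """Hypothetical record of one bounded tool call."""
    tool: str
    query: str
    result: str


class FrozenEvidencePack:
    """Hypothetical read-only store of pre-recorded tool outputs."""

    def __init__(self, entries: Mapping[tuple[str, str], str]) -> None:
        # Copy the mapping so later changes by the caller cannot alter results.
        self._entries = dict(entries)

    def run(self, tool: str, query: str) -> ToolTrace:
        if tool not in ALLOWED_TOOLS:
            raise ValueError(f"tool {tool!r} is out of scope")
        # A missing entry yields a fixed marker instead of a live lookup,
        # which keeps any reward computed from the trace deterministic.
        result = self._entries.get((tool, query), "NO_EVIDENCE")
        return ToolTrace(tool, query, result)


# Illustrative data only; real evidence packs would be frozen ahead of training.
pack = FrozenEvidencePack({("search", "buffer pH"): "pH 7.4 recommended"})
trace = pack.run("search", "buffer pH")
```

Because the trace is returned alongside the environment's normal outputs rather than as a new action type, a design like this would leave the outer action contract unchanged.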

---

## Recommended execution order

1. `MOD 08` -- expand schema and validator unit tests

---

## Why this order

- `SCN 13` is complete, so the normalized scenario layer now carries booking and scheduling conflicts as structured deterministic data.
- `AGT 09` is complete, so the grounded Lab Manager checker, suggestion, and response stack now has deterministic regression coverage.
- `JDG 11` is complete and `ENV 11` is now integrated, so terminal env outputs and replay-facing state carry the canonical audit payload end to end.
- `ENV 10` and `OBS 04` are now complete, so the environment stack has deterministic replay and broader regression coverage on top of the completed `ENV 01` to `ENV 08` and `ENV 11` lifecycle.
- `MOD 08` is the only remaining Kian-owned implementation task, and it is now fully unblocked.