Commit History

feat(hints): ADR-009 layered HintGenerator; accepted
84740d4

Codeseys committed on

feat(trainer): ADR-008 gate-3 live GRPO+SDPO smoke PASS; ADR-008 accepted
2a34df4

Codeseys committed on

feat(trainer): ADR-008 Dr.GRPO config + SDPO strict-alignment guard
bde5c5e

Codeseys committed on

docs(adr): add ADR-008/009/010 (Dr.GRPO+SDPO, layered hints, FeatureDeletionEnv)
36ab61e

Codeseys committed on

research: Composer 2.5 data-gen + targeted-textual-feedback deep-research wave
6049d00

Codeseys committed on

examples: add sdpo_real_trace_train_smoke — close the forward+backward+step link
7090729

Codeseys committed on

Wave 21c: verify PRIME-RL adapter parity against upstream source (byte-for-byte)
c98928e

Codeseys committed on

Wave 21b: skip zero-signal SDPO on empty-recovery error turns + real-trace validation
d61036a

Codeseys committed on

Wave 21: close both Wave 20 debt items — chat-template alignment + structural is_error
6806cf7

Codeseys committed on

Wave 20: ModalSpawnExecutor — finish the Modal-backed serverless executor
a384097

Codeseys committed on

Wave 19: production-grade SDPO via ComposerDataCollator + adapter + collator fixes
03bf323

Codeseys committed on

Wave 18: 14 backlog items closed + 3-reviewer cross-family review
54efac8

Codeseys committed on

Wave 17: close all 5 audit FLAGs + SDPO context alignment + serverless re-exports
a84c060

Codeseys committed on

Wave 16: install ergonomics + gradient evidence + SDPO end-to-end example
c0a5ab7

Codeseys committed on

Wave 15: 4-angle multi-model self-critique caught 2 math BLOCKERs in primary loss kernels; fixed against upstream byte-for-byte + GSM8K example + ergonomics
e5add15

Codeseys committed on

Wave 14: close every Wave 13 review finding + 4 documentation files; Wave 14b: real PRIME-RL parity + multi-process DiLoCo convergence
d9dd3a5

Codeseys committed on

Wave 13: serverless DiLoCo + replaysim normalization + 3 distillation losses + PRIME-RL + Monarch
b266c31

Codeseys committed on

Wave 12: close V1-V8 brief — GPU smoke, SDPO firing, real-trace e2e
d88715c

Codeseys committed on

Wave 11: cross-model adversarial review + honest down-revision
f16fa23

Codeseys committed on

Wave 10 — packaging: composer_replication is now pip-installable
ac05fbf

Codeseys committed on

Tidy .gitignore (de-dup *.jsonl, restore section blank lines)
d52e126

Codeseys committed on

Spike 007: include synthetic_session.jsonl fixture in repo
a35a8d7

Codeseys committed on

Wave 7+8+9: spikes 006/007/008 — close vision-validation gaps V2/V5/V8
57af35d

Codeseys committed on

Wave 7: Phase 2-4 of deep work loop — backlog, parallel research, three ADRs
ac4bfb4

Codeseys committed on

Wave 6: vision validation self-audit (5/10 to 9/10 in 5 days, no GPU)
040eff8

baladithyab committed on

Wave 5: full publication-materials drafts (pre-experimental release set)
639a760

baladithyab committed on

Wave 4: data collator + loss composition smoke (38/38 tests pass)
157cdba

baladithyab committed on

Wave 3: integration architecture + spike-005 trainer skeleton (16 tests pass)
fd77f74

baladithyab committed on

Integrate Cursor blog directly + audit research note + add SDPO/OPSD link
1cede23

baladithyab committed on

Spike v0.0 laydown + spike 001 VALIDATED
35581fd

Codeseys committed on

Initial commit: Composer 2.5 Replication Framework — research synthesis
7165832

Codeseys committed on