explcre commited on
Commit
8aea234
·
verified ·
1 Parent(s): 355a44d

Upload _claude_memory/feedback_experiments_md_live_doc.md with huggingface_hub

Browse files
_claude_memory/feedback_experiments_md_live_doc.md ADDED
@@ -0,0 +1,32 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ name: EXPERIMENTS.md is the live paper-grade master doc — update on every new result
3
+ description: Standing instruction — every time a new experiment result lands, append/update the corresponding row in regureasoner_loop/docs/EXPERIMENTS.md, not just in results/*.md
4
+ type: feedback
5
+ originSessionId: 4037f43b-2133-46c6-84bd-02f7d454ec8b
6
+ ---
7
+ EXPERIMENTS.md (`regureasoner_loop/docs/EXPERIMENTS.md` on the
8
+ `mllm-integrate-server2` branch) is the canonical paper-section
9
+ draft target. Every time a new bench/training/oracle result lands,
10
+ update the matching section in EXPERIMENTS.md AND push to the
11
+ remote branch.
12
+
13
+ **Why:** the user explicitly asked: "update this EXPERIMENTS.md
14
+ every time you have new experiment results" (2026-04-27). The doc
15
+ is intended as the experiment section for a top AI conference paper;
16
+ results/*.md files are raw data tables that get referenced from
17
+ EXPERIMENTS.md, but EXPERIMENTS.md is the narrative + headline
18
+ numbers source of truth.
19
+
20
+ **How to apply:**
21
+ * When a metrics.json / genqual.json / training final.json lands,
22
+ open EXPERIMENTS.md, find the matching §3.x or §4-§7 section,
23
+ replace any "queued" / "running" placeholder with the real number,
24
+ and append a paired comparison block when both raw and enriched
25
+ rows are available.
26
+ * Bump §8 "Pipeline state (live)" to reflect the new state and what
27
+ triggered next.
28
+ * Add a row to §10 "Code commits" with the commit SHA + 1-line note.
29
+ * Always commit + push to `mllm-integrate-server2` immediately
30
+ (companion `results/<task>_*.md` if the per-result table is long).
31
+ * The HF mirror of EXPERIMENTS.md is **blocked** by sandbox policy —
32
+ do not retry; the canonical copy is the GitHub branch.