Add files using upload-large-folder tool

2a55985 verified 14 days ago

21.2 kB

	# DoAtlas 项目进度恢复文档

	> 生成时间：2026-04-27 22:35 UTC
	> 配套压缩包：`exports/doatlas-snapshot-20260427-2234.zip`（约 173 MB）
	> 适用版本：`replit.md` 中所述「Wave B + #221 通用 6 通道 Reviewer」之后的状态
	> 本文目的：换台机器、换个 Replit、或者隔几周回来，能照着这份文档把项目状态完整恢复，并接着没跑完的待办继续推进。

	---

	## 1. 项目当前定位

	DoAtlas 是一个生物医学研究协同工作台，pnpm 单仓多包（monorepo）+ Node 24 + TypeScript 5.9 + Postgres + Python 研究引擎。当前阶段已经完成：

	- 三波前端 + 后端基础（视觉打磨、核心产品形态、性能/可达性）
	- 第四波 Python 研究引擎接入（HMAC 内部桥、6 个工具、DuckDB 快照）
	- 实时聊天 + ReAct 工具循环 + 后台任务面板
	- 长期跨会话记忆（opt-in）
	- 因果网/药物网持久化进 Postgres
	- Wave A 工具网注册中心 + Wave B 在线进化飞轮（影子 A/B、5 门 autoPromote、30 分钟 watchdog）
	- Task #221 通用 6 通道 Reviewer 落地，drugclip 跑通 promote → rollback 闭环（详见 `docs/wave-b-acceptance-report.md`）

	仓库共 95 个项目任务：55 已合并（MERGED）/ 13 已取消（CANCELLED）/ 27 待办（PROPOSED）。

	---

	## 2. 4 个 artifact 概览

	\| Artifact \| 类型 \| 端口 \| 路径前缀 \| 简介 \|
	\|---\|---\|---\|---\|---\|
	\| `@workspace/api-server` \| api（Express 5 + Drizzle） \| 8080 \| `/api` \| 业务 API、会话/聊天编排、工具路由、Wave B 飞轮、admin 面板 \|
	\| `@workspace/doatlas-web` \| web（Vite + React 19） \| 23232 \| `/` \| 用户工作台 SPA：聊天、图谱、排行、审阅、快照、admin \|
	\| `@workspace/research-engine` \| api（Flask + DuckDB） \| 8011 \| `/research-engine` \| Python 研究流水线 + 工具执行；只接受 HMAC 已签名内部请求 \|
	\| `@workspace/mockup-sandbox` \| design（Vite） \| 8081 \| `/__mockup` \| 内部组件预览，不参与生产 \|

	共享库（`lib/`）：

	- `lib/db` — Drizzle 模型 + 迁移（PostgreSQL）
	- `lib/api-spec` — OpenAPI v1.1（`docs/API_CONTRACT.md` 是中文契约）
	- `lib/api-zod`、`lib/api-client-react` — 由 `pnpm --filter @workspace/api-spec run codegen` 自动生成

	详细架构见 `docs/WORKFLOW.md`、`docs/REPLIT_SETUP.md`、`docs/LOCAL_SETUP.md`。

	---

	## 3. 已完成（MERGED，55 个）— 按主题分组

	> 排序按主题；每条带任务编号方便对照 `.local/tasks/` 下的源描述。

	### 3.1 后端基础修复 + 核心 API（Wave 0–1）
	- `#1` 修复后端崩溃导致工作台白屏与聊天报错
	- `#2` 实现登录、聊天等核心后端接口
	- `#3` 为前后端关键流程补充端到端测试
	- `#13` Fix production login CORS error
	- `#14` Show a clearer message when login is blocked by browser security

	### 3.2 视觉打磨 / 工作台 / 性能（Wave 1–3）
	- `#15` 第一波：视觉打磨（登录/工作台/设置全面升级到 Claude/ChatGPT 级别）
	- `#16` 第二波：核心产品形态（因果图谱+证据溯源+审核中心+5 任务模式+P2 功能）
	- `#17` 第三波：体验流畅与性能（首屏/虚拟化/SSE/键盘/移动端/a11y）
	- `#71` Make the Causal Network browsable in the workbench

	### 3.3 全栈生产化与部署
	- `#9` DoAtlas 全栈生产化（双 profile + 真实 LLM + 数据源 + 部署）

	### 3.4 Python 研究引擎接入（Wave 4）
	- `#18` 第四波：接入 Python 研究引擎（在线跑 M2-M10 流水线产新边）
	- `#26` Hook the research tools up to the real knowledge graph data
	- `#27` Guard the research bridge against replayed requests before real pipelines go live
	- `#28` Make replay protection survive running multiple research engine workers
	- `#32` Run the real fulltext, validation, and codex pipelines instead of placeholders
	- `#33` Mount the snapshot dataset into the research engine in production

	### 3.5 聊天与工具循环（ReAct + 自主代理）
	- `#25` Let the research assistant actually run and stream tool results in chat
	- `#36` End-to-end user journey QA on the live workbench
	- `#53` Make the chat assistant truly autonomous (plan, recover, self-check)
	- `#54` Replay agent activity for past assistant messages on demand
	- `#55` Apply the autonomous agent loop to background research pipelines
	- `#56` Add automated tests for the agent loop's recovery and self-check paths
	- `#57` Add a manual refresh option for the agent activity panel
	- `#82` Show partial agent activity in the in-chat assistant message too
	- `#83` Test the live progress panel updates as the engine streams snapshots

	### 3.6 后台任务（pipeline supervisor / jobs panel）
	- `#44` Show pending jobs in chat with progress and a follow-up when each finishes
	- `#45` Let users cancel a long-running research job from the chat
	- `#46` Show every past background job in the conversation timeline
	- `#47` Add a regression test for the conversation jobs history panel
	- `#48` Let users filter or search the background jobs history
	- `#51` Remember the user's preferred history filters across visits
	- `#52` Highlight matched search terms inside job summaries
	- `#59` Show the new pipeline activity panel in the workbench UI
	- `#60` Save background pipeline work so a server restart doesn't lose it
	- `#61` Add automated tests for the pipeline supervisor
	- `#69` Show partial pipeline progress in the agent activity panel

	### 3.7 长期记忆（cross-session memory）
	- `#58` Long-term cross-session memory (opt-in)
	- `#62` Let users edit saved memories, not just delete them
	- `#63` Group injected memories by category in the activity panel
	- `#64` Add automated tests for the long-term memory pipeline
	- `#84` Finalize long-term memory (#58 wiring + e2e)
	- `#85` Lock down the long-term memory status response so the UI never breaks
	- `#86` Type the research-job agent snapshot column so the build stops needing a manual cast

	### 3.8 因果网与药物网持久化
	- `#70` 冻结因果网与药物网接口设计 + 真实数据回环测试
	- `#72` Persist the Causal Network into the database for fast queries
	- `#73` Hook M3/M5 runners up to the new network format
	- `#79` Wire the chat agent and graph browser to the new persistent causal store
	- `#80` Pool causal-network estimates into a single number per bucket
	- `#81` Persist the Drug Network the same way as the Causal Network
	- `#87` Pool ratio measures (HR/OR/RR) on the log scale automatically
	- `#88` Refresh the literature summary after a paper is removed
	- `#89` Show the pooled literature estimate in the chat evidence drawer
	- `#90` Wire the chat agent and graph browser to read drug data from the database
	- `#91` Surface drug evidence by joining drugs to causal edges in one query
	- `#95` Test that drug evidence shows up in the side panel and chat answers

	> 备注：Wave A（工具网注册中心）/ Wave B（在线进化飞轮 + Reviewer 通用化）相关任务（#177、#181、#214、#215、#221 等）的实现细节与验收报告已合并到 `replit.md` 顶层 + `docs/wave-b-acceptance-report.md`，不在 95 个任务计数内（属于上层的能力框架阶段）。

	---

	## 4. 已取消（CANCELLED，13 个）— 含取消原因

	\| 任务 \| 标题 \| 取消原因（推断） \|
	\|---\|---\|---\|
	\| `#4` \| 把账号、对话和任务保存到数据库, 重启不丢失 \| 已被 `#2` 实现登录、聊天等核心后端接口覆盖（持久化随核心 API 一起做了） \|
	\| `#5` \| 让聊天接入真正的大模型, 给出有价值的回复 \| 已被 `#9` 全栈生产化（含真实 LLM 接入）覆盖 \|
	\| `#6` \| 完善研究任务、运行记录和制品的真实数据 \| 路径已经被 Wave 4 `#18` 重新规划成 Python 引擎接入 \|
	\| `#7` \| 在 CI 中自动运行端到端测试 \| 改走 `Test Gate` workflow + `scripts/test-gate.sh`（参见 `docs/TESTING.md`） \|
	\| `#8` \| 补充会话切换、新建会话与登出的端到端测试 \| 已并入 `#3` 的 e2e 套件与 Playwright 配置 \|
	\| `#10` \| Connect the chat UI to the live backend end-to-end \| 已被 `#9` + `#25` 覆盖 \|
	\| `#11` \| Ship the local-deployment kit (Codex CLI + GLM + MiniMax) \| 部分能力（Codex CLI adapter）改在 `docs/CODEX_CLI_ADAPTER.md` 单独维护，多 provider 走 Replit AI Integrations 反代 \|
	\| `#12` \| Publish a complete OpenAPI spec and wire it into CI \| 改成 `lib/api-spec` + Orval 生成 zod/hooks，CI 不强卡 \|
	\| `#39` \| Plug in the real LLM full-text extractor when an API key is available \| 已并入 `#32` 真实 pipeline 接入 \|
	\| `#40` \| Call the real Codex worker for analysis when its binary is mounted \| 同上，CodexCLI runner 在适配器里完成 \|
	\| `#76` \| Show the Causal Network as an interactive node-link diagram \| 改成表格 + bucket 浏览（`#71`），节点-链路图被搁置 \|
	\| `#77` \| Link drug trials to the literature evidence that mentions them \| 已被 `#89`/`#91` 覆盖（药物-证据 join 在 SQL 一次完成） \|
	\| `#78` \| Load the full DrugNetwork from SQLite when the upstream DB ships \| 改走 Postgres 持久化（`#81`/`#90`），SQLite 路径作废 \|

	---

	## 5. 仍待办（PROPOSED，27 个）— 按主题 + 依赖分组

	> 「依赖于」一栏标注的是该任务在创建时被记录的 dependsOn；当上游任务也是 MERGED 状态时表示当前已可启动。

	### 5.1 持久化与审核数据闭环（Wave 4 收尾）
	- `#19` 将审核决定持久化到 Postgres — 依赖 `#16`（已 MERGED，可起）
	- `#20` 将证据抽屉里的 audit_checks 完整展示出来 — 依赖 `#16`（可起）
	- `#21` 为工作台 4 个页面补充端到端回归测试 — 依赖 `#16`（可起）

	### 5.2 体验流畅 / 可达性 / 性能（Wave 3 收尾）
	- `#22` Let users resume interrupted chat replies when the network drops — 依赖 `#17`（可起）
	- `#23` Verify the whole app is keyboard-and-screen-reader friendly — 依赖 `#17`（可起）
	- `#24` Measure real-world page speed with automated performance checks — 依赖 `#17`（可起）

	### 5.3 聊天与工具循环增强
	- `#29` Show research-engine progress in the chat (status, partial results, errors) — 依赖 `#25`（可起）
	- `#30` Make the research-vs-chat detector smarter than keyword matching — 依赖 `#25`（可起）
	- `#31` Cover the chat tool loop with automated regression tests — 依赖 `#25`（可起）
	- `#41` Make the assistant always answer the user's question, even when it runs research tools — 依赖 `#36`（可起）
	- `#42` Stop the assistant from putting emojis and decorative ASCII boxes in answers — 依赖 `#36`（可起）
	- `#43` Stop firing literature searches on every clinical-intake turn — 依赖 `#36`（可起）

	### 5.4 HMAC / 引擎运维
	- `#34` Set up shared replay protection automatically in production — 依赖 `#28`（可起）
	- `#35` Stop unbounded growth of the nonce table when the engine sits idle — 依赖 `#28`（可起）

	### 5.5 数据集 / 部署 pipeline
	- `#37` Auto-update the snapshot pin when a newer dataset is downloaded — 依赖 `#33`（可起）
	- `#38` Make the dataset download workflow run automatically before deployment — 依赖 `#33`（可起；目前 `Dataset Download` workflow 处于 failed 状态，先修后再做这两个）

	### 5.6 后台任务（cancel + history）
	- `#49` Cancel running tasks even after refreshing the page — 依赖 `#45`（可起）
	- `#50` Cover the cancel-job flow with automated tests — 依赖 `#45`（可起）

	### 5.7 长期记忆增强
	- `#65` Let users hide entire memory categories from each turn's prompt — 依赖 `#63`（可起）
	- `#66` Show injected memories on past assistant messages, not just fresh ones — 依赖 `#63`（可起）

	### 5.8 后台 pipeline activity 流式
	- `#67` Stream agent activity for background jobs while they're still running — 依赖 `#59`（可起）
	- `#68` Test the background job activity panel end-to-end — 依赖 `#59`（可起）

	### 5.9 因果网下游消费
	- `#74` Wire downstream consumers (chat, graph, ranking) to the new local-causal-net files — 依赖 `#73`（可起）
	- `#75` Expose the bucket aggregator as a chat tool so users can refresh the literature index — 依赖 `#73`（可起）

	### 5.10 Postgres 化图谱后续
	- `#92` Backfill production graph data into the new persistent store — 依赖 `#79`（可起）
	- `#93` Show the richer evidence card from the persistent store on the edge drawer — 依赖 `#79`（可起）
	- `#94` Cover the Postgres-backed graph routes with integration tests — 依赖 `#79`（可起）

	> 推荐启动顺序：先把 5.1（审核持久化）+ 5.5（数据集 pipeline 修复）打通，让生产/部署链条不再依赖手工步骤；然后挑 5.10 + 5.4 收尾；体验类（5.2/5.3/5.6）可以并行。5.7/5.8/5.9 都是已有功能的延伸，可独立排期。

	---

	## 6. 关键运行入口与 workflow 状态

	### 6.1 4 个 artifact 启动方式

	```bash
	# 一次安装
	pnpm install

	# Python 研究引擎
	cd artifacts/research-engine && uv pip install --system -e . && cd -

	# 单独启动（也可由 Replit workflows 自动起）
	pnpm --filter @workspace/api-server run dev # 8080
	pnpm --filter @workspace/doatlas-web run dev # 23232 (Vite)
	pnpm --filter @workspace/research-engine run dev # 8011
	pnpm --filter @workspace/mockup-sandbox run dev # 8081
	```

	### 6.2 当前 Replit workflow 状态（截至本文档生成时）

	\| Workflow \| 用途 \| 状态 \|
	\|---\|---\|---\|
	\| `artifacts/api-server: API Server` \| 启动 Express api-server \| running \|
	\| `artifacts/doatlas-web: web` \| 启动 Vite 前端 \| running \|
	\| `artifacts/research-engine: Research Engine` \| 启动 Flask 引擎 \| running \|
	\| `artifacts/mockup-sandbox: Component Preview Server` \| 内部组件预览 \| running \|
	\| `Test Gate` \| `bash scripts/test-gate.sh` 全量测试门 \| finished（最近一次完成） \|
	\| `Provider Smoke Check` \| `pnpm run smoke:providers` 探活 LLM 适配器 \| failed ⚠️（informational 性质,不阻断生产;通常因第三方 LLM provider 当下不可达或 key 缺失,排查时先确认 `AI_INTEGRATIONS_*` 是否注入） \|
	\| `Dataset Download` \| `scripts/dataset-download.sh` 拉 808MB 快照 \| failed ⚠️（建议在 ModelScope token 有效后重跑；不影响业务运行，但 `inspect_target` 在没有快照时会回退 `snapshot_unavailable`） \|
	\| `agent-loop-tests` \| `pnpm --filter @workspace/api-server test` \| failed ⚠️（已知有 4 个 ephemeral schema 同步遗留 + 1 个 memory.test.mjs 长尾挂起，参见 `docs/wave-b-acceptance-report.md` CAVEAT-D；与生产路径无关） \|

	### 6.3 关键脚本

	- `scripts/test-gate.sh` — 单一 CI 入口（hermetic 套件 ~25–60s）；带 `--coverage` 出 c8 报告
	- `scripts/start-api-server.sh` — Boot guard：先探活 healthz，命中就 sleep；没有就 pkill 再 exec
	- `scripts/start-research-engine.sh` — 同上，避免 8011 端口竞争
	- `scripts/dataset-download.sh` — DuckDB 快照下载，已存在则短路
	- `scripts/post-merge.sh` — 合并后自动 `pnpm install` + `drizzle-kit push` + `seed:prod`
	- `scripts/smoke-providers.ts` — LLM 多家供应商真流式探活

	详细说明见 `docs/TESTING.md`、`docs/WORKFLOW.md`。

	---

	## 7. 换台机器 / 换环境后的恢复步骤

	> 假设你拿到的是 `exports/doatlas-snapshot-YYYYMMDD-HHMM.zip`，目标机器是 Linux + Node 24 + Python 3.11（或 Replit）。

	1. 解压
	```bash
	unzip exports/doatlas-snapshot-20260427-2234.zip -d doatlas/
	cd doatlas
	```

	2. 安装 Node 依赖（必须 pnpm 9+）
	```bash
	corepack enable
	pnpm install
	```

	3. 安装 Python 依赖（推荐 uv）
	```bash
	curl -LsSf https://astral.sh/uv/install.sh \| sh
	uv sync # 用 uv.lock 还原
	# 或者手动：
	cd artifacts/research-engine && uv pip install --system -e . && cd -
	```

	4. 配置环境变量
	```bash
	cp .env.replit.example .env
	# 至少填：SESSION_SECRET, DOATLAS_BRIDGE_SECRET (openssl rand -hex 32),
	# DATABASE_URL, INVITE_CODES (生产)
	# AI_INTEGRATIONS_* 由 Replit 注入；本地需自填或用 Replit 反代
	```

	5. 数据库初始化
	```bash
	pnpm --filter @workspace/db push
	pnpm --filter @workspace/api-server seed:prod # 可选：写入 invite/admin
	```

	6. 数据集快照（恢复 `inspect_target` 等工具）
	```bash
	bash scripts/dataset-download.sh
	# 或手动：把 .duckdb 放到 artifacts/research-engine/data/snapshots/SNAP:*.duckdb
	```

	7. 启动 4 个 artifact（本地用多个终端，Replit 用 workflows 自动起）
	```bash
	pnpm --filter @workspace/research-engine run dev &
	pnpm --filter @workspace/api-server run dev &
	pnpm --filter @workspace/doatlas-web run dev &
	pnpm --filter @workspace/mockup-sandbox run dev &
	```

	8. 跑测试门
	```bash
	pnpm run typecheck
	bash scripts/test-gate.sh
	pnpm --filter @workspace/research-engine run test
	```

	9. 冒烟（可选）
	```bash
	pnpm run smoke:providers # LLM 适配器探活
	curl -s http://127.0.0.1:8011/internal/healthz
	curl -s http://127.0.0.1:8080/api/research-engine/healthz
	```

	10. 拉起来后第一时间检查
	- 工作台能登录（用 invite code 注册一个测试账号）
	- 聊天能联通真实模型（看 SSE `content_delta`）
	- `/admin/networks` 能看到 `drugclip_v1` 的 active 变体
	- `Test Gate` workflow 能跑通（部分 ephemeral schema 失败可参考 `docs/wave-b-acceptance-report.md` CAVEAT-D 评估）

	---

	## 8. 已知风险 / 未完事项摘要

	- `Dataset Download` workflow 当前 failed — `inspect_target` 在快照缺失时返回 `snapshot_unavailable` 是预期降级，但生产应尽快修复。
	- `agent-loop-tests` workflow 当前 failed — 4 个 ephemeral schema 同步遗留（缺 `problem_classes.status` 列）+ 1 个 `memory.test.mjs` promise 长尾挂起。已在 `docs/wave-b-acceptance-report.md` CAVEAT-D 标注，不影响生产路径，跟踪在内部任务 #218。
	- Wave B drugclip runner 信号 expressiveness 受限 — `synthetic_ref` 模式下 v1/v2 SMILES top-K cosine 分布几乎一致，5 门控会走 `skipped_ci`。需要拉真 DUD-E `actives.smi` 入库（参考 `docs/wave-b-acceptance-report.md` CAVEAT-A 修订版 + 后续任务 #220）。
	- Wave B autoRollback watchdog cron — 当前由 driver 单次手动调用证明路径正确；生产 cron 部署需后续做（CAVEAT-E）。
	- HMAC nonce 共享存储 — 单进程 OK，多 worker 需要 Postgres-backed nonce 表（待办 `#34`、`#35`）。
	- 审核决定写回 Postgres — 当前 `/review/edges/:id/decision` 只更新内存 + 审计 JSONL（待办 `#19`）。
	- `reference/` 大目录未打入快照 — 该目录是 2.5 GB 的 `doatlas-handoff` 原始 dump，可由 `Dataset Download` workflow 重新拉取，未纳入压缩包。

	### 8.1 文档维护约定

	- `replit.md` — 长期项目记忆 + 用户偏好；任何「架构级」变更（新增 artifact、新增依赖、改 schema、改进化策略）都要顺手更新顶部 Overview / System Architecture。
	- `docs/PROGRESS.md`（本文件） — 每次「打包快照 / 离开项目较久」时重新生成，对应一份带时间戳的 zip。注意：本文件不是日常迭代日志，而是「换环境恢复指南」。
	- `docs/wave-b-acceptance-report.md` — Wave B 飞轮的真路径验收证据，与代码变更同步更新。
	- `docs/WORKFLOW.md` — 流程级地图；与 `docs/API_CONTRACT.md` 冲突时以契约为准并提交 PR 同步本文。
	- *`.local/tasks/.md`** — 项目任务原始描述（每个任务 1 个 markdown），与 `listProjectTasks()` 返回值一一对应。

	---

	## 9. 下次更新本文件前的检查清单

	- [ ] `listProjectTasks({ includeDescription: false })` 返回的 MERGED / CANCELLED / PROPOSED 计数与本文 §3/§4/§5 是否一致？如不一致，按主题更新对应分组。
	- [ ] `replit.md` 顶部 `Recent Changes`（如有）是否指向最新一个压缩包路径？
	- [ ] 4 个业务 workflow + `Test Gate` + `Provider Smoke Check` + `Dataset Download` + `agent-loop-tests` 的状态是否还和 §6.2 一致？failed 的有没有新增/修复？
	- [ ] `docs/wave-b-acceptance-report.md` 中的 CAVEAT-A/B/C/D/E 是否还成立？
	- [ ] 新生成的快照里是否包含：`replit.md`、`docs/`（含本文件本身）、`pnpm-workspace.yaml`、`pnpm-lock.yaml`、`pyproject.toml`、`uv.lock`、`.replit`、`.env.replit.example`、`.local/tasks/*.md`、`artifacts/{api-server,doatlas-web,research-engine,mockup-sandbox}`、`lib/`、`scripts/`、`data/`？
	- [ ] 是否仍正确排除：`node_modules/`、`.venv-doatlas/`、`.pythonlibs/`、`.cache/`、`.git/`、`dist-handoff/`、`.upm/`、`.config/`、`.agents/`、`reference/`、`exports/*`、各类 `__pycache__/`、`.pytest_cache/`、`.mypy_cache/`？
	- [ ] 压缩包体积是否仍 < 200 MB？若超出，在本文 §1 顶部明确标注。

	---

	> 末更：2026-04-27 22:35 UTC（与压缩包 `doatlas-snapshot-20260427-2234.zip` 同批）。