---
license: cc-by-4.0
language:
- en
tags:
- video-generation
- world-model
- memory
- action-conditioned
- wan
library_name: diffsynth
---
# Echo-Memory — Wan 2.1 1.3B memory baseline checkpoints
Paper-aligned **epoch-0** fine-tunes for [Echo-Memory](https://github.com/Echo-Team-Joy-Future-Academy-JD/Echo-Memory) ([project page](https://echo-team-joy-future-academy-jd.github.io/Echo-Memory/)).

**Backbone:** [Wan-AI/Wan2.1-T2V-1.3B](https://huggingface.co/Wan-AI/Wan2.1-T2V-1.3B)

**Training:** static in-domain pool · 1 epoch · **30,000 steps** · 640×352 · 81-frame chunks

**Layout:** `{row_id}/epoch-0.safetensors`
## Checkpoint index
| Family | Paper row | HF path | Steps |
| --- | --- | --- | ---: |
| Raw context | Context K=1 | `context_k1/epoch-0.safetensors` | 30,000 |
| Raw context | Context K=20 | `context_k20/epoch-0.safetensors` | 30,000 |
| Spatial | Spatial Memory | `spatial_mem/epoch-0.safetensors` | 30,000 |
| State-space | Block-wise SSM | `block_wise_ssm_two_chunk/epoch-0.safetensors` | 30,000 |
| State-space | Legacy Hybrid (VideoSSM) | `videossm_hybrid/epoch-0.safetensors` | 30,000 |
| Spatial | concat text (ablation) | `spatial_concat_text_two_chunk/epoch-0.safetensors` | 30,000 |
| Spatial | inject none (ablation) | `spatial_inject_none_two_chunk/epoch-0.safetensors` | 30,000 |
| Spatial | cross-attn t32 (ablation) | `spatial_cross_attn_readout_t32_g4_two_chunk/epoch-0.safetensors` | 30,000 |
| State-space | SSM ctx1 / every4 / hint21 | `ssm_ablation_ctx1_every4_hint21/epoch-0.safetensors` | 30,000 |
| State-space | SSM ctx5 / every1 / hint21 | `ssm_ablation_ctx5_every1_hint21/epoch-0.safetensors` | 30,000 |
| State-space | SSM ctx5 / every4 / hint81 | `ssm_ablation_ctx5_every4_hint81/epoch-0.safetensors` | 30,000 |

Context K=5 and FramePack compression rows are not yet released as `epoch-0` weights.
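
The `{row_id}/epoch-0.safetensors` layout can be captured in a small lookup, which may be handy when scripting over several rows. The sketch below is illustrative only: `ROWS` and `checkpoint_path` are hypothetical names that are not part of the Echo-Memory code, and the row IDs are copied from the table above.

```python
from pathlib import PurePosixPath

# Row IDs copied from the checkpoint index, grouped by family.
# ROWS and checkpoint_path are hypothetical names used for illustration.
ROWS = {
    "Raw context": ["context_k1", "context_k20"],
    "Spatial": [
        "spatial_mem",
        "spatial_concat_text_two_chunk",
        "spatial_inject_none_two_chunk",
        "spatial_cross_attn_readout_t32_g4_two_chunk",
    ],
    "State-space": [
        "block_wise_ssm_two_chunk",
        "videossm_hybrid",
        "ssm_ablation_ctx1_every4_hint21",
        "ssm_ablation_ctx5_every1_hint21",
        "ssm_ablation_ctx5_every4_hint81",
    ],
}


def checkpoint_path(row_id: str, root: str = "") -> str:
    """Return `<root>/<row_id>/epoch-0.safetensors` for a released row."""
    known = {r for rows in ROWS.values() for r in rows}
    if row_id not in known:
        raise KeyError(f"unknown row id: {row_id!r}")
    return str(PurePosixPath(root) / row_id / "epoch-0.safetensors")


# List every released checkpoint path, relative to the repo root.
for family, rows in ROWS.items():
    for row in rows:
        print(f"{family}: {checkpoint_path(row)}")
```

Unreleased rows (for example Context K=5) are deliberately absent from the mapping, so asking for them raises `KeyError` instead of producing a path that does not exist.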
## Download
```bash
pip install -U "huggingface_hub[cli]"
# one row
huggingface-cli download Echo-Team/Echo-Memory context_k1/epoch-0.safetensors --local-dir ./ckpts
# all rows
huggingface-cli download Echo-Team/Echo-Memory --local-dir ./ckpts
```
Keep the row subdirectory in the local path (e.g. `./ckpts/spatial_mem/epoch-0.safetensors`).
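
The same single-row download can also be done from Python. This is a minimal sketch, assuming `huggingface_hub` is installed; `row_filename` and `download_row` are hypothetical helper names, while `hf_hub_download` is the library function and the repo ID is the one used in the CLI commands above.

```python
def row_filename(row_id: str) -> str:
    """Repo-relative file name for one row, following the documented layout."""
    return f"{row_id}/epoch-0.safetensors"


def download_row(row_id: str, local_dir: str = "./ckpts") -> str:
    """Download one row's checkpoint and return its local path.

    Hypothetical helper: passing `local_dir` makes hf_hub_download mirror
    the repo structure, so the row subdirectory is kept on disk.
    """
    # Imported lazily so this module loads even without huggingface_hub.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id="Echo-Team/Echo-Memory",
        filename=row_filename(row_id),
        local_dir=local_dir,
    )
```

Calling `download_row("spatial_mem")` should place the file at `./ckpts/spatial_mem/epoch-0.safetensors`, matching the path expected by the evaluation scripts.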
## Use with Echo-Memory code
Clone [Echo-Memory](https://github.com/Echo-Team-Joy-Future-Academy-JD/Echo-Memory), install the environment, then:
```bash
export WAN_BASE_MODEL=/path/to/Wan2.1-T2V-1.3B
export DATASET_BASE_PATH=data/Context-as-Memory-Dataset
export PYTHONPATH=$PWD:${PYTHONPATH:-}
export CKPT=./ckpts/spatial_mem/epoch-0.safetensors
# in-domain replay + revisit
bash eval/v2/run_static_consistency_loop_and_revisit.sh
bash eval/v2/run_basic_replay_gt.sh
# open-domain revisit (first frames in repo)
PHASE=stage1 OOD_DIR=assets/opendomain_revisit \
bash eval/v2/revisit_suite/run_one_click_revisit_eval.sh
```
Memory runtime flags are inferred from the checkpoint path via `env/memory_baseline_runtime.py`, so keep the HF folder names above unchanged in your local path.
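
Because the flags depend on the path, it can help to sanity-check `CKPT` before launching an evaluation. The sketch below is not the logic in `env/memory_baseline_runtime.py`, which is not reproduced here; `row_id_from_ckpt` is a hypothetical helper that assumes only the `{row_id}/epoch-0.safetensors` layout.

```python
from pathlib import Path


def row_id_from_ckpt(ckpt: str) -> str:
    """Return the row folder name from a checkpoint path.

    Hypothetical check: it only verifies the file name and that a parent
    folder exists; it does not reproduce the repo's flag inference.
    """
    path = Path(ckpt)
    if path.name != "epoch-0.safetensors":
        raise ValueError(f"expected an epoch-0.safetensors file, got {path.name!r}")
    row_id = path.parent.name
    if not row_id:
        raise ValueError("checkpoint path has no row subdirectory")
    return row_id


print(row_id_from_ckpt("./ckpts/spatial_mem/epoch-0.safetensors"))  # spatial_mem
```

A bare `epoch-0.safetensors` with no parent folder is rejected, which catches the case where the file was moved out of its row subdirectory.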
Full docs: [doc/checkpoints.md](https://github.com/Echo-Team-Joy-Future-Academy-JD/Echo-Memory/blob/main/doc/checkpoints.md)
## Citation
Echo-Memory: A Controlled Study of Memory in Action World Models — Echo Team @ Joy Future Academy, JD ([ResearchGate DOI](https://doi.org/10.13140/RG.2.2.19906.34248)).