Video-to-Video
English
nielsr HF Staff commited on
Commit
336537d
·
verified ·
1 Parent(s): 0ca8acb

Improve model card metadata and content

Browse files

Hi! I'm Niels from the Hugging Face community science team.

This PR improves the model card for DecMem. Key changes include:
- Updating the `pipeline_tag` to `text-to-video` to ensure the model is correctly categorized and discoverable in the Hub's gallery.
- Ensuring the paper, project page, and code repository are clearly linked.
- Maintaining the installation and inference instructions provided in the official repository.

Files changed (1) hide show
  1. README.md +9 -7
README.md CHANGED
@@ -1,15 +1,15 @@
1
  ---
2
- pipeline_tag: video-to-video
3
- license: apache-2.0
4
- language:
5
- - en
6
  base_model:
7
  - Wan-AI/Wan2.1-T2V-1.3B
 
 
 
 
8
  ---
9
 
10
  # DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory
11
 
12
- We propose DecMem, a decoupled memory architecture that employs Sparse Global Memory for efficient fine-grained access to global history and Anchored Local Memory for stable and high-quality extrapolation.
13
 
14
  [**Project Page**](https://jeffreyyzh.github.io/DecMem-Page/) | [**Paper**](https://arxiv.org/abs/2605.31336) | [**Code**](https://github.com/KlingAIResearch/DecMem)
15
 
@@ -23,7 +23,7 @@ huggingface-cli download Wan-AI/Wan2.1-T2V-1.3B \
23
  --local-dir wan_models/Wan2.1-T2V-1.3B
24
  ```
25
 
26
- Download DecMem trained checkpoints from HuggingFace:
27
 
28
  ```bash
29
  huggingface-cli download KlingTeam/DecMem --local-dir checkpoints
@@ -38,7 +38,9 @@ checkpoints/
38
 
39
  ## Quick start
40
 
41
- We provide the example video-pose pairs for quick inference. The inference is Block-by-block causal denoising manner with KV cache.
 
 
42
 
43
  ```bash
44
  bash scripts/infer_example.sh
 
1
  ---
 
 
 
 
2
  base_model:
3
  - Wan-AI/Wan2.1-T2V-1.3B
4
+ language:
5
+ - en
6
+ license: apache-2.0
7
+ pipeline_tag: text-to-video
8
  ---
9
 
10
  # DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory
11
 
12
+ DecMem is a decoupled memory architecture designed for consistent, long-horizon world generation. It employs **Sparse Global Memory** for efficient fine-grained access to global history and **Anchored Local Memory** for stable and high-quality extrapolation. This approach enables minute-level controllable long video generation with high fidelity and consistency.
13
 
14
  [**Project Page**](https://jeffreyyzh.github.io/DecMem-Page/) | [**Paper**](https://arxiv.org/abs/2605.31336) | [**Code**](https://github.com/KlingAIResearch/DecMem)
15
 
 
23
  --local-dir wan_models/Wan2.1-T2V-1.3B
24
  ```
25
 
26
+ Download DecMem trained checkpoints:
27
 
28
  ```bash
29
  huggingface-cli download KlingTeam/DecMem --local-dir checkpoints
 
38
 
39
  ## Quick start
40
 
41
+ We provide example video-pose pairs for quick inference. The inference is performed in a block-by-block causal denoising manner with KV cache.
42
+
43
+ To run the inference, follow the installation instructions in the [official repository](https://github.com/KlingAIResearch/DecMem) and run:
44
 
45
  ```bash
46
  bash scripts/infer_example.sh