Tevior committed on
Commit
a0ed767
·
verified ·
1 Parent(s): fd316d5

Update model card.

Files changed (1)
  1. README.md +61 -0
README.md ADDED
@@ -0,0 +1,61 @@
+ ---
+ license: mit
+ library_name: pytorch
+ tags:
+ - motion-generation
+ - text-to-motion
+ - humanml3d
+ - controllable-generation
+ - kv-control
+ pipeline_tag: text-to-motion
+ ---
+
+ # KV-Control (T-Concat v4 backbone)
+
+ Sparse-keyframe, multi-joint controllable text-to-motion generation. The
+ repository at [github.com/Tevior/KV-Control](https://github.com/Tevior/KV-Control)
+ contains the full training and inference code.
+
+ ## What is here
+
+ | Path | Content | Size |
+ |---|---|---|
+ | `base_t_concat_v4/model/net_best_fid.tar` | Pre-trained T-Concat v4 masked-transformer base (the paper's main backbone) | 168 MB |
+ | `kv_control/model/net_best_kps.tar` | KV-Control adapter trained on the base above | 520 MB |
+ | `vqvae/net_best_fid.pth` | Part-aware VQ-VAE tokenizer (128 codes × 6 parts) | 236 MB |
+ | `vqvae/skeleton_partition.json` | Skeleton partition for the part-aware VQ-VAE | 1 KB |
+ | `stats/{mean,std}.npy` | Normalization stats matching the released VQ-VAE | 4 KB |
+ | `clip/ViT-B-32.pt` | OpenAI CLIP ViT-B/32 visual + text encoder | 336 MB |
+ | `t2m/Comp_v6_KLD005/opt.txt + meta/` | Frozen evaluation encoder config & stats | 3 KB |
+ | `t2m/text_mot_match/model/finest.tar` | Pre-trained text-motion eval encoder (Guo et al., 2022) | 235 MB |
+ | `t2m/length_estimator/model/finest.tar` | Pre-trained motion-length predictor | 1.7 MB |
+ | `aux/body_models/` | SMPL neutral mesh + face / J_regressor (SMPL license) | 234 MB |
+ | `aux/glove/` | Vocab files for the length estimator | 10 MB |
+
+ ## How to use
+
+ ```bash
+ git clone https://github.com/Tevior/KV-Control.git
+ cd KV-Control
+ bash scripts/download_checkpoints.sh # populates checkpoints/, aux/ → glove/, body_models/
+ ```
+
+ Refer to the GitHub README for installation and quick-start commands.
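
After the download finishes, it may be worth confirming that the large files actually arrived before running anything. The following is a minimal sketch, not part of the repository: `missing_files` and `EXPECTED_FILES` are hypothetical names, the relative paths are copied from the table in this model card, and it assumes the files sit under a single root directory in that same layout, which the download script may not guarantee.

```python
from pathlib import Path

# Relative paths copied from the "What is here" table of this model card.
# Assumption: the local copy mirrors this layout under one root directory.
EXPECTED_FILES = [
    "base_t_concat_v4/model/net_best_fid.tar",
    "kv_control/model/net_best_kps.tar",
    "vqvae/net_best_fid.pth",
    "vqvae/skeleton_partition.json",
    "stats/mean.npy",
    "stats/std.npy",
    "clip/ViT-B-32.pt",
    "t2m/text_mot_match/model/finest.tar",
    "t2m/length_estimator/model/finest.tar",
]


def missing_files(root, expected=EXPECTED_FILES):
    """Return the expected relative paths that are not regular files under root."""
    root = Path(root)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__":
    # "checkpoints" is a placeholder; point it at wherever the files landed.
    for rel in missing_files("checkpoints"):
        print(f"missing: {rel}")
```

An empty result means every listed file is present; it says nothing about whether a file is complete or uncorrupted, so comparing sizes against the table is a sensible follow-up.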
+
+ ## Licenses
+
+ * Our weights (`base_t_concat_v4`, `kv_control`, `vqvae`, `stats`): **MIT**.
+ * CLIP ViT-B/32: released by OpenAI under the MIT license.
+ * SMPL body model under `aux/body_models/`: original SMPL license (research-only).
+ * Text-motion eval encoder and length estimator under `t2m/`: redistributed
+ from the HumanML3D / Guo et al. 2022 release for reproducibility.
+
+ ## Citation
+
+ ```bibtex
+ @article{kvcontrol2026,
+ title = {KV-Control: Sparse-Keyframe Multi-Joint Text-to-Motion Generation},
+ author = {... (under review) ...},
+ year = {2026},
+ }
+ ```