Anonymous Authors commited on
Commit
3ceb528
Β·
1 Parent(s): 78f3fef

Rename '(Corp)' variant to '(Composite)' to match the paper

Browse files

The locality-preserving post-processing variant is referred to as
"ViTeX-14B (Composite)" in the paper and on the leaderboard. Update the
model card and the make_corp_baseline.py docstring to match. The
script filename and the default output directory (ViTeX-14B_Corp/) are
left as legacy identifiers so existing eval pipelines keep working.

Files changed (2) hide show
  1. README.md +2 -2
  2. make_corp_baseline.py +1 -1
README.md CHANGED
@@ -35,7 +35,7 @@ This repository is fully self-contained β€” it bundles the trained weights, the
35
  β”œβ”€β”€ README.md
36
  β”œβ”€β”€ requirements.txt
37
  β”œβ”€β”€ inference_example.py run ViTeX-14B on one (video, mask, glyph) tuple
38
- β”œβ”€β”€ make_corp_baseline.py build the ViTeX-14B (Corp) variant from raw predictions
39
  β”œβ”€β”€ vitex_14b.safetensors (8 GB β€” trained adapter weights)
40
  β”œβ”€β”€ diffsynth/ (bundled inference library)
41
  └── base_model/ (70 GB β€” frozen base model files)
@@ -80,7 +80,7 @@ python inference_example.py \
80
 
81
  The script automatically uses the bundled `base_model/` and `vitex_14b.safetensors` β€” no extra downloads.
82
 
83
- ## Locality-preserving variant: ViTeX-14B (Corp)
84
 
85
  `make_corp_baseline.py` is a deterministic, training-free post-processing wrapper that composes ViTeX-14B's predicted text region back onto the source video. Two per-frame operations:
86
 
 
35
  β”œβ”€β”€ README.md
36
  β”œβ”€β”€ requirements.txt
37
  β”œβ”€β”€ inference_example.py run ViTeX-14B on one (video, mask, glyph) tuple
38
+ β”œβ”€β”€ make_corp_baseline.py build the ViTeX-14B (Composite) variant from raw predictions
39
  β”œβ”€β”€ vitex_14b.safetensors (8 GB β€” trained adapter weights)
40
  β”œβ”€β”€ diffsynth/ (bundled inference library)
41
  └── base_model/ (70 GB β€” frozen base model files)
 
80
 
81
  The script automatically uses the bundled `base_model/` and `vitex_14b.safetensors` β€” no extra downloads.
82
 
83
+ ## Locality-preserving variant: ViTeX-14B (Composite)
84
 
85
  `make_corp_baseline.py` is a deterministic, training-free post-processing wrapper that composes ViTeX-14B's predicted text region back onto the source video. Two per-frame operations:
86
 
make_corp_baseline.py CHANGED
@@ -1,4 +1,4 @@
1
- """Build the ViTeX-14B (Corp) baseline.
2
 
3
  For each test clip:
4
  1. Read source video, ViTeX-14B prediction, and the dilated text mask.
 
1
+ """Build the ViTeX-14B (Composite) baseline.
2
 
3
  For each test clip:
4
  1. Read source video, ViTeX-14B prediction, and the dilated text mask.