HereLiesAz
/

liperty-avhubert-encoder

visual-speech-recognition

Model card Files Files and versions

liperty-avhubert-encoder

4.97 GB

Ctrl+K

Ctrl+K

1 contributor

History: 8 commits

HereLiesAz's picture

V3 seq2seq decoder export (TransformerDecoder, 6 layers, vocab=1000)

77e5fd3 verified about 2 months ago

.gitattributes

1.52 kB
initial commit about 2 months ago
LICENSE.txt

8.81 kB
Add AV-HuBERT license (per redistribution requirement) about 2 months ago
README.md

1.52 kB
Add README with attribution and provenance about 2 months ago
avhubert_base_vox_433h_decoder.onnx

239 MB
xet

V3 seq2seq decoder export (TransformerDecoder, 6 layers, vocab=1000) about 2 months ago
avhubert_base_vox_433h_dict.txt

7.8 kB
V3 seq2seq decoder export (TransformerDecoder, 6 layers, vocab=1000) about 2 months ago
avhubert_base_vox_433h_visual_encoder.onnx

411 MB
xet

ONNX-exported AV-HuBERT base+vox+433h fine-tuned visual encoder (Docker, parity-passed) about 2 months ago
avhubert_visual_encoder.onnx

411 MB
xet

ONNX-exported AV-HuBERT base visual encoder (Docker route, parity-fixed) about 2 months ago
large_vox_iter5.pt
Detected Pickle imports (5)
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "torch.LongStorage",
- "collections.OrderedDict",
- "fairseq.data.dictionary.Dictionary"
How to fix it?
3.91 GB
xet

Mirror of facebookresearch AV-HuBERT large_vox_iter5 (LRS3+VoxCeleb2 pretrained) about 2 months ago