Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
HereLiesAz
/
liperty-avhubert-encoder
like
0
ONNX
visual-speech-recognition
lipreading
av-hubert
mirror
License:
av-hubert-license
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
liperty-avhubert-encoder
4.97 GB
Ctrl+K
Ctrl+K
1 contributor
History:
8 commits
HereLiesAz
V3 seq2seq decoder export (TransformerDecoder, 6 layers, vocab=1000)
77e5fd3
verified
about 2 months ago
.gitattributes
Safe
1.52 kB
initial commit
about 2 months ago
LICENSE.txt
Safe
8.81 kB
Add AV-HuBERT license (per redistribution requirement)
about 2 months ago
README.md
1.52 kB
Add README with attribution and provenance
about 2 months ago
avhubert_base_vox_433h_decoder.onnx
239 MB
xet
V3 seq2seq decoder export (TransformerDecoder, 6 layers, vocab=1000)
about 2 months ago
avhubert_base_vox_433h_dict.txt
7.8 kB
V3 seq2seq decoder export (TransformerDecoder, 6 layers, vocab=1000)
about 2 months ago
avhubert_base_vox_433h_visual_encoder.onnx
411 MB
xet
ONNX-exported AV-HuBERT base+vox+433h fine-tuned visual encoder (Docker, parity-passed)
about 2 months ago
avhubert_visual_encoder.onnx
411 MB
xet
ONNX-exported AV-HuBERT base visual encoder (Docker route, parity-fixed)
about 2 months ago
large_vox_iter5.pt
pickle
Detected Pickle imports (5)
"torch.FloatStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.LongStorage"
,
"collections.OrderedDict"
,
"fairseq.data.dictionary.Dictionary"
How to fix it?
3.91 GB
xet
Mirror of facebookresearch AV-HuBERT large_vox_iter5 (LRS3+VoxCeleb2 pretrained)
about 2 months ago