Instructions to use MahmoodAnaam/avhubert_encoder_vsr_base_vox_433h with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use MahmoodAnaam/avhubert_encoder_vsr_base_vox_433h with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="MahmoodAnaam/avhubert_encoder_vsr_base_vox_433h", trust_remote_code=True)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("MahmoodAnaam/avhubert_encoder_vsr_base_vox_433h", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
File size: 946 Bytes
f5d35b5 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 | {
"auto_map": {
"AutoFeatureExtractor": "feature_extraction_avhubert.AVHubertFeatureExtractor"
},
"feature_extractor_type": "AVHubertFeatureExtractor",
"image_crop_size": 88,
"landmark_indices": [
5,
411,
199,
187
],
"max_sample_size": null,
"min_detection_confidence": 0.5,
"min_tracking_confidence": 0.5,
"normalize": true,
"refine_landmarks": false,
"sr": 16000,
"stack_order_audio": 4,
"static_image_mode": false,
"transforms": [
{
"training": "True",
"transforms_type": "ToImage"
},
{
"size": "(88, 88)",
"training": "True",
"transforms_type": "CenterCrop"
},
{
"dtype": "torch.float32",
"scale": "True",
"training": "True",
"transforms_type": "ToDtype"
},
{
"inplace": "False",
"mean": "[0.421]",
"std": "[0.165]",
"training": "True",
"transforms_type": "Normalize"
}
]
}
|