Audio-Text-to-Text
Transformers
Safetensors
English
qwen2_audio
text2text-generation
audio
speech
audio-llm
paralinguistic
pclm
dpo
voxparadox
Instructions to use IHP-Lab/Qwen2-Audio_PCLM_DPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use IHP-Lab/Qwen2-Audio_PCLM_DPO with Transformers:
# Load model directly from transformers import AutoProcessor, AutoModelForSeq2SeqLM processor = AutoProcessor.from_pretrained("IHP-Lab/Qwen2-Audio_PCLM_DPO") model = AutoModelForSeq2SeqLM.from_pretrained("IHP-Lab/Qwen2-Audio_PCLM_DPO") - Notebooks
- Google Colab
- Kaggle
File size: 342 Bytes
11a7e82 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 | {
"chunk_length": 30,
"feature_extractor_type": "WhisperFeatureExtractor",
"feature_size": 128,
"hop_length": 160,
"n_fft": 400,
"n_samples": 480000,
"nb_max_frames": 3000,
"padding_side": "right",
"padding_value": 0.0,
"processor_class": "Qwen2AudioProcessor",
"return_attention_mask": true,
"sampling_rate": 16000
}
|