Audio Classification
Transformers
PyTorch
TensorBoard
audio-spectrogram-transformer
Generated from Trainer
Instructions to use saadashraf/ast_bird_model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use saadashraf/ast_bird_model with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("audio-classification", model="saadashraf/ast_bird_model")# Load model directly from transformers import AutoFeatureExtractor, AutoModelForAudioClassification extractor = AutoFeatureExtractor.from_pretrained("saadashraf/ast_bird_model") model = AutoModelForAudioClassification.from_pretrained("saadashraf/ast_bird_model") - Notebooks
- Google Colab
- Kaggle
plz, help me
#1
by coldpumpkinn - opened
Hello, I am a college student who is conducting a voice classification study. I am currently working on fin-tuning of the ast model. But there's a problem, and the dataset I have is not learning properly. The problem is trainer.train(), but I wonder what the shape of the dataset of train_dataset in trainer.train() is when fine tuning. Could you please share some things that I can help you with, such as sharing or advice?
The error message I currently have is as follows. RuntimeError: expected scalar type Long but found Int. Please let me know if there are any mistakes. I'll fix it Thank you.
https://drive.google.com/file/d/1vHYEIlXl3X2QCPS0eLSD0NX8kg3E6IFu/view?usp=sharing