alkiskoudounas/voc2vec


alkiskoudounas committed on Feb 6, 2025

Commit e7bf015 · verified · 1 Parent(s): 3f505a9

Updated README

Files changed (1)

README.md +8 -1


@@ -19,7 +19,6 @@ voc2vec is a foundation model specifically designed for non-verbal human data.
 We employed a collection of 10 datasets covering around 125 hours of non-verbal audio and pre-trained a [Wav2Vec2](https://ai.facebook.com/blog/wav2vec-20-learning-the-structure-of-speech-from-raw-audio/)-like model.
 ## Model description
 Voc2vec is built upon the wav2vec 2.0 framework and follows its pre-training setup.
@@ -29,6 +28,14 @@ The pre-training datasets include: AudioSet (vocalization), FreeSound (babies),
 We evaluate voc2vec on six datasets: ASVP-ESD, ASPV-ESD (babies), CNVVE, NonVerbal Vocalization Dataset, Donate a Cry, VIVAE.
+## Available Models
+| Model | Description | Link |
+|--------|-------------|------|
+| **voc2vec** | Pre-trained model on **125 hours of non-verbal audio**. | [🔗 Model](https://huggingface.co/alkiskoudounas/voc2vec) |
+| **voc2vec-as-pt** | Continues pre-training from a model that was **initially trained on the AudioSet dataset**. | [🔗 Model](https://huggingface.co/alkiskoudounas/voc2vec-as-pt) |
+| **voc2vec-ls-pt** | Continues pre-training from a model that was **initially trained on the LibriSpeech dataset**. | [🔗 Model](https://huggingface.co/alkiskoudounas/voc2vec-ls-pt) |
 ## Usage examples
 You can use the model directly in the following manner:
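
The diff ends before the README's own usage snippet, so the following is only a rough sketch of how one of the three checkpoints in the "Available Models" table might be selected and run. The repository IDs come from the table; everything else is an assumption for illustration: the helper names `resolve_checkpoint` and `classify` are hypothetical, the checkpoints are assumed to load through the `transformers` Auto classes for audio classification, and the input is assumed to be 16 kHz mono audio, as is usual for wav2vec 2.0 models.

```python
"""Sketch: pick a checkpoint from the table and run one forward pass."""

# Repository IDs copied from the "Available Models" table.
CHECKPOINTS = {
    "voc2vec": "alkiskoudounas/voc2vec",
    "voc2vec-as-pt": "alkiskoudounas/voc2vec-as-pt",
    "voc2vec-ls-pt": "alkiskoudounas/voc2vec-ls-pt",
}


def resolve_checkpoint(name: str) -> str:
    """Return the Hub repository ID for a model name from the table."""
    try:
        return CHECKPOINTS[name]
    except KeyError:
        choices = sorted(CHECKPOINTS)
        raise ValueError(f"unknown model {name!r}; choose from {choices}") from None


def classify(waveform, name: str = "voc2vec", sampling_rate: int = 16_000):
    """Run one forward pass and return the raw logits.

    Assumptions (not confirmed by the diff): the checkpoint loads with the
    Auto* audio-classification classes and expects 16 kHz mono input.
    """
    # Imported lazily so the lookup helper works without these packages.
    import torch
    from transformers import AutoFeatureExtractor, AutoModelForAudioClassification

    repo_id = resolve_checkpoint(name)
    feature_extractor = AutoFeatureExtractor.from_pretrained(repo_id)
    model = AutoModelForAudioClassification.from_pretrained(repo_id)
    model.eval()

    # Convert the raw waveform into the tensor format the model expects.
    inputs = feature_extractor(
        waveform, sampling_rate=sampling_rate, return_tensors="pt"
    )
    with torch.no_grad():
        return model(**inputs).logits
```

Calling `classify(waveform, name="voc2vec-ls-pt")` with a one-dimensional float array would download the checkpoint on first use. If a checkpoint ships without a fine-tuned classification head, `transformers` initialises one randomly, so the logits only become meaningful after fine-tuning on a labelled dataset.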