Updated README
Browse files
README.md
CHANGED
|
@@ -19,7 +19,6 @@ voc2vec is a foundation model specifically designed for non-verbal human data.
|
|
| 19 |
|
| 20 |
We employed a collection of 10 datasets covering around 125 hours of non-verbal audio and pre-trained a [Wav2Vec2](https://ai.facebook.com/blog/wav2vec-20-learning-the-structure-of-speech-from-raw-audio/)-like model.
|
| 21 |
|
| 22 |
-
|
| 23 |
## Model description
|
| 24 |
|
| 25 |
Voc2vec is built upon the wav2vec 2.0 framework and follows its pre-training setup.
|
|
@@ -29,6 +28,14 @@ The pre-training datasets include: AudioSet (vocalization), FreeSound (babies),
|
|
| 29 |
|
| 30 |
We evaluate voc2vec on six datasets: ASVP-ESD, ASPV-ESD (babies), CNVVE, NonVerbal Vocalization Dataset, Donate a Cry, VIVAE.
|
| 31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 32 |
## Usage examples
|
| 33 |
|
| 34 |
You can use the model directly in the following manner:
|
|
|
|
| 19 |
|
| 20 |
We employed a collection of 10 datasets covering around 125 hours of non-verbal audio and pre-trained a [Wav2Vec2](https://ai.facebook.com/blog/wav2vec-20-learning-the-structure-of-speech-from-raw-audio/)-like model.
|
| 21 |
|
|
|
|
| 22 |
## Model description
|
| 23 |
|
| 24 |
Voc2vec is built upon the wav2vec 2.0 framework and follows its pre-training setup.
|
|
|
|
| 28 |
|
| 29 |
We evaluate voc2vec on six datasets: ASVP-ESD, ASPV-ESD (babies), CNVVE, NonVerbal Vocalization Dataset, Donate a Cry, VIVAE.
|
| 30 |
|
| 31 |
+
## Available Models
|
| 32 |
+
|
| 33 |
+
| Model | Description | Link |
|
| 34 |
+
|--------|-------------|------|
|
| 35 |
+
| **voc2vec** | Pre-trained model on **125 hours of non-verbal audio**. | [🔗 Model](https://huggingface.co/alkiskoudounas/voc2vec) |
|
| 36 |
+
| **voc2vec-as-pt** | Continues pre-training from a model that was **initially trained on the AudioSet dataset**. | [🔗 Model](https://huggingface.co/alkiskoudounas/voc2vec-as-pt) |
|
| 37 |
+
| **voc2vec-ls-pt** | Continues pre-training from a model that was **initially trained on the LibriSpeech dataset**. | [🔗 Model](https://huggingface.co/alkiskoudounas/voc2vec-ls-pt) |
|
| 38 |
+
|
| 39 |
## Usage examples
|
| 40 |
|
| 41 |
You can use the model directly in the following manner:
|