Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -15,15 +15,30 @@ task_categories:
|
|
| 15 |
|
| 16 |
This model generate a short textual description of any audio file.
|
| 17 |
|
|
|
|
| 18 |
```bash
|
| 19 |
pip install conette
|
| 20 |
```
|
| 21 |
|
|
|
|
| 22 |
```py
|
| 23 |
from conette import CoNeTTEConfig, CoNeTTEModel
|
| 24 |
|
| 25 |
config = CoNeTTEConfig.from_pretrained("Labbeti/conette")
|
| 26 |
model = CoNeTTEModel.from_pretrained("Labbeti/conette", config=config)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 27 |
```
|
| 28 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 29 |
It was created by [@Labbeti](https://hf.co/Labbeti).
|
|
|
|
| 15 |
|
| 16 |
This model generate a short textual description of any audio file.
|
| 17 |
|
| 18 |
+
## Installation
|
| 19 |
```bash
|
| 20 |
pip install conette
|
| 21 |
```
|
| 22 |
|
| 23 |
+
## Usage
|
| 24 |
```py
|
| 25 |
from conette import CoNeTTEConfig, CoNeTTEModel
|
| 26 |
|
| 27 |
config = CoNeTTEConfig.from_pretrained("Labbeti/conette")
|
| 28 |
model = CoNeTTEModel.from_pretrained("Labbeti/conette", config=config)
|
| 29 |
+
|
| 30 |
+
path = "/my/path/to/audio.wav"
|
| 31 |
+
outputs = model(path)
|
| 32 |
+
cands = outputs["cands"][0]
|
| 33 |
+
print(cands)
|
| 34 |
```
|
| 35 |
|
| 36 |
+
## Performance
|
| 37 |
+
TODO
|
| 38 |
+
|
| 39 |
+
## Additional information
|
| 40 |
+
|
| 41 |
+
The encoder part of the architecture is based on a ConvNeXt model for audio classification, available here: https://huggingface.co/topel/ConvNeXt-Tiny-AT.
|
| 42 |
+
The encoder weights used are named "convnext_tiny_465mAP_BL_AC_70kit.pth", available on Zenodo: https://zenodo.org/record/8020843.
|
| 43 |
+
|
| 44 |
It was created by [@Labbeti](https://hf.co/Labbeti).
|