File size: 551 Bytes
b5f440b fed4bd1 b5f440b 7878038 dec7304 30b9138 d3cee8c 30b9138 d3cee8c 30b9138 dec7304 30b9138 dec7304 30b9138 dec7304 30b9138 dec7304 30b9138 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 |
---
library_name: nemo
license: cc-by-4.0
tags:
- pytorch
- NeMo
---
Speaker Verification model trained on Japanese data.
# Install
```bash
pip install nemo_toolkit['all']
```
# Inference
```python
import nemo.collections.asr as nemo_asr
speaker_model = nemo_asr.models.EncDecSpeakerLabelModel.from_pretrained("Respair/RyuseiNet")
emb = speaker_model.get_embedding("audio.wav") # speaker embedding
# or
speaker_model.verify_speakers("audio_1.wav","audio_2.wav")
```
# Architecture
Nvidia's Titanet Large
# Data
800 ~ 1000 hours
# Compute
GH200 |