File size: 551 Bytes
b5f440b
 
 
 
 
 
 
fed4bd1
b5f440b
7878038
 
 
 
dec7304
30b9138
d3cee8c
 
30b9138
 
 
 
 
d3cee8c
 
30b9138
dec7304
30b9138
dec7304
30b9138
dec7304
30b9138
dec7304
30b9138
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
---
library_name: nemo
license: cc-by-4.0
tags:
- pytorch
- NeMo
---
Speaker Verification model trained on Japanese data.

# Install
```bash
pip install nemo_toolkit['all']
```

# Inference

```python
import nemo.collections.asr as nemo_asr
speaker_model = nemo_asr.models.EncDecSpeakerLabelModel.from_pretrained("Respair/RyuseiNet")
emb = speaker_model.get_embedding("audio.wav") # speaker embedding
# or
speaker_model.verify_speakers("audio_1.wav","audio_2.wav")
```

# Architecture

Nvidia's Titanet Large

# Data

800 ~ 1000 hours

# Compute
GH200