Update README.md
Browse files
README.md
CHANGED
|
@@ -6,6 +6,9 @@ tags:
|
|
| 6 |
- NeMo
|
| 7 |
---
|
| 8 |
|
|
|
|
|
|
|
|
|
|
| 9 |
# Inference
|
| 10 |
|
| 11 |
```python
|
|
@@ -20,3 +23,13 @@ emb = speaker_model.get_embedding("audio.wav") # speaker embedding
|
|
| 20 |
speaker_model.verify_speakers("audio_1.wav","audio_2.wav")
|
| 21 |
```
|
| 22 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
- NeMo
|
| 7 |
---
|
| 8 |
|
| 9 |
+
Speaker Verification model trained on Japanese anime data.
|
| 10 |
+
|
| 11 |
+
|
| 12 |
# Inference
|
| 13 |
|
| 14 |
```python
|
|
|
|
| 23 |
speaker_model.verify_speakers("audio_1.wav","audio_2.wav")
|
| 24 |
```
|
| 25 |
|
| 26 |
+
# Architecture
|
| 27 |
+
|
| 28 |
+
Nvidia's Titanet Large
|
| 29 |
+
|
| 30 |
+
# Data
|
| 31 |
+
|
| 32 |
+
800 ~ 1000 hours
|
| 33 |
+
|
| 34 |
+
# Compute
|
| 35 |
+
GH200
|