Update README.md
Browse files
README.md
CHANGED
|
@@ -73,4 +73,28 @@ print(decode_wav([torchaudio.load('sample.wav')[0].squeeze()], model))
|
|
| 73 |
# <|0.06| What are the many parts that make a machine learning system feel like it works so magically cheap? |5.86|>
|
| 74 |
# <|5.68| Explletability factors important, so they tend to gear towards more simpler models with less parameters, but easier to explain, and on the other spectrum there are |15.86|>
|
| 75 |
|
| 76 |
-
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 73 |
# <|0.06| What are the many parts that make a machine learning system feel like it works so magically cheap? |5.86|>
|
| 74 |
# <|5.68| Explletability factors important, so they tend to gear towards more simpler models with less parameters, but easier to explain, and on the other spectrum there are |15.86|>
|
| 75 |
|
| 76 |
+
```
|
| 77 |
+
|
| 78 |
+
### Citation
|
| 79 |
+
|
| 80 |
+
This repository uses the idea from the following paper. Please cite the paper if this model is used to help produce published results or is incorporated into other software.
|
| 81 |
+
|
| 82 |
+
```text
|
| 83 |
+
@INPROCEEDINGS{10446589,
|
| 84 |
+
author={Nguyen, Thai-Binh and Waibel, Alexander},
|
| 85 |
+
booktitle={ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
|
| 86 |
+
title={Synthetic Conversations Improve Multi-Talker ASR},
|
| 87 |
+
year={2024},
|
| 88 |
+
volume={},
|
| 89 |
+
number={},
|
| 90 |
+
pages={10461-10465},
|
| 91 |
+
keywords={Systematics;Error analysis;Knowledge based systems;Oral communication;Signal processing;Data models;Acoustics;multi-talker;asr;synthetic conversation},
|
| 92 |
+
doi={10.1109/ICASSP48485.2024.10446589}
|
| 93 |
+
}
|
| 94 |
+
```
|
| 95 |
+
|
| 96 |
+
### Contact
|
| 97 |
+
|
| 98 |
+
nguyenvulebinh@gmail.com
|
| 99 |
+
|
| 100 |
+
[](https://twitter.com/intent/follow?screen_name=nguyenvulebinh)
|