Update README.md
README.md
CHANGED
|
@@ -10,11 +10,11 @@ tags:
|
|
| 10 |
- generative-ai
|
| 11 |
---
|
| 12 |
|
| 13 |
-
#
|
| 14 |
|
| 15 |
## Overview
|
| 16 |
|
| 17 |
-
Welcome to the official Hugging Face repository for the **
|
| 18 |
This vocoder does not aim to sound realistic; instead, it gives the output a "robotic" aesthetic. It is also designed to be fast, allowing quick CPU inference.
|
| 19 |
|
| 20 |
This repository was last updated on **October 13, 2025**
|
|
@@ -29,7 +29,7 @@ All versions are available for download as ZIP files, including the necessary mo
|
|
| 29 |
|
| 30 |
## Changelog:
|
| 31 |
- October 13, 2025: Version 2 release, a full retrain of the model, trained further than the original release.
|
| 32 |
-
- August 10, 2025: Initial Release of
|
| 33 |
### Ethical Considerations
|
| 34 |
This model is distributed under the **CreativeML Open RAIL-M License**, which promotes responsible AI use. Please adhere to the following:
|
| 35 |
- Use the model only for lawful purposes and avoid harmful applications (e.g., exploitation, defamation, or generating false information—see [LICENSE.md](LICENSE.md) for full restrictions).
|
|
@@ -38,7 +38,7 @@ This model is distributed under the **CreativeML Open RAIL-M License**, which pr
|
|
| 38 |
### Attribution
|
| 39 |
When using or redistributing this vocoder in your voicebanks, please credit the author [usamireko](https://huggingface.co/usamireko) and include both the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files. A suggested citation is:
|
| 40 |
|
| 41 |
-
>
|
| 42 |
|
| 43 |
## Known Issues
|
| 44 |
- GPU rendering does not seem to work properly in some cases, so CPU usage is encouraged (the model is small enough that there's barely a performance penalty).
|
|
@@ -48,11 +48,23 @@ When using or redistributing this vocoder in your voicebanks, please credit the
|
|
| 48 |
|
| 49 |
## Training Resources
|
| 50 |
|
| 51 |
-
|
| 52 |
-
|
| 53 |
-
-
|
| 54 |
-
|
| 55 |
-
|
| 56 |
|
| 57 |
For detailed licensing terms and acknowledgments, refer to the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files included in the ZIP archives.
|
| 58 |
|
|
|
|
| 10 |
- generative-ai
|
| 11 |
---
|
| 12 |
|
| 13 |
+
# LoFiVocoder Model Family
|
| 14 |
|
| 15 |
## Overview
|
| 16 |
|
| 17 |
+
Welcome to the official Hugging Face repository for the **LoFiVocoder Model Family**, a collection of vocoder models designed for DiffSinger voicebanks used in OpenUTAU. This project provides several model checkpoints from different stages of the training process, so users can select the version that best suits their needs.
|
| 18 |
This vocoder does not aim to sound realistic; instead, it gives the output a "robotic" aesthetic. It is also designed to be fast, allowing quick CPU inference.
|
| 19 |
|
| 20 |
This repository was last updated on **October 13, 2025**
|
|
|
|
| 29 |
|
| 30 |
## Changelog:
|
| 31 |
- October 13, 2025: Version 2 release, a full retrain of the model, trained further than the original release.
|
| 32 |
+
- August 10, 2025: Initial Release of LoFiVocoder.
|
| 33 |
### Ethical Considerations
|
| 34 |
This model is distributed under the **CreativeML Open RAIL-M License**, which promotes responsible AI use. Please adhere to the following:
|
| 35 |
- Use the model only for lawful purposes and avoid harmful applications (e.g., exploitation, defamation, or generating false information—see [LICENSE.md](LICENSE.md) for full restrictions).
|
|
|
|
| 38 |
### Attribution
|
| 39 |
When using or redistributing this vocoder in your voicebanks, please credit the author [usamireko](https://huggingface.co/usamireko) and include both the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files. A suggested citation is:
|
| 40 |
|
| 41 |
+
> LoFiVocoder by usamireko was trained using a combination of open datasets and community codebases, including the HiFiPLN vocoder framework (MIT License), VocalSet (CC-BY 4.0), Cantoría Dataset (CC-BY 4.0), Project AIdol Public English Dataset, ACV-001 dataset, and a private dataset provided by Spoopy☆Ace/SpoopyAce with explicit permission. Available at https://huggingface.co/usamireko/LoFiVocoder.
|
| 42 |
|
| 43 |
## Known Issues
|
| 44 |
- GPU rendering does not seem to work properly in some cases, so CPU usage is encouraged (the model is small enough that there's barely a performance penalty).
|
|
|
|
| 48 |
|
| 49 |
## Training Resources
|
| 50 |
|
| 51 |
+
Codebase
|
| 52 |
+
|
| 53 |
+
- Scarfmonster/HiFiPLN – MIT License: https://github.com/Scarfmonster/HiFiPLN
|
| 54 |
+
|
| 55 |
+
Public Datasets
|
| 56 |
+
|
| 57 |
+
- VocalSet — CC-BY 4.0: https://zenodo.org/records/1442513
|
| 58 |
+
|
| 59 |
+
- Cantoría Dataset — CC-BY 4.0: https://zenodo.org/records/5878677
|
| 60 |
+
|
| 61 |
+
- Project AIdol Public English Dataset — CC-BY-SA 4.0: https://github.com/lottev1991/Project-AIdol-Public-English-Dataset
|
| 62 |
+
|
| 63 |
+
- ACV-001 Dataset — CC-BY-SA 4.0: https://github.com/Archivoice/ACV-001
|
| 64 |
+
|
| 65 |
+
Private Data
|
| 66 |
+
|
| 67 |
+
- Provided by Spoopy☆Ace/SpoopyAce with explicit permission
|
| 68 |
|
| 69 |
For detailed licensing terms and acknowledgments, refer to the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files included in the ZIP archives.
|
| 70 |
|