usamireko
/

LoFiVocoder

@@ -10,11 +10,11 @@ tags:
   - generative-ai
 ---
-# PC-DDSP-LoFiVocoder Model Family
 ## Overview
-Welcome to the official Hugging Face repository for the **PC-DDSP-LoFiVocoder Model Family**, a collection of vocoder models designed for DiffSinger voicebanks for use in OpenUTAU. This project provides different model checkpoints, reflecting different stages of the training process, offering users flexibility in selecting the version that best suits their needs.
 This vocoder aims to not be realistic but rather give a "robotic" aesthetic to the output, also aims to be pretty fast, allowing quick CPU inference.
 This repository was last updated on **October 13, 2025**
@@ -29,7 +29,7 @@ All versions are available for download as ZIP files, including the necessary mo
 ## Changelog:
 - October 13, 2025: Version 2 release, a full retrain of the model, trained further than the original release.
-- August 10, 2025: Initial Release of PC-DDSP-LoFiVocoder.
 ### Ethical Considerations
 This model is distributed under the **CreativeML Open RAIL-M License**, which promotes responsible AI use. Please adhere to the following:
 - Use the model only for lawful purposes and avoid harmful applications (e.g., exploitation, defamation, or generating false information—see [LICENSE.md](LICENSE.md) for full restrictions).
@@ -38,7 +38,7 @@ This model is distributed under the **CreativeML Open RAIL-M License**, which pr
 ### Attribution
 When using or redistributing this vocoder in your voicebanks, please credit the author [usamireko](https://huggingface.co/usamireko) and include both the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files. A suggested citation is:
-> "PC-DDSP-LoFiVocoder by usamireko, trained using resources from Scarfmonster/HiFiPLN (MIT), VocalSet (CC-BY 4.0), Cantoría Dataset (CC-BY 4.0), and a private dataset by Spoopy☆Ace/SpoopyAce. Available at https://huggingface.co/usamireko/PC-DDSP-LoFiVocoder."
 ## Known Issues
 - GPU rendering seems to not work properly on some instances, CPU usage encouraged (Its small enough that there´s barely a performance penalty)
@@ -48,11 +48,23 @@ When using or redistributing this vocoder in your voicebanks, please credit the
 ## Training Resources
-This model was developed using the following datasets and codebases:
-- **Code**: Based on [Scarfmonster/HiFiPLN](https://github.com/Scarfmonster/HiFiPLN), licensed under the MIT License, a community vocoder framework for DiffSinger.
-- **VocalSet Dataset**: DOI: [10.5281/zenodo.1442513](https://zenodo.org/records/1442513), licensed under Creative Commons Attribution 4.0 International (CC-BY 4.0), provided by Julia Wilkins et al. at Northwestern University.
-- **Cantoría Dataset**: DOI: [10.5281/zenodo.5878677](https://zenodo.org/records/5878677), licensed under Creative Commons Attribution 4.0 International (CC-BY 4.0), provided by Helena Cuesta et al. at Universitat Pompeu Fabra.
-- **Private Dataset**: Supplied by Spoopy☆Ace/SpoopyAce with explicit permission.
 For detailed licensing terms and acknowledgments, refer to the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files included in the ZIP archives.

   - generative-ai
 ---
+# LoFiVocoder Model Family
 ## Overview
+Welcome to the official Hugging Face repository for the **LoFiVocoder Model Family**, a collection of vocoder models designed for DiffSinger voicebanks for use in OpenUTAU. This project provides different model checkpoints, reflecting different stages of the training process, offering users flexibility in selecting the version that best suits their needs.
 This vocoder aims to not be realistic but rather give a "robotic" aesthetic to the output, also aims to be pretty fast, allowing quick CPU inference.
 This repository was last updated on **October 13, 2025**
 ## Changelog:
 - October 13, 2025: Version 2 release, a full retrain of the model, trained further than the original release.
+- August 10, 2025: Initial Release of LoFiVocoder.
 ### Ethical Considerations
 This model is distributed under the **CreativeML Open RAIL-M License**, which promotes responsible AI use. Please adhere to the following:
 - Use the model only for lawful purposes and avoid harmful applications (e.g., exploitation, defamation, or generating false information—see [LICENSE.md](LICENSE.md) for full restrictions).
 ### Attribution
 When using or redistributing this vocoder in your voicebanks, please credit the author [usamireko](https://huggingface.co/usamireko) and include both the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files. A suggested citation is:
+> LoFiVocoder by usamireko was trained using a combination of open datasets and community codebases, including the HiFiPLN vocoder framework (MIT License), VocalSet (CC-BY 4.0), Cantoría Dataset (CC-BY 4.0), Project AIdol Public English Dataset, ACV-001 dataset, and a private dataset provided by Spoopy☆Ace/SpoopyAce with explicit permission. Available at https://huggingface.co/usamireko/LoFiVocoder."
 ## Known Issues
 - GPU rendering seems to not work properly on some instances, CPU usage encouraged (Its small enough that there´s barely a performance penalty)
 ## Training Resources
+ Codebase
+- Scarfmonster/HiFiPLN – MIT License: https://github.com/Scarfmonster/HiFiPLN
+ Public Datasets
+- VocalSet — CC-BY 4.0: https://zenodo.org/records/1442513
+- Cantoría Dataset — CC-BY 4.0: https://zenodo.org/records/5878677
+- Project AIdol Public English Dataset — CC-BY-SA 4.0: https://github.com/lottev1991/Project-AIdol-Public-English-Dataset
+- ACV-001 Dataset —  CC BY-SA 4.0:https://github.com/Archivoice/ACV-001
+## Private Data
+- Provided by Spoopy☆Ace/SpoopyAce with explicit permission
 For detailed licensing terms and acknowledgments, refer to the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files included in the ZIP archives.