File size: 4,779 Bytes
59a943e
 
542d4f7
 
 
 
 
 
a858c0e
542d4f7
59a943e
 
4b80004
a9c7845
59a943e
 
 
a9c7845
6f6444f
59a943e
e166294
59a943e
6f6444f
59a943e
6f6444f
59a943e
87c966f
17cc804
6f6444f
 
59a943e
6f6444f
87c966f
6f6444f
a9c7845
59a943e
9fa52b1
59a943e
 
 
 
e67306b
59a943e
a9c7845
59a943e
63f3157
 
 
16324fc
 
63f3157
59a943e
 
a9c7845
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
59a943e
 
 
 
 
17cc804
59a943e
 
 
 
 
 
 
 
 
 
 
17cc804
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
---
tags:
- audio
- vocoder
- singing-synthesis
- openutau
- machine-learning
- generative-ai
- diffsinger
license: other
---


# LoFiVocoder Model Family

## Overview

Welcome to the official Hugging Face repository for the **LoFiVocoder Model Family**, a collection of vocoder models designed for DiffSinger voicebanks for use in OpenUTAU. This project provides different model checkpoints, reflecting different stages of the training process, offering users flexibility in selecting the version that best suits their needs.
This vocoder aims to not be realistic but rather give a "robotic" aesthetic to the output, also aims to be pretty fast, allowing quick CPU inference.

This repository was last updated on **February 03, 2026**

All versions are available for download as ZIP files, including the necessary model weights, configuration files, and associated documentation.

## Difference between version

- 20260203: Trained up to **February 03, 2026**, changed license to fit more the author´s desire, increased dataset (ACV-001 and AIdol introduced) with further training. **(Recommended)**
- 20251013: Trained up to **October 13, 2025** (This version changes the Vocoder´s name to LoFiVocoder, please update the name from the old release to be usable)
- Ver A: Based off the latest checkpoint trained up to **August 10, 2025**
- Ver B: Based off an earlier checkpoint trained up to **August 10, 2025**, use this one if you want a slightly more robotic-ish output

## Changelog: 
- February 3, 2026: Version 3 release, full retrain of model, increased dataset with ACV-001 and AIdol, trained even further compared previous release, repo and overall branding shortened to LoFiVocoder.
- October 13, 2025: Version 2 release, a full retrain of the model, trained further than the original release.
- August 10, 2025: Initial Release of LoFiVocoder.
### Ethical Considerations
This model is distributed under the **CreativeML Open RAIL-M-derived License**, which promotes responsible AI use. Please adhere to the following:
- Use the model only for lawful purposes and avoid harmful applications (e.g., exploitation, defamation, or generating false information—see [LICENSE.md](LICENSE.md) for full restrictions).
- Include attribution to the original resources and this repository in any derivative works or redistributions.

### Attribution
When using or redistributing this vocoder in your voicebanks, please credit the author [usamireko](https://huggingface.co/usamireko) and include both the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files. A suggested citation is:

> LoFiVocoder by usamireko was trained using a combination of open datasets and community codebases, including the HiFiPLN vocoder framework (MIT License), VocalSet (CC-BY 4.0), Cantoría Dataset (CC-BY 4.0), Project AIdol Public English Dataset, ACV-001 dataset, and a private dataset provided by Spoopy☆Ace/SpoopyAce with explicit permission. Available at https://huggingface.co/usamireko/LoFiVocoder."

## Known Issues
- GPU rendering seems to not work properly on some instances, CPU usage encouraged (Its small enough that there´s barely a performance penalty)
- If a voicebank with a custom vocoder (dsvocoder folder), apparently makes it ignore PC-DDSP-LoFiVocoder

*Both of these are getting investigated*

## Training Resources

 Codebase

- Scarfmonster/HiFiPLN – MIT License: https://github.com/Scarfmonster/HiFiPLN

 Public Datasets

- VocalSet — CC-BY 4.0: https://zenodo.org/records/1442513

- Cantoría Dataset — CC-BY 4.0: https://zenodo.org/records/5878677

- Project AIdol Public English Dataset — CC-BY-SA 4.0: https://github.com/lottev1991/Project-AIdol-Public-English-Dataset

- ACV-001 Dataset —  CC BY-SA 4.0:https://github.com/Archivoice/ACV-001

## Private Data

- Provided by Spoopy☆Ace/SpoopyAce with explicit permission

For detailed licensing terms and acknowledgments, refer to the [LICENSE.md](LICENSE.md) and [NOTICE.md](NOTICE.md) files included in the ZIP archives.

## License and Legal Notices

This model is released under a **CreativeML Open RAIL-M-derived License**, which grants permissions for use, modification, and distribution while imposing use-based restrictions to ensure responsible AI practices. Key points include:
- No warranties or guarantees are provided; use at your own risk.
- Redistribution must include the license and notice files.
- See [LICENSE.md](LICENSE.md) for the full terms and Attachment A for restricted uses.

The [NOTICE.md](NOTICE.md) file contains specific attributions to the training resources and contributors.

## Contributing and Support

This is a community-supported project. For feedback, issues, or contributions:
- Open an issue on this Hugging Face page.

Thank you for using LoFiVocoder!