rjzevallos committed
Commit d686ad4 · verified · 1 parent: a1c6d62

Update README.md

Files changed (1):
  1. README.md +35 -30
README.md CHANGED
@@ -1,28 +1,36 @@
  ---
  language:
  - ca
  ---

- # PL-BERT-wordpiece-cat-multiaccent

  ## Overview

  <details>
  <summary>Click to expand</summary>

- - **Model type:** Phoneme-level Language Model (PL-BERT)
- - **Architecture:** AlBERT-base (12 layers, 768 hidden units, 12 attention heads)
- - **Language:** Catalan (multiple accents)
- - **License:** Apache 2.0
- - **Data:** Crowdsourced phonemized Catalan speech text

  </details>

  ---

- ## Model description

- **PL-BERT-wordpiece-cat-multiaccent** is a phoneme-level masked language model trained on Catalan text with diverse regional accents. It is based on the [PL-BERT architecture](https://github.com/yl4579/PL-BERT), which learns phoneme representations via a BERT-style masked language modeling objective.

  This model is designed to support **phoneme-based text-to-speech (TTS) systems**, including but not limited to [StyleTTS2](https://github.com/yl4579/StyleTTS2). Thanks to its Catalan-specific phoneme vocabulary and contextual embedding capabilities, it can serve as a phoneme encoder for any TTS architecture requiring phoneme-level features.
 
@@ -34,7 +42,7 @@ Features of our PL-BERT:

  ---

- ## Intended uses and limitations

  ### Intended uses
 
@@ -50,7 +58,7 @@ Features of our PL-BERT:

  ---

- ## How to use (with StyleTTS2)

  Here is an example of how to use this model within the StyleTTS2 framework:
 
@@ -76,7 +84,7 @@ Note: Although this example uses StyleTTS2, the model is compatible with other T

  ---

- ## Training

  ### Training data
 
@@ -110,49 +118,46 @@ Other parameters:
  - Token mask: M
  - Word separator ID: 102

- ---
-

- ## Evaluation

  The model has not been benchmarked via perplexity or extrinsic evaluation, but has been successfully integrated into TTS pipelines such as StyleTTS2, where it enables the synthesis of Catalan with regional accent variation.

  ---

  ## Citation

  If this code contributes to your research, please cite the work:

  ```
- @misc{LangtechVeu2025plbertwordpiececatmultiaccent,
- title={PL-BERT-wordpiece-cat-multiaccent},
  author={Rodolfo Zevallos and Jose Giraldo and Carme Armentano-Oller},
  organization={Barcelona Supercomputing Center},
- url={https://huggingface.co/langtech-veu/PL-BERT-wordpiece-cat-multiaccent},
  year={2025}
  }
  ```

- ## Additional information

- ### Contact

- For questions or feedback, please contact:
- rodolfo.zevallos@bsc.es

- ### License

- Distributed under the Apache License, Version 2.0: https://www.apache.org/licenses/LICENSE-2.0

- ### Funding
- This work is funded by the Ministerio para la Transformación Digital y de la Función Pública - Funded by EU – NextGenerationEU within the framework of the project Desarrollo de Modelos ALIA.

- ### Disclaimer

- <details>
- <summary>Click to expand</summary>

- This model is released for research and educational use. It may exhibit biases or limitations based on training data characteristics. Users are responsible for ensuring appropriate use in deployed systems and for complying with all applicable regulations.

- </details>

  ---
+ license: apache-2.0
  language:
  - ca
+ tags:
+ - TTS
+ - PL-BERT
+ - barcelona-supercomputing-center
  ---

+
+ # PL-BERT-wp-ca
+

  ## Overview

  <details>
  <summary>Click to expand</summary>

+ - [Model Description](#model-description)
+ - [Intended Uses and Limitations](#intended-uses-and-limitations)
+ - [How to Get Started with the Model](#how-to-get-started-with-the-model)
+ - [Training Details](#training-details)
+ - [Citation](#citation)
+ - [Additional information](#additional-information)

  </details>

  ---

+ ## Model Description

+ **PL-BERT-wp-ca** is a phoneme-level masked language model trained on Catalan text with diverse regional accents. It is based on the [PL-BERT architecture](https://github.com/yl4579/PL-BERT), which learns phoneme representations via a BERT-style masked language modeling objective.

  This model is designed to support **phoneme-based text-to-speech (TTS) systems**, including but not limited to [StyleTTS2](https://github.com/yl4579/StyleTTS2). Thanks to its Catalan-specific phoneme vocabulary and contextual embedding capabilities, it can serve as a phoneme encoder for any TTS architecture requiring phoneme-level features.
 
  ---

+ ## Intended Uses and Limitations

  ### Intended uses
 
 
  ---

+ ## How to Get Started with the Model

  Here is an example of how to use this model within the StyleTTS2 framework:
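The usage snippet itself sits outside the lines shown in this hunk. As a stand-in, here is a minimal sketch of pulling phoneme-level features out of the checkpoint. It is an assumption-laden illustration, not the README's elided StyleTTS2 recipe: it presumes the repo hosts an ALBERT-style encoder and a phoneme-level tokenizer loadable through Hugging Face `transformers`, and the input string is a made-up phonemized sample.

```python
# Minimal sketch, not the README's elided StyleTTS2 example.
# Assumption: the checkpoint loads as an ALBERT encoder with a matching
# phoneme-level tokenizer via Hugging Face transformers.
import torch
from transformers import AlbertModel, AutoTokenizer

repo_id = "langtech-veu/PL-BERT-wp-ca"  # model id from this README
tokenizer = AutoTokenizer.from_pretrained(repo_id)
plbert = AlbertModel.from_pretrained(repo_id).eval()

# Hypothetical phonemized input; in practice it comes from a Catalan
# phonemizer upstream of the model.
phonemes = "bɔn diə"
batch = tokenizer(phonemes, return_tensors="pt")
with torch.no_grad():
    features = plbert(**batch).last_hidden_state  # (1, seq_len, 768)
print(features.shape)
```

In a StyleTTS2-style pipeline, features of this kind stand in for plain phoneme embeddings as input to the duration and prosody predictors.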

  ---

+ ## Training Details

  ### Training data
 
 
  - Token mask: M
  - Word separator ID: 102
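To make the two parameters above concrete: in PL-BERT-style preprocessing, masking is applied to whole words of phonemes, with word boundaries marked by the separator id. The sketch below shows that shape only; `MASK_ID` is a hypothetical stand-in for the id of the `M` token, and the real pipeline additionally uses BERT-style random/keep replacements.

```python
import random

MASK_ID = 3        # hypothetical id of the "M" mask token
WORD_SEP_ID = 102  # word separator id quoted above

def mask_whole_words(ids: list[int], p: float = 0.15, seed: int = 0) -> list[int]:
    """Sketch: mask every phoneme of randomly chosen words, never separators."""
    rng = random.Random(seed)
    out, word_start = list(ids), 0
    for i, tok in enumerate(ids + [WORD_SEP_ID]):  # sentinel flushes last word
        if tok == WORD_SEP_ID:
            if rng.random() < p:  # mask the whole word span [word_start, i)
                for j in range(word_start, min(i, len(ids))):
                    out[j] = MASK_ID
            word_start = i + 1
    return out

# Words [5, 8], [9, 4, 7] and [6] are masked or kept as units.
print(mask_whole_words([5, 8, 102, 9, 4, 7, 102, 6], p=0.5))
```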

+ ### Evaluation

  The model has not been benchmarked via perplexity or extrinsic evaluation, but has been successfully integrated into TTS pipelines such as StyleTTS2, where it enables the synthesis of Catalan with regional accent variation.
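Should an intrinsic number ever be wanted, masked-token pseudo-perplexity is the usual proxy for BERT-style models. A sketch under the same `transformers`-loading assumption as the usage example above, plus the further assumption that the checkpoint ships masked-LM head weights:

```python
# Sketch: pseudo-perplexity of one phonemized string under a masked LM.
import torch
from transformers import AlbertForMaskedLM, AutoTokenizer

repo_id = "langtech-veu/PL-BERT-wp-ca"  # model id from this README
tok = AutoTokenizer.from_pretrained(repo_id)
mlm = AlbertForMaskedLM.from_pretrained(repo_id).eval()

def pseudo_perplexity(text: str) -> float:
    ids = tok(text, return_tensors="pt").input_ids
    n = ids.size(1) - 2  # number of real tokens between the special tokens
    nll = 0.0
    for pos in range(1, ids.size(1) - 1):
        masked = ids.clone()
        masked[0, pos] = tok.mask_token_id  # mask one position at a time
        with torch.no_grad():
            logits = mlm(input_ids=masked).logits
        log_probs = torch.log_softmax(logits[0, pos], dim=-1)
        nll -= log_probs[ids[0, pos]].item()
    return float(torch.exp(torch.tensor(nll / n)))
```

Extrinsic evaluation (e.g., listening tests on a downstream TTS) remains the more meaningful measure for this use case.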

  ---

+
  ## Citation

  If this code contributes to your research, please cite the work:

  ```
+ @misc{zevallosbertwpca,
+ title={PL-BERT-wp-ca},
  author={Rodolfo Zevallos and Jose Giraldo and Carme Armentano-Oller},
  organization={Barcelona Supercomputing Center},
+ url={https://huggingface.co/langtech-veu/PL-BERT-wp-ca},
  year={2025}
  }
  ```

+ ## Additional Information

+ ### Author

+ Developed by [Rodolfo Zevallos](https://huggingface.co/rjzevallos) at the [Language Technologies Laboratory](https://huggingface.co/BSC-LT) of the [Barcelona Supercomputing Center](https://www.bsc.es/).

+ ### Contact
+ For further information, please send an email to <langtech@bsc.es>.

+ ### Copyright
+ Copyright (c) 2025 by Language Technologies Laboratory, Barcelona Supercomputing Center.

+ ### License

+ [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0)

+ ### Funding
+ This work is funded by the Ministerio para la Transformación Digital y de la Función Pública - Funded by EU – NextGenerationEU within the framework of the project Desarrollo de Modelos ALIA.
163