The project adapts a SpeechT5 TTS backbone and injects **two conditioning signals**:

- **Fusion**: a trainable **StyleSpeakerFusion** merges both vectors into the **512‑D** `speaker_embeddings` tensor expected by SpeechT5 during generation. The official **SpeechT5 HiFi‑GAN** vocoder renders the waveform.
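The fusion step can be sketched as follows. This is an illustrative PyTorch module, not the repository's actual `StyleSpeakerFusion` implementation; the input dimensions (a 192‑D ECAPA‑TDNN speaker vector and a 128‑D style vector) are assumptions for the sketch, and only the 512‑D output matches what SpeechT5 requires:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class StyleSpeakerFusionSketch(nn.Module):
    """Illustrative fusion of a speaker vector and a style vector into the
    512-D `speaker_embeddings` tensor that SpeechT5 conditions on."""

    def __init__(self, speaker_dim: int = 192, style_dim: int = 128, out_dim: int = 512):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(speaker_dim + style_dim, out_dim),
            nn.Tanh(),
            nn.Linear(out_dim, out_dim),
        )

    def forward(self, speaker_vec: torch.Tensor, style_vec: torch.Tensor) -> torch.Tensor:
        # Concatenate the two conditioning vectors and project to 512-D.
        fused = self.proj(torch.cat([speaker_vec, style_vec], dim=-1))
        # SpeechT5 examples typically L2-normalize speaker embeddings.
        return F.normalize(fused, dim=-1)

fusion = StyleSpeakerFusionSketch()
emb = fusion(torch.randn(1, 192), torch.randn(1, 128))
print(emb.shape)  # torch.Size([1, 512])
```

The resulting tensor takes the place of a plain x‑vector in `SpeechT5ForTextToSpeech.generate_speech(input_ids, speaker_embeddings=emb, vocoder=vocoder)`, with the SpeechT5 HiFi‑GAN vocoder rendering the waveform.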
- **Developed by:** Amirhossein Yousefiramandi (GitHub: `amirhossein-yousefi`)
- **Model type:** TTS with emotion‑style transfer (recipe + training/inference code)
- **Language(s):** Primarily **English**
- **License:** Repository currently has **no LICENSE file**; treat code as “all rights reserved” unless the author adds a license. Base model licenses are listed in the **License** section below.

Use the [MLCO2 Impact calculator](https://mlco2.github.io/impact#compute) for your own estimates.

## Glossary
- **Style transfer (speech):** Conditioning TTS on reference audio to transfer prosodic/emotional characteristics.
- **Speaker embeddings:** Numeric vectors capturing speaker timbre (here from ECAPA‑TDNN).
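As a toy illustration of the speaker‑embedding idea: two embeddings of the same voice should score higher under cosine similarity than embeddings of different voices. The vectors below are synthetic stand‑ins, not real ECAPA‑TDNN output, and the 192‑D size is an assumption:

```python
import numpy as np

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity, the usual comparison metric for speaker embeddings."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

rng = np.random.default_rng(0)
# Synthetic stand-ins for 192-D speaker embeddings (dimension assumed).
speaker_a = rng.normal(size=192)
speaker_a_again = speaker_a + rng.normal(scale=0.1, size=192)  # same voice, new utterance
speaker_b = rng.normal(size=192)                               # a different voice

same_pair = cosine(speaker_a, speaker_a_again)
cross_pair = cosine(speaker_a, speaker_b)
print(same_pair > cross_pair)  # the same-speaker pair scores higher
```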
- **SageMaker utilities:** The repo includes scripts for launching training jobs and deploying real‑time/async inference endpoints.
## Model Card Authors
- Repository & implementation: **Amirhossein Yousefiramandi** (`@amirhossein-yousefi`).