jadechoghari
/

VoiceRestore

@@ -7,9 +7,7 @@ library_name: transformers
 VoiceRestore is a cutting-edge speech restoration model designed to significantly enhance the quality of degraded voice recordings. Leveraging flow-matching transformers, this model excels at addressing a wide range of audio imperfections commonly found in speech, including background noise, reverberation, distortion, and signal loss.
-Demo of audio restorations: [VoiceRestore](https://sparkling-rabanadas-3082be.netlify.app/)
-Credits: This repository is based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)
 ## Usage
 ``` bash
@@ -33,28 +31,23 @@ model("test_input.wav", "test_output.wav")
 ## Example
 ### Degraded Input:
-![Degraded Input](./imgs/degraded.png "Degraded Input")
-Degraded audio (reverberation, distortion, noise, random cut):
-**Note**: Adjust your volume before playing the degraded audio sample, as it may contain distortions.
-https://github.com/user-attachments/assets/0c030274-60b5-41a4-abe6-59a3f1bc934b
 ---
 ### Restored (steps=32, cfg=1.0):
-![Restored](./imgs/restored.png "Restored")
 Restored audio - 16 steps, strength 0.5:
-https://github.com/user-attachments/assets/fdbbb988-9bd2-4750-bddd-32bd5153d254
----
-### Ground Truth:
-![Ground Truth](./imgs/ground_truth.png "Ground Truth")
 ---
 ## Key Features
@@ -65,7 +58,6 @@ https://github.com/user-attachments/assets/fdbbb988-9bd2-4750-bddd-32bd5153d254
 ---
 ## Model Details
 - **Architecture**: Flow-matching transformer
@@ -99,4 +91,5 @@ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file
 ## Acknowledgments
 - Based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)
-- Special thanks to the open-source community for their invaluable contributions.

 VoiceRestore is a cutting-edge speech restoration model designed to significantly enhance the quality of degraded voice recordings. Leveraging flow-matching transformers, this model excels at addressing a wide range of audio imperfections commonly found in speech, including background noise, reverberation, distortion, and signal loss.
+It is based on this [repo](https://github.com/skirdey/voicerestore) & demo of audio restorations: [VoiceRestore](https://sparkling-rabanadas-3082be.netlify.app/)
 ## Usage
 ``` bash
 ## Example
 ### Degraded Input:
+### Degraded Input Audio
+<audio controls>
+  <source src="https://huggingface.co/jadechoghari/VoiceRestore/resolve/main/test_input.wav" type="audio/mpeg">
+  Your browser does not support the audio element.
+</audio>
 ---
 ### Restored (steps=32, cfg=1.0):
+<audio controls>
+  <source src="https://huggingface.co/jadechoghari/VoiceRestore/resolve/main/test_output.wav" type="audio/mpeg">
+  Your browser does not support the audio element.
+</audio>
 Restored audio - 16 steps, strength 0.5:
 ---
 ## Key Features
 ---
 ## Model Details
 - **Architecture**: Flow-matching transformer
 ## Acknowledgments
 - Based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)
+- Special thanks to the open-source community for their invaluable contributions.
+- Credits: This repository is based on the [E2-TTS implementation by Lucidrains](https://github.com/lucidrains/e2-tts-pytorch)