--- language: en tags: - audio - noise-reduction - speech-enhancement - deep-learning datasets: - haydarkadioglu/speech-noise-dataset license: mit pipeline_tag: audio-to-audio --- # Nocle - Noise Cleaner Model ## Model Description This is a deep learning model trained for audio noise reduction, specifically targeting speech enhancement. The model is designed to effectively remove background noise while preserving the quality and intelligibility of speech. ## Intended Use - Speech enhancement in noisy environments - Audio cleanup for voice recordings - Background noise reduction - Audio quality improvement for voice applications ## Training Data The model was trained using the [Speech Noise Dataset](https://huggingface.co/datasets/haydarkadioglu/speech-noise-dataset), which includes: - Clean speech recordings - Various environmental noise samples - Mixed noisy speech samples for training ## Model Architecture - Neural network optimized for audio processing - Input: Noisy audio waveform - Output: Enhanced clean audio waveform - Processing: 16kHz sampling rate ## Usage This model is integrated into the Nocle application. To use it: 1. Clone the repository: ```bash git clone https://github.com/haydarkadioglu/nocle-app.git cd nocle-app ``` 2. Install dependencies: ```bash pip install -r requirements.txt ``` 3. Run the application: ```bash python main.py ``` For programmatic usage: ```python from setup import Setup from model_handler import ModelHandler # Model will be automatically downloaded model = ModelHandler() # Process audio cleaned_audio = model.process_audio(noisy_audio) ``` For more detailed instructions and source code, visit the [GitHub Repository](https://github.com/haydarkadioglu/nocle-app). ## Performance and Limitations ### Strengths - Effective at removing common background noise - Preserves speech clarity - Real-time processing capability ### Limitations - Optimized for 16kHz audio - Best suited for speech enhancement - May require adjustment for non-speech audio ## Training Procedure ### Training Data - Dataset: [Speech Noise Dataset](https://huggingface.co/datasets/haydarkadioglu/speech-noise-dataset) - Audio format: 16kHz WAV files - Mixed with various noise types at different SNR levels ### Training Parameters - Optimizer: Adam - Loss function: Mean Squared Error (MSE) - Batch size: 12000 samples ## Ethical Considerations - The model is designed for general noise reduction and should be used responsibly - Users should respect privacy when processing audio containing personal information - The model should not be used for deceptive audio manipulation ## Links - [GitHub Repository](https://github.com/haydarkadioglu/nocle-app) - [Training Dataset](https://huggingface.co/datasets/haydarkadioglu/speech-noise-dataset) - [Model on Hugging Face](https://huggingface.co/haydarkadioglu/nocle-app) ## Citation ```bibtex @misc{nocle2025, author = {Haydar Kadıoğlu}, title = {Nocle: Noise Cleaner Model}, year = {2025}, publisher = {Hugging Face}, howpublished = {\url{https://huggingface.co/haydarkadioglu/nocle-app}} } ```