Mrkomiljon
/

DeepVoiceGuard

ONNX

English

Model card Files Files and versions

xet

Community

Mrkomiljon commited on Jan 13, 2025

Commit

03cb460

verified ·

1 Parent(s): 3f2a057

Update README.md

Browse files

Files changed (1) hide show

README.md +2 -11

README.md CHANGED Viewed

@@ -10,17 +10,13 @@ metrics:
 # DeepVoiceGuard: Real-Time Audio Authenticity Detection
 **DeepVoiceGuard** is an advanced AI-powered tool for detecting whether an audio file is genuine or AI-generated. Built using RawNet-based architecture and trained on ASVspoof datasets, this model is optimized for real-time inference using ONNX format.
 ---
 ## 🚀 Features
 - **Real-Time Detection:** Analyze audio files quickly and efficiently to determine authenticity.
 - **Sliding Window Processing:** Processes long audio files in segments for accurate classification.
 - **ONNX Optimized:** Faster inference compared to traditional formats.
 - **Interactive Demo:** Test the model using our Streamlit application.
 ---
 ## 📚 Model Overview
 - **Architecture:** RawNet-based Neural Network
 - **Frameworks Used:** PyTorch, ONNX
@@ -28,9 +24,7 @@ metrics:
 - **Classes:**
   - **Real:** Genuine human speech
   - **Fake:** AI-generated or spoofed audio
 ---
 ## 🛠 Installation
 Install the necessary dependencies:
 ```bash
@@ -58,7 +52,6 @@ def pad(x, max_len=64600):
     num_repeats = (max_len // x_len) + 1
     padded_x = np.tile(x, (1, num_repeats))[:, :max_len][0]
     return padded_x
 # Preprocess audio for a single segment
 def preprocess_audio_segment(segment, cut=64600):
     """
@@ -82,7 +75,6 @@ def download_model(url, local_path="RawNet_model.onnx"):
             else:
                 raise Exception("Failed to download ONNX model")
     return local_path
 # Sliding window prediction function
 def predict_with_sliding_window(audio_path, onnx_model_path, window_size=64600, step_size=64600, sample_rate=16000):
     """
@@ -125,7 +117,6 @@ def predict_with_sliding_window(audio_path, onnx_model_path, window_size=64600,
 result = predict("example.wav")
 print(f"Prediction: {result}")
 ```
 📊 Performance Metrics
 Equal Error Rate (EER): 4.21%
 Accuracy: 95.8%
@@ -137,5 +128,5 @@ This project is licensed under the MIT License.
 ✉️ Contact
 For inquiries or support, please contact:
-- GitHub: (Mrkomiljon)[https://github.com/Mrkomiljon/DeepVoiceGuard]
-- Hugging Face: (DeepVoiceGuard)[https://huggingface.co/spaces/Mrkomiljon/DeepVoiceGuard]

 # DeepVoiceGuard: Real-Time Audio Authenticity Detection
 **DeepVoiceGuard** is an advanced AI-powered tool for detecting whether an audio file is genuine or AI-generated. Built using RawNet-based architecture and trained on ASVspoof datasets, this model is optimized for real-time inference using ONNX format.
 ---
 ## 🚀 Features
 - **Real-Time Detection:** Analyze audio files quickly and efficiently to determine authenticity.
 - **Sliding Window Processing:** Processes long audio files in segments for accurate classification.
 - **ONNX Optimized:** Faster inference compared to traditional formats.
 - **Interactive Demo:** Test the model using our Streamlit application.
 ---
 ## 📚 Model Overview
 - **Architecture:** RawNet-based Neural Network
 - **Frameworks Used:** PyTorch, ONNX
 - **Classes:**
   - **Real:** Genuine human speech
   - **Fake:** AI-generated or spoofed audio
 ---
 ## 🛠 Installation
 Install the necessary dependencies:
 ```bash
     num_repeats = (max_len // x_len) + 1
     padded_x = np.tile(x, (1, num_repeats))[:, :max_len][0]
     return padded_x
 # Preprocess audio for a single segment
 def preprocess_audio_segment(segment, cut=64600):
     """
             else:
                 raise Exception("Failed to download ONNX model")
     return local_path
 # Sliding window prediction function
 def predict_with_sliding_window(audio_path, onnx_model_path, window_size=64600, step_size=64600, sample_rate=16000):
     """
 result = predict("example.wav")
 print(f"Prediction: {result}")
 ```
 📊 Performance Metrics
 Equal Error Rate (EER): 4.21%
 Accuracy: 95.8%
 ✉️ Contact
 For inquiries or support, please contact:
+- GitHub: [Mrkomiljon](https://github.com/Mrkomiljon/DeepVoiceGuard)
+- Hugging Face: [DeepVoiceGuard](https://huggingface.co/spaces/Mrkomiljon/DeepVoiceGuard)