msmaje commited on
Commit
c5b30b0
·
verified ·
1 Parent(s): 1bf3830

Updated README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -13
README.md CHANGED
@@ -1,13 +1,35 @@
1
- ---
2
- title: VoiceAccess
3
- emoji: 🐢
4
- colorFrom: pink
5
- colorTo: blue
6
- sdk: gradio
7
- sdk_version: 5.12.0
8
- app_file: app.py
9
- pinned: false
10
- short_description: A voice access control model Adati
11
- ---
12
-
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Voice Access Control System
2
+
3
+ This is a deep learning-based voice access control system that can verify whether a person should be granted access based on their voice recording.
4
+
5
+ ## Description
6
+
7
+ The system uses a convolutional neural network to analyze mel spectrograms of voice recordings and determine if the speaker is authorized. It processes audio input through several steps:
8
+
9
+ 1. Audio preprocessing (resampling, normalization)
10
+ 2. Mel spectrogram generation
11
+ 3. Deep learning model analysis
12
+ 4. Access decision with confidence score
13
+
14
+ ## Usage
15
+
16
+ 1. Click the audio input button or drag and drop an audio file
17
+ 2. Wait for the system to process the recording
18
+ 3. View the access result and confidence score
19
+
20
+ ## Technical Details
21
+
22
+ - Model: Custom CNN architecture (VoiceAccessNet)
23
+ - Input: Audio files (WAV, MP3)
24
+ - Audio processing: 16kHz sample rate, mel spectrogram features
25
+ - Output: Binary classification (Access Granted/Denied) with confidence score
26
+
27
+ ## References
28
+
29
+ - Model training code and dataset details: [Link to your repository]
30
+ - Based on PyTorch and torchaudio
31
+ - Deployed using Gradio and Hugging Face Spaces
32
+
33
+ ## License
34
+
35
+ [Your chosen license]