Muhammed Essam committed on
Commit e459b45 · 1 Parent(s): 3dcdd5f

Switch to Whisper base for faster voice processing

- Change from small (20s) to base (~5-8s) model
- Better CPU performance while maintaining good accuracy
- Reduces total voice query time from 20s to 8-10s
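
The latency figures above could be checked with a small timing harness. The sketch below is an assumption, not code from this commit: `time_transcription` and `fake_transcribe` are hypothetical names, and the stub stands in for whatever transcription call the app's `VoiceProcessor` actually exposes.

```python
import statistics
import time
from typing import Callable, List


def time_transcription(
    transcribe: Callable[[str], str], audio_path: str, runs: int = 3
) -> List[float]:
    """Return the wall-clock seconds taken by each of `runs` calls to `transcribe`."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        transcribe(audio_path)
        timings.append(time.perf_counter() - start)
    return timings


def fake_transcribe(audio_path: str) -> str:
    """Hypothetical stand-in for a real Whisper call; it only sleeps briefly."""
    time.sleep(0.01)
    return "transcript of " + audio_path


if __name__ == "__main__":
    timings = time_transcription(fake_transcribe, "sample.wav", runs=3)
    print(f"median: {statistics.median(timings):.3f}s over {len(timings)} runs")
```

Passing the real transcription function in place of `fake_transcribe`, once with the small model and once with base, would give comparable per-query numbers on the same CPU.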

Files changed (2)
  1. README.md +1 -1
  2. app.py +2 -2
README.md CHANGED
@@ -92,7 +92,7 @@ Priority given to division accuracy:
 ### Models Used
 - **Embeddings**: `sentence-transformers/all-MiniLM-L6-v2` - Fast, lightweight semantic search
 - **Name Extraction**: `urchade/gliner_small-v2.1` - Zero-shot NER for person names
-- **Speech-to-Text**: `openai/whisper-small` - Better accuracy for multi-language transcription
+- **Speech-to-Text**: `openai/whisper-base` - Optimized for CPU with good accuracy
 
 ### Confidence Scoring
 
app.py CHANGED
@@ -41,9 +41,9 @@ def initialize_services():
     name_extractor = NameExtractor(model_name="urchade/gliner_small-v2.1")
     logger.info("✓ Name extractor ready!")
 
-    # Initialize voice processor (using small model for better accuracy)
+    # Initialize voice processor (using base model for speed/accuracy balance)
     logger.info("Loading Whisper model...")
-    voice_processor = VoiceProcessor(model_size="small") # Using small for better accuracy
+    voice_processor = VoiceProcessor(model_size="base") # Using base for best speed/accuracy on CPU
     logger.info("✓ Voice processor ready!")
 
     # Initialize contact search
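
The `model_size` argument in `app.py` and the `openai/whisper-base` identifier in the README suggest a simple mapping from size name to checkpoint. The following is a minimal sketch under that assumption; `resolve_whisper_checkpoint` is a hypothetical helper, and the repository's actual `VoiceProcessor` may load its model differently.

```python
# Size names for which an `openai/whisper-<size>` checkpoint is published.
SUPPORTED_SIZES = ("tiny", "base", "small", "medium", "large")


def resolve_whisper_checkpoint(model_size: str = "base") -> str:
    """Map a size name such as "base" to a checkpoint id such as "openai/whisper-base"."""
    size = model_size.strip().lower()
    if size not in SUPPORTED_SIZES:
        raise ValueError(
            f"unknown Whisper size {model_size!r}; expected one of {SUPPORTED_SIZES}"
        )
    return f"openai/whisper-{size}"


if __name__ == "__main__":
    print(resolve_whisper_checkpoint("small"))  # the checkpoint before this commit
    print(resolve_whisper_checkpoint("base"))   # the checkpoint after this commit
```

Validating the size up front would make a typo in `model_size` fail at startup rather than during the first voice query.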