Spaces:
Sleeping
Sleeping
Muhammed Essam
commited on
Commit
·
e459b45
1
Parent(s):
3dcdd5f
Switch to Whisper base
Browse filesfor faster voice processing
- Change from small (20s) to base
(~5-8s) model
- Better CPU performance while
maintaining good accuracy
- Reduces total voice query time from
20s to 8-10s
README.md
CHANGED
|
@@ -92,7 +92,7 @@ Priority given to division accuracy:
|
|
| 92 |
### Models Used
|
| 93 |
- **Embeddings**: `sentence-transformers/all-MiniLM-L6-v2` - Fast, lightweight semantic search
|
| 94 |
- **Name Extraction**: `urchade/gliner_small-v2.1` - Zero-shot NER for person names
|
| 95 |
-
- **Speech-to-Text**: `openai/whisper-
|
| 96 |
|
| 97 |
### Confidence Scoring
|
| 98 |
|
|
|
|
| 92 |
### Models Used
|
| 93 |
- **Embeddings**: `sentence-transformers/all-MiniLM-L6-v2` - Fast, lightweight semantic search
|
| 94 |
- **Name Extraction**: `urchade/gliner_small-v2.1` - Zero-shot NER for person names
|
| 95 |
+
- **Speech-to-Text**: `openai/whisper-base` - Optimized for CPU with good accuracy
|
| 96 |
|
| 97 |
### Confidence Scoring
|
| 98 |
|
app.py
CHANGED
|
@@ -41,9 +41,9 @@ def initialize_services():
|
|
| 41 |
name_extractor = NameExtractor(model_name="urchade/gliner_small-v2.1")
|
| 42 |
logger.info("✓ Name extractor ready!")
|
| 43 |
|
| 44 |
-
# Initialize voice processor (using
|
| 45 |
logger.info("Loading Whisper model...")
|
| 46 |
-
voice_processor = VoiceProcessor(model_size="
|
| 47 |
logger.info("✓ Voice processor ready!")
|
| 48 |
|
| 49 |
# Initialize contact search
|
|
|
|
| 41 |
name_extractor = NameExtractor(model_name="urchade/gliner_small-v2.1")
|
| 42 |
logger.info("✓ Name extractor ready!")
|
| 43 |
|
| 44 |
+
# Initialize voice processor (using base model for speed/accuracy balance)
|
| 45 |
logger.info("Loading Whisper model...")
|
| 46 |
+
voice_processor = VoiceProcessor(model_size="base") # Using base for best speed/accuracy on CPU
|
| 47 |
logger.info("✓ Voice processor ready!")
|
| 48 |
|
| 49 |
# Initialize contact search
|