Update README: correct progressive update interval to 500ms
Browse files
README.md
CHANGED
|
@@ -26,7 +26,7 @@ Real-time speech recognition with smart progressive streaming, powered by **Para
|
|
| 26 |
- **β‘ Smart Progressive Streaming**: Intelligent window management with sentence-aware boundaries
|
| 27 |
- Growing window (0-15s) for accuracy
|
| 28 |
- Sentence-aware sliding window (>15s) to maintain context
|
| 29 |
-
- Real-time updates every
|
| 30 |
|
| 31 |
- **π Privacy-First**: All processing happens locally in your browser - no data sent to servers
|
| 32 |
|
|
@@ -62,7 +62,7 @@ This demo implements the smart progressive streaming algorithm from the [speech-
|
|
| 62 |
|
| 63 |
1. **Growing Window (0-15s)**:
|
| 64 |
- Accumulates audio for better accuracy
|
| 65 |
-
- Re-transcribes entire buffer every
|
| 66 |
|
| 67 |
2. **Sliding Window (>15s)**:
|
| 68 |
- Locks completed sentences as "fixed"
|
|
@@ -78,7 +78,7 @@ Web Audio API (16kHz)
|
|
| 78 |
β
|
| 79 |
Audio Processor (accumulate chunks)
|
| 80 |
β
|
| 81 |
-
Progressive Streaming Handler (
|
| 82 |
β
|
| 83 |
Web Worker β Parakeet ONNX Model (via parakeet.js + WebGPU)
|
| 84 |
β
|
|
@@ -106,7 +106,7 @@ Transcription Display (yellow fixed + cyan active)
|
|
| 106 |
## Performance
|
| 107 |
|
| 108 |
- **First result**: <500ms latency
|
| 109 |
-
- **Progressive updates**:
|
| 110 |
- **RTF (Real-time Factor)**: ~0.3-0.5x with WebGPU
|
| 111 |
- **Model loading**: 1-2 minutes (one-time, cached locally)
|
| 112 |
|
|
|
|
| 26 |
- **β‘ Smart Progressive Streaming**: Intelligent window management with sentence-aware boundaries
|
| 27 |
- Growing window (0-15s) for accuracy
|
| 28 |
- Sentence-aware sliding window (>15s) to maintain context
|
| 29 |
+
- Real-time updates every 500ms
|
| 30 |
|
| 31 |
- **π Privacy-First**: All processing happens locally in your browser - no data sent to servers
|
| 32 |
|
|
|
|
| 62 |
|
| 63 |
1. **Growing Window (0-15s)**:
|
| 64 |
- Accumulates audio for better accuracy
|
| 65 |
+
- Re-transcribes entire buffer every 500ms
|
| 66 |
|
| 67 |
2. **Sliding Window (>15s)**:
|
| 68 |
- Locks completed sentences as "fixed"
|
|
|
|
| 78 |
β
|
| 79 |
Audio Processor (accumulate chunks)
|
| 80 |
β
|
| 81 |
+
Progressive Streaming Handler (500ms updates)
|
| 82 |
β
|
| 83 |
Web Worker β Parakeet ONNX Model (via parakeet.js + WebGPU)
|
| 84 |
β
|
|
|
|
| 106 |
## Performance
|
| 107 |
|
| 108 |
- **First result**: <500ms latency
|
| 109 |
+
- **Progressive updates**: 500ms cadence
|
| 110 |
- **RTF (Real-time Factor)**: ~0.3-0.5x with WebGPU
|
| 111 |
- **Model loading**: 1-2 minutes (one-time, cached locally)
|
| 112 |
|