Update VAD parameters and enhance silence detection logic in AudioJobRunner. This commit modifies the VAD aggressiveness setting and adjusts silence detection thresholds, improving the accuracy of silence spans and fallback mechanisms. Additionally, it refines the handling of stereo audio channels during VAD processing.
46f053a
liuyang committed
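As an illustration of the silence-span step this commit describes, here is a minimal sketch that turns per-frame speech flags (the kind of yes/no decision a VAD produces for each frame) into silence spans. The function name `silence_spans`, the 30 ms frame size, and the 300 ms minimum are assumptions for the example, not values taken from AudioJobRunner.

```python
def silence_spans(speech_flags, frame_ms=30, min_silence_ms=300):
    """Return (start_ms, end_ms) spans where consecutive frames are non-speech.

    speech_flags: iterable of booleans, one per frame (True = speech).
    frame_ms and min_silence_ms are illustrative defaults, not project values.
    """
    spans = []
    run_start = None
    # A trailing True acts as a sentinel that closes a silence run at the end.
    for i, is_speech in enumerate(list(speech_flags) + [True]):
        if not is_speech:
            if run_start is None:
                run_start = i
            continue
        if run_start is not None:
            start_ms = run_start * frame_ms
            end_ms = i * frame_ms
            # Keep only runs long enough to count as a usable pause.
            if end_ms - start_ms >= min_silence_ms:
                spans.append((start_ms, end_ms))
            run_start = None
    return spans
```

Short pauses below the minimum are dropped, which is one way to keep brief gaps between words from being treated as cut points.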
Refactor VAD alignment logic in AudioJobRunner to improve silence detection and expand search window. This update introduces a new method for attempting silence detection with adjustable seek windows, enhancing the flexibility and accuracy of audio chunk processing.
1739f44
liuyang committed
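The idea of retrying silence detection with progressively wider seek windows can be sketched as follows. This is a hypothetical helper, not the method added in the commit: `snap_to_silence`, the window sizes, and the choice of the span midpoint as the cut point are all assumptions.

```python
def snap_to_silence(target_ms, spans, windows_ms=(500, 1000, 2000)):
    """Move a planned cut point to the nearest silence midpoint.

    spans: list of (start_ms, end_ms) silence spans.
    windows_ms: search radii tried in order, narrowest first (assumed values).
    Returns target_ms unchanged if no silence lies within the widest window.
    """
    midpoints = [(start + end) // 2 for start, end in spans]
    for radius in windows_ms:
        candidates = [m for m in midpoints if abs(m - target_ms) <= radius]
        if candidates:
            # Prefer the candidate closest to the originally planned cut.
            return min(candidates, key=lambda m: abs(m - target_ms))
    return target_ms
```

Trying the narrow window first keeps chunk lengths close to the target whenever a nearby pause exists, and only accepts a larger deviation when it does not.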
Enhance logging in AudioJobRunner for VAD alignment and chunk processing. This update adds detailed logging statements to track VAD alignment status, chunk durations, and silence detection, improving traceability and debugging capabilities during audio processing.
37570e3
liuyang committed
Refactor chunk processing in AudioJobRunner to enhance VAD alignment and overlap handling. This update improves the logic for determining chunk durations and ensures proper progress through audio segments, while also adding alignment parameters when VAD is enabled.
370f42a
liuyang committed
Add VAD-based features to AudioJobRunner for improved silence detection and alignment
570e908
liuyang committed
fix value
f69d26d
liuyang committed
Enhance dual-mono detection in _stereo_decision_quick method of AudioJobRunner by implementing VAD-based similarity metrics, improving audio processing accuracy and robustness with fallback to previous methods if VAD is unavailable.
00d0703
liuyang committed
Refactor comments and improve readability in audiojob.py, clarifying thresholds and processing logic in the AudioJobRunner class for better maintainability.
2c2b916
liuyang committed
Refactor dual-mono detection in _stereo_decision_quick method of AudioJobRunner, adjusting thresholds and improving correlation metrics for enhanced audio processing accuracy.
f3ec09a
liuyang committed
Add dual-mono detection logic to _stereo_decision_quick method in AudioJobRunner, enhancing audio processing accuracy with new thresholds and correlation metrics.
d03f859
liuyang committed
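For context on what a correlation-based dual-mono check can look like, here is a minimal sketch using the Pearson correlation between the left and right channel samples. The function names and the 0.98 threshold are assumptions for illustration; the actual metrics and thresholds in `_stereo_decision_quick` are not shown in this history.

```python
import math


def channel_correlation(left, right):
    """Pearson correlation of two sample sequences (0.0 if undefined)."""
    n = min(len(left), len(right))
    if n == 0:
        return 0.0
    l, r = left[:n], right[:n]
    mean_l = sum(l) / n
    mean_r = sum(r) / n
    cov = sum((a - mean_l) * (b - mean_r) for a, b in zip(l, r))
    var_l = sum((a - mean_l) ** 2 for a in l)
    var_r = sum((b - mean_r) ** 2 for b in r)
    if var_l == 0 or var_r == 0:
        # A constant (e.g. silent) channel has no defined correlation.
        return 0.0
    return cov / math.sqrt(var_l * var_r)


def looks_dual_mono(left, right, threshold=0.98):
    """Treat near-identical channels as dual mono (threshold is assumed)."""
    return channel_correlation(left, right) >= threshold
```

A file flagged this way can be processed as a single channel, while genuinely different channels (for example, one speaker per channel) stay separate.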
Update chunk_target_ms in DEFAULT_PRESETS to improve audio processing window size configuration.
74c0262
liuyang committed
Add chunk_target_ms configuration to S3 upload settings in app.py for improved audio processing control.
c854aa9
liuyang committed
Remove LocalStorageAdapter import from app.py, simplifying AudioJobRunner integration.
7b72138
liuyang committed
Refactor AudioJobRunner to remove LocalStorageAdapter dependency; update manifest handling and improve chunk processing logic for fixed window splits with configurable overlap.
fbedc7c
liuyang committed
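The fixed-window split with configurable overlap mentioned in this commit can be sketched roughly as below. `chunk_target_ms` is the setting named elsewhere in this history; the function name `fixed_windows` and the `overlap_ms` parameter are assumptions for the example.

```python
def fixed_windows(total_ms, chunk_target_ms, overlap_ms=0):
    """Split [0, total_ms) into fixed windows that overlap by overlap_ms.

    Returns a list of (start_ms, end_ms) tuples; the last window is
    truncated at total_ms.
    """
    if chunk_target_ms <= 0:
        raise ValueError("chunk_target_ms must be positive")
    if not 0 <= overlap_ms < chunk_target_ms:
        raise ValueError("overlap_ms must be in [0, chunk_target_ms)")

    step = chunk_target_ms - overlap_ms  # always > 0, so the loop advances
    windows = []
    start = 0
    while start < total_ms:
        end = min(start + chunk_target_ms, total_ms)
        windows.append((start, end))
        if end == total_ms:
            break
        start += step
    return windows
```

Requiring the overlap to be strictly smaller than the window guarantees forward progress through the audio, so the loop cannot stall on the same segment.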
Update app.py to include S3 prefix input in audio job UI; enhance logging in audiojob.py for command execution, upload stages, and VAD processing, improving traceability and error handling.
e8c330f
liuyang committed
Refactor audio processing command in AudioJobRunner to utilize filter_complex for channel splitting, improving clarity and efficiency of audio output generation.
3d01174
liuyang committed
Fix audio processing command in AudioJobRunner by adding input stream reference for channel splitting in filter_complex.
89dc159
liuyang committed
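To show what an input stream reference in `filter_complex` looks like, here is a sketch that builds an ffmpeg argument list using the `channelsplit` filter. The function name, output file names, and the `[left]`/`[right]` labels are assumptions; the exact command used by AudioJobRunner is not shown in this history.

```python
def build_channel_split_cmd(src, left_out, right_out):
    """Build an ffmpeg command that writes each stereo channel to its own file.

    The leading [0:a] is the input stream reference: it tells the filter
    graph to read the audio stream of the first input.
    """
    graph = "[0:a]channelsplit=channel_layout=stereo[left][right]"
    return [
        "ffmpeg", "-y",
        "-i", src,
        "-filter_complex", graph,
        "-map", "[left]", left_out,    # first labeled output -> left file
        "-map", "[right]", right_out,  # second labeled output -> right file
    ]
```

The list can be passed to `subprocess.run(cmd, check=True)` on a machine where ffmpeg is installed; building it as a list avoids shell quoting problems with the bracketed labels.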
Implement detailed logging in AudioJobRunner for upload, probe, and preprocess stages; enhance error handling for VAD pipeline fallback.
4c2c2a3
liuyang committed
Add AudioJob integration to app.py with UI for running audio jobs and handling manifests. Updated requirements.txt to include webrtcvad and boto3.
a4ab88e
liuyang committed
Enhance cut_audio function to include error handling for ffmpeg stream copy. Added fallback to re-encode using pydub if the stream copy fails, with improved logging of errors.
a095ed4
liuyang committed
Fix temporary file handling in cut_audio function by removing unnecessary fd closure.
52cf8f9
liuyang committed
Refactor cut_audio function to use ffmpeg for audio cutting, preserving original codec and file extension. Improved logging and temporary file handling.
5c87856
liuyang committed
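A cut that preserves the original codec and file extension can be sketched with ffmpeg's stream copy (`-c copy`). The helper names below are hypothetical and the real `cut_audio` signature is not shown in this history; the sketch only builds the command and the output path.

```python
import os


def output_path_for(src, out_dir, suffix="_cut"):
    """Derive an output path in out_dir that keeps the source extension."""
    stem, ext = os.path.splitext(os.path.basename(src))
    return os.path.join(out_dir, stem + suffix + ext)


def build_cut_cmd(src, dst, start_s, end_s):
    """Build an ffmpeg command that copies [start_s, end_s) without re-encoding."""
    if end_s <= start_s:
        raise ValueError("end_s must be greater than start_s")
    return [
        "ffmpeg", "-y",
        "-ss", str(start_s),          # seek in the input before decoding
        "-t", str(end_s - start_s),   # read only this many seconds
        "-i", src,
        "-c", "copy",                 # stream copy: keep the original codec
        dst,
    ]
```

Stream copy is fast and lossless but cuts on packet boundaries, so the result may be slightly off the requested times; that is one reason a re-encoding fallback can be useful when the copy fails.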
Specify temporary directory for audio export in cut_audio function
d4de52b
liuyang committed
Remove info text from audio output component in app.py