Commit History

revert
a7c5fd6

liuyang commited on

Update VAD parameters and enhance silence detection logic in AudioJobRunner. This commit modifies the VAD aggressiveness setting and adjusts silence detection thresholds, improving the accuracy of silence spans and fallback mechanisms. Additionally, it refines the handling of stereo audio channels during VAD processing.
46f053a

liuyang commited on

Refactor VAD alignment logic in AudioJobRunner to improve silence detection and expand search window. This update introduces a new method for attempting silence detection with adjustable seek windows, enhancing the flexibility and accuracy of audio chunk processing.
1739f44

liuyang commited on

Enhance logging in AudioJobRunner for VAD alignment and chunk processing. This update adds detailed logging statements to track VAD alignment status, chunk durations, and silence detection, improving traceability and debugging capabilities during audio processing.
37570e3

liuyang commited on

Refactor chunk processing in AudioJobRunner to enhance VAD alignment and overlap handling. This update improves the logic for determining chunk durations and ensures proper progress through audio segments, while also adding alignment parameters when VAD is enabled.
370f42a

liuyang commited on

Add VAD-based features to AudioJobRunner for improved silence detection and alignment
570e908

liuyang commited on

fix value
f69d26d

liuyang commited on

Enhance dual-mono detection in _stereo_decision_quick method of AudioJobRunner by implementing VAD-based similarity metrics, improving audio processing accuracy and robustness with fallback to previous methods if VAD is unavailable.
00d0703

liuyang commited on

Refactor comments and improve readability in audiojob.py, clarifying thresholds and processing logic in the AudioJobRunner class for better maintainability.
2c2b916

liuyang commited on

Refactor dual-mono detection in _stereo_decision_quick method of AudioJobRunner, adjusting thresholds and improving correlation metrics for enhanced audio processing accuracy.
f3ec09a

liuyang commited on

Add dual-mono detection logic to _stereo_decision_quick method in AudioJobRunner, enhancing audio processing accuracy with new thresholds and correlation metrics.
d03f859

liuyang commited on

Update chunk_target_ms in DEFAULT_PRESETS to improve audio processing window size configuration.
74c0262

liuyang commited on

Add chunk_target_ms configuration to S3 upload settings in app.py for improved audio processing control.
c854aa9

liuyang commited on

Remove LocalStorageAdapter import from app.py, simplifying AudioJobRunner integration.
7b72138

liuyang commited on

Refactor AudioJobRunner to remove LocalStorageAdapter dependency; update manifest handling and improve chunk processing logic for fixed window splits with configurable overlap.
fbedc7c

liuyang commited on

Update app.py to include S3 prefix input in audio job UI; enhance logging in audiojob.py for command execution, upload stages, and VAD processing, improving traceability and error handling.
e8c330f

liuyang commited on

Refactor audio processing command in AudioJobRunner to utilize filter_complex for channel splitting, improving clarity and efficiency of audio output generation.
3d01174

liuyang commited on

Fix audio processing command in AudioJobRunner by adding input stream reference for channel splitting in filter_complex.
89dc159

liuyang commited on

Implement detailed logging in AudioJobRunner for upload, probe, and preprocess stages; enhance error handling for VAD pipeline fallback.
4c2c2a3

liuyang commited on

Add AudioJob integration to app.py with UI for running audio jobs and handling manifests. Updated requirements.txt to include webrtcvad and boto3.
a4ab88e

liuyang commited on

Enhance cut_audio function to include error handling for ffmpeg stream copy. Added fallback to re-encode using pydub if the stream copy fails, with improved logging of errors.
a095ed4

liuyang commited on

Fix temporary file handling in cut_audio function by removing unnecessary fd closure.
52cf8f9

liuyang commited on

Refactor cut_audio function to use ffmpeg for audio cutting, preserving original codec and file extension. Improved logging and temporary file handling.
5c87856

liuyang commited on

Specify temporary directory for audio export in cut_audio function
d4de52b

liuyang commited on

Remove info text from audio output component in app.py
50f7140

liuyang commited on

init
4771e75

liuyang commited on

init
e1500f1

liuyang commited on

initial commit
d0d1c87
verified

nsfwalex commited on