ms180's picture
Upload folder using huggingface_hub
f56040d verified
2026-01-08 18:33:28 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T18:33:28.816246 ===
2026-01-08 18:33:28 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 18:33:28 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 18:33:28 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 18:33:28 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 18:33:28 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=e649581628c3a7d875d2e1ee9c795b7595d4fcb7, short_commit=e649581628, branch=espnet3/integration_test, worktree=dirty
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
fs: 16000
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 18:34:35 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 18:34:35 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 18:34:35 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 18:34:35 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 18:34:36 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'create_dataset' failed after 0.63s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py", line 64, in create_dataset
result = fn(**extra)
^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py", line 135, in create_dataset
raise RuntimeError("sph2pipe not found in PATH. Please install it.")
RuntimeError: sph2pipe not found in PATH. Please install it.
2026-01-08 18:36:19 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T18:36:19.909692 ===
2026-01-08 18:36:19 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 18:36:19 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 18:36:19 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 18:36:19 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 18:36:19 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=e649581628c3a7d875d2e1ee9c795b7595d4fcb7, short_commit=e649581628, branch=espnet3/integration_test, worktree=dirty
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
fs: 16000
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 18:36:32 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 18:36:32 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 18:36:32 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 18:36:32 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 18:36:32 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.43s using src.create_dataset.create_dataset
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.43s) ===
2026-01-08 18:36:32 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 18:36:32 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:131 | Building tokenizer training text via src.tokenizer.gather_training_text
2026-01-08 18:36:32 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:151 | Collected 4 transcript lines for tokenizer training
2026-01-08 18:36:32 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:155 | Training tokenizer: bpe
2026-01-08 18:36:32 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:156 | Tokenizer output: ./data/bpe_30
2026-01-08 18:36:33 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:165 | Tokenizer training completed in 0.09s
2026-01-08 18:36:33 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.09s) ===
2026-01-08 18:36:33 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 18:36:33 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 18:36:41 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 18:36:41 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 18:36:41 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'collect_stats' failed after 8.38s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 92, in _call_target
return _target_(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: CommonPreprocessor.__init__() missing 1 required positional argument: 'train'
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 84, in collect_stats
return collect_stats(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 68, in collect_stats
trainer = _build_trainer(cfg)
^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 34, in _build_trainer
lit_model = LitESPnetModel(model, cfg)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 47, in __init__
data_organizer = instantiate(config.dataset)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 226, in instantiate
return instantiate_node(
^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 342, in instantiate_node
value = instantiate_node(
^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 347, in instantiate_node
return _call_target(_target_, partial, args, kwargs, full_key)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 97, in _call_target
raise InstantiationException(msg) from e
hydra.errors.InstantiationException: Error in call to target 'espnet2.train.preprocessor.CommonPreprocessor':
TypeError("CommonPreprocessor.__init__() missing 1 required positional argument: 'train'")
full_key: dataset.preprocessor
2026-01-08 18:38:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T18:38:37.023878 ===
2026-01-08 18:38:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 18:38:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 18:38:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 18:38:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 18:38:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=e649581628c3a7d875d2e1ee9c795b7595d4fcb7, short_commit=e649581628, branch=espnet3/integration_test, worktree=dirty
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
fs: 16000
train: true
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 18:38:49 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 18:38:49 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 18:38:49 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 18:38:49 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 18:38:49 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.21s using src.create_dataset.create_dataset
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.22s) ===
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 18:38:49 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:115 | Tokenizer already exists. Skipping train_tokenizer().
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.01s) ===
2026-01-08 18:38:49 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 18:38:49 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 18:38:57 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 18:38:58 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 18:38:58 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:155 | GPU available: True (cuda), used: True
2026-01-08 18:38:58 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:158 | TPU available: False, using: 0 TPU cores
2026-01-08 18:38:58 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_train_batches=1)` was configured so 1 batch per epoch will be used.
2026-01-08 18:38:58 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_val_batches=1)` was configured so 1 batch will be used.
2026-01-08 18:38:58 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 18:38:59 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'collect_stats' failed after 9.63s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 84, in collect_stats
return collect_stats(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 69, in collect_stats
trainer.collect_stats()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/training/trainer.py", line 205, in collect_stats
return self.model.collect_stats(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 438, in collect_stats
collect_stats(
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 512, in collect_stats
sum_dict, sq_dict, count_dict = _collect_stats_common(
^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 375, in _collect_stats_common
results = runner(index_batches)
^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 340, in __call__
return self._run_local(indices)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in _run_local
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in <listcomp>
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 328, in forward
return batch_collect_stats(
^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 44, in batch_collect_stats
item = dataset[i]
~~~~~~~^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/dataset.py", line 151, in __getitem__
transformed = self.transforms[i][1](str(idx), transformed)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/preprocessor.py", line 547, in __call__
data = self._speech_process(data)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/preprocessor.py", line 451, in _speech_process
data[self.speech_name] = self.data_aug(
^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/layers/augmentation.py", line 99, in __call__
print(effect)
IndexError: list index out of range
2026-01-08 18:40:16 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T18:40:16.707156 ===
2026-01-08 18:40:16 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 18:40:16 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 18:40:16 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 18:40:16 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 18:40:16 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=e649581628c3a7d875d2e1ee9c795b7595d4fcb7, short_commit=e649581628, branch=espnet3/integration_test, worktree=dirty
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
fs: 16000
train: true
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 18:40:29 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 18:40:29 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 18:40:29 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 18:40:29 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 18:40:29 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.20s using src.create_dataset.create_dataset
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.20s) ===
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 18:40:29 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:115 | Tokenizer already exists. Skipping train_tokenizer().
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.00s) ===
2026-01-08 18:40:29 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 18:40:29 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 18:40:37 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 18:40:37 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 18:40:37 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:155 | GPU available: True (cuda), used: True
2026-01-08 18:40:37 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:158 | TPU available: False, using: 0 TPU cores
2026-01-08 18:40:37 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_train_batches=1)` was configured so 1 batch per epoch will be used.
2026-01-08 18:40:37 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_val_batches=1)` was configured so 1 batch will be used.
2026-01-08 18:40:38 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 18:40:38 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'collect_stats' failed after 9.00s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 84, in collect_stats
return collect_stats(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 69, in collect_stats
trainer.collect_stats()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/training/trainer.py", line 205, in collect_stats
return self.model.collect_stats(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 438, in collect_stats
collect_stats(
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 512, in collect_stats
sum_dict, sq_dict, count_dict = _collect_stats_common(
^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 375, in _collect_stats_common
results = runner(index_batches)
^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 340, in __call__
return self._run_local(indices)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in _run_local
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in <listcomp>
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 328, in forward
return batch_collect_stats(
^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 44, in batch_collect_stats
item = dataset[i]
~~~~~~~^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/dataset.py", line 151, in __getitem__
transformed = self.transforms[i][1](str(idx), transformed)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/preprocessor.py", line 547, in __call__
data = self._speech_process(data)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/preprocessor.py", line 451, in _speech_process
data[self.speech_name] = self.data_aug(
^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/layers/augmentation.py", line 100, in __call__
if isinstance(effect[1], list):
~~~~~~^^^
IndexError: list index out of range
2026-01-08 19:02:51 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T19:02:51.458046 ===
2026-01-08 19:02:51 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 19:02:51 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 19:02:51 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 19:02:51 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 19:02:51 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=7fae474de17d0a228f7ebb269ada1103e8014e6c, short_commit=7fae474de1, branch=espnet3/integration_test, worktree=dirty
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
fs: 16000
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 19:03:03 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 19:03:03 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 19:03:03 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 19:03:03 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 19:03:04 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 19:03:04 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.18s using src.create_dataset.create_dataset
2026-01-08 19:03:04 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.18s) ===
2026-01-08 19:03:04 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 19:03:04 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:115 | Tokenizer already exists. Skipping train_tokenizer().
2026-01-08 19:03:04 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.00s) ===
2026-01-08 19:03:04 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 19:03:04 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 19:03:11 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:03:11 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 19:03:11 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'collect_stats' failed after 7.66s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 92, in _call_target
return _target_(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: CommonPreprocessor.__init__() missing 1 required positional argument: 'train'
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 84, in collect_stats
return collect_stats(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 68, in collect_stats
trainer = _build_trainer(cfg)
^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 34, in _build_trainer
lit_model = LitESPnetModel(model, cfg)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 47, in __init__
data_organizer = instantiate(config.dataset)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 226, in instantiate
return instantiate_node(
^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 342, in instantiate_node
value = instantiate_node(
^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 347, in instantiate_node
return _call_target(_target_, partial, args, kwargs, full_key)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 97, in _call_target
raise InstantiationException(msg) from e
hydra.errors.InstantiationException: Error in call to target 'espnet2.train.preprocessor.CommonPreprocessor':
TypeError("CommonPreprocessor.__init__() missing 1 required positional argument: 'train'")
full_key: dataset.preprocessor
2026-01-08 19:05:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T19:05:24.823181 ===
2026-01-08 19:05:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 19:05:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 19:05:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 19:05:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 19:05:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=7fae474de17d0a228f7ebb269ada1103e8014e6c, short_commit=7fae474de1, branch=espnet3/integration_test, worktree=dirty
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
fs: 16000
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 19:05:37 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 19:05:37 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 19:05:37 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 19:05:37 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 19:05:37 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.18s using src.create_dataset.create_dataset
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.18s) ===
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 19:05:37 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:115 | Tokenizer already exists. Skipping train_tokenizer().
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.00s) ===
2026-01-08 19:05:37 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 19:05:37 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 19:05:45 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:05:45 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 19:05:45 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'collect_stats' failed after 7.81s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 92, in _call_target
return _target_(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: CommonPreprocessor.__init__() missing 1 required positional argument: 'train'
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 84, in collect_stats
return collect_stats(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 68, in collect_stats
trainer = _build_trainer(cfg)
^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 34, in _build_trainer
lit_model = LitESPnetModel(model, cfg)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 47, in __init__
data_organizer = instantiate(config.dataset)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 226, in instantiate
return instantiate_node(
^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 342, in instantiate_node
value = instantiate_node(
^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 347, in instantiate_node
return _call_target(_target_, partial, args, kwargs, full_key)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 97, in _call_target
raise InstantiationException(msg) from e
hydra.errors.InstantiationException: Error in call to target 'espnet2.train.preprocessor.CommonPreprocessor':
TypeError("CommonPreprocessor.__init__() missing 1 required positional argument: 'train'")
full_key: dataset.preprocessor
2026-01-08 19:08:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T19:08:11.791472 ===
2026-01-08 19:08:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 19:08:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 19:08:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 19:08:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 19:08:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=7fae474de17d0a228f7ebb269ada1103e8014e6c, short_commit=7fae474de1, branch=espnet3/integration_test, worktree=dirty
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
fs: 16000
train: true
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 19:08:25 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 19:08:25 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 19:08:25 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 19:08:25 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 19:08:25 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.27s using src.create_dataset.create_dataset
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.28s) ===
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 19:08:25 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:115 | Tokenizer already exists. Skipping train_tokenizer().
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.00s) ===
2026-01-08 19:08:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 19:08:25 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 19:08:32 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:08:33 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 19:08:33 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:155 | GPU available: True (cuda), used: True
2026-01-08 19:08:33 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:158 | TPU available: False, using: 0 TPU cores
2026-01-08 19:08:33 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_train_batches=1)` was configured so 1 batch per epoch will be used.
2026-01-08 19:08:33 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_val_batches=1)` was configured so 1 batch will be used.
2026-01-08 19:08:33 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:08:33 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'collect_stats' failed after 8.50s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 84, in collect_stats
return collect_stats(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 69, in collect_stats
trainer.collect_stats()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/training/trainer.py", line 205, in collect_stats
return self.model.collect_stats(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 444, in collect_stats
collect_stats(
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 512, in collect_stats
sum_dict, sq_dict, count_dict = _collect_stats_common(
^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 375, in _collect_stats_common
results = runner(index_batches)
^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 340, in __call__
return self._run_local(indices)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in _run_local
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in <listcomp>
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 328, in forward
return batch_collect_stats(
^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 44, in batch_collect_stats
item = dataset[i]
~~~~~~~^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/dataset.py", line 151, in __getitem__
transformed = self.transforms[i][1](str(idx), transformed)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/preprocessor.py", line 547, in __call__
data = self._speech_process(data)
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/preprocessor.py", line 451, in _speech_process
data[self.speech_name] = self.data_aug(
^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/layers/augmentation.py", line 100, in __call__
if isinstance(effect[1], list):
~~~~~~^^^
IndexError: list index out of range
2026-01-08 19:16:30 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T19:16:30.139396 ===
2026-01-08 19:16:30 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 19:16:30 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 19:16:30 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 19:16:30 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 19:16:30 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=7fae474de17d0a228f7ebb269ada1103e8014e6c, short_commit=7fae474de1, branch=espnet3/integration_test, worktree=dirty
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
_convert_: all
fs: 16000
train: true
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 19:16:43 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 19:16:43 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 19:16:43 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 19:16:43 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 19:16:44 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 19:16:44 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.20s using src.create_dataset.create_dataset
2026-01-08 19:16:44 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.20s) ===
2026-01-08 19:16:44 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 19:16:44 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:115 | Tokenizer already exists. Skipping train_tokenizer().
2026-01-08 19:16:44 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.00s) ===
2026-01-08 19:16:44 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 19:16:44 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 19:16:51 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:16:51 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 19:16:51 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:155 | GPU available: True (cuda), used: True
2026-01-08 19:16:51 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:158 | TPU available: False, using: 0 TPU cores
2026-01-08 19:16:51 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_train_batches=1)` was configured so 1 batch per epoch will be used.
2026-01-08 19:16:51 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_val_batches=1)` was configured so 1 batch will be used.
2026-01-08 19:16:52 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:16:52 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'collect_stats' failed after 8.42s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 84, in collect_stats
return collect_stats(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 69, in collect_stats
trainer.collect_stats()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/training/trainer.py", line 205, in collect_stats
return self.model.collect_stats(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 444, in collect_stats
collect_stats(
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 512, in collect_stats
sum_dict, sq_dict, count_dict = _collect_stats_common(
^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 375, in _collect_stats_common
results = runner(index_batches)
^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 340, in __call__
return self._run_local(indices)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in _run_local
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in <listcomp>
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 328, in forward
return batch_collect_stats(
^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 56, in batch_collect_stats
batch = collate_fn(structured_items)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/collate_fn.py", line 38, in __call__
return common_collate_fn(
^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/collate_fn.py", line 404, in common_collate_fn
if data[0][key].dtype.kind == "i":
^^^^^^^^^^^^^^^^^^
AttributeError: 'str' object has no attribute 'dtype'
2026-01-08 19:22:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T19:22:11.289407 ===
2026-01-08 19:22:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 19:22:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 19:22:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 19:22:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 19:22:11 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=7fae474de17d0a228f7ebb269ada1103e8014e6c, short_commit=7fae474de1, branch=espnet3/integration_test, worktree=dirty
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
_convert_: all
fs: 16000
train: true
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 19:22:24 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 19:22:24 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 19:22:24 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 19:22:24 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 19:22:24 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.18s using src.create_dataset.create_dataset
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.18s) ===
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 19:22:24 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:115 | Tokenizer already exists. Skipping train_tokenizer().
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.00s) ===
2026-01-08 19:22:24 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 19:22:24 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 19:22:32 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:22:33 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 19:22:33 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:155 | GPU available: True (cuda), used: True
2026-01-08 19:22:33 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:158 | TPU available: False, using: 0 TPU cores
2026-01-08 19:22:33 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_train_batches=1)` was configured so 1 batch per epoch will be used.
2026-01-08 19:22:33 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_val_batches=1)` was configured so 1 batch will be used.
2026-01-08 19:22:33 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:22:33 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'collect_stats' failed after 9.07s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 84, in collect_stats
return collect_stats(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 69, in collect_stats
trainer.collect_stats()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/training/trainer.py", line 205, in collect_stats
return self.model.collect_stats(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 444, in collect_stats
collect_stats(
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 512, in collect_stats
sum_dict, sq_dict, count_dict = _collect_stats_common(
^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 375, in _collect_stats_common
results = runner(index_batches)
^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 340, in __call__
return self._run_local(indices)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in _run_local
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/parallel/base_runner.py", line 209, in <listcomp>
return [f(i, **env) for i in tqdm(indices, total=len(indices))]
^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 328, in forward
return batch_collect_stats(
^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/data/collect_stats.py", line 56, in batch_collect_stats
batch = collate_fn(structured_items)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/collate_fn.py", line 38, in __call__
return common_collate_fn(
^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet2/train/collate_fn.py", line 405, in common_collate_fn
if data[0][key].dtype.kind == "i":
^^^^^^^^^^^^^^^^^^
AttributeError: 'str' object has no attribute 'dtype'
2026-01-08 19:24:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:152 | === ESPnet3 run started: 2026-01-08T19:24:25.834405 ===
2026-01-08 19:24:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:153 | Command: /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin/python run.py --train_config conf/train_asr_rnn_data_aug_debug.yaml --infer_config conf/infer.yaml --measure_config conf/measure.yaml
2026-01-08 19:24:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:156 | Working directory: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr
2026-01-08 19:24:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | train config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/train_asr_rnn_data_aug_debug.yaml
2026-01-08 19:24:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | infer config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/infer.yaml
2026-01-08 19:24:25 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:162 | measure config: /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/conf/measure.yaml
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:167 | Git: commit=7fae474de17d0a228f7ebb269ada1103e8014e6c, short_commit=7fae474de1, branch=espnet3/integration_test, worktree=dirty
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:250 | Cluster env:
(none)
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/logging.py:251 | Runtime env:
LD_LIBRARY_PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/lib64:/usr/local/cuda-10.1/lib64
PATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/ffmpeg-release:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/festival/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/MBROLA/Bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/espeak-ng/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/BeamformIt:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kenlm/build/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/PESQ/P862_annex_A_2005_CD/source:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/nkf/nkf-2.1.4:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/tokenizer:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/generic:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/tools/moses/scripts/recaser:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/moses/scripts/training:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/mwerSegmenter:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sctk/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sph2pipe:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/sentencepiece_commands:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/chainbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/featbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/fstbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/gmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/ivectorbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/kwsbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/latbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/lmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnet3bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/nnetbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/online2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/onlinebin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/rnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmm2bin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/sgmmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/tfrnnlmbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudadecoderbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/kaldi/src/cudafeatbin:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/bin:/home/masao/.local/bin:/home/masao/utils:/home/masao/git/SCTK/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/usr/lib/wsl/lib:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/debugCommand:/mnt/c/Users/might/AppData/Roaming/Code/User/globalStorage/github.copilot-chat/copilotCli:/mnt/c/Windows/system32:/mnt/c/Windows:/mnt/c/Windows/System32/Wbem:/mnt/c/Windows/System32/WindowsPowerShell/v1.0/:/mnt/c/Windows/System32/OpenSSH/:/mnt/c/Program Files (x86)/NVIDIA Corporation/PhysX/Common:/mnt/c/Program Files/NVIDIA Corporation/NVIDIA NvDLISR:/mnt/c/Program Files/dotnet/:/mnt/c/Program Files/Git/cmd:/mnt/c/Program Files/Tailscale/:/mnt/c/Users/might/AppData/Local/Microsoft/WindowsApps:/mnt/c/Users/might/AppData/Local/Programs/Microsoft VS Code/bin:/mnt/c/Users/might/AppData/Local/Muse Hub/lib:/mnt/c/Software/cmake-3.30.2-windows-x86_64/bin:/mnt/c/Software/flutter/bin:/mnt/c/Users/might/.vscode/extensions/ms-python.debugpy-2025.18.0-win32-x64/bundled/scripts/noConfigScripts:/snap/bin:/mnt/c/Software/flutter/bin
PYTHONPATH=/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3:/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/RawNet/python/RawNet3/models:../../../:../../TEMPLATE/asr:/mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr:
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:140 | Train config content:
num_device: 1
num_nodes: 1
task: espnet3.systems.asr.task.ASRTask
recipe_dir: .
data_dir: ./data
exp_tag: train_asr_rnn_data_aug_debug
exp_dir: ./exp/train_asr_rnn_data_aug_debug
stats_dir: ./exp/stats
decode_dir: ./exp/train_asr_rnn_data_aug_debug/decode
dataset_dir: ./data/mini_an4
create_dataset:
func: src.create_dataset.create_dataset
dataset_dir: ./data/mini_an4
archive_path: ./../../egs2/mini_an4/asr1/downloads.tar.gz
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
train:
- name: train_nodev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
valid:
- name: train_dev
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/train_dev.tsv
preprocessor:
_target_: espnet2.train.preprocessor.CommonPreprocessor
_convert_: all
fs: 16000
train: true
data_aug_effects:
- - 0.1
- contrast
- enhancement_amount: 75.0
- - 0.1
- highpass
- cutoff_freq: 5000
Q: 0.707
- - 0.1
- equalization
- center_freq: 1000
gain: 0
Q: 0.707
- - 0.1
- - - 0.3
- speed_perturb
- factor: 0.9
- - 0.3
- speed_perturb
- factor: 1.1
- - 0.3
- speed_perturb
- factor: 1.3
data_aug_num:
- 1
- 4
data_aug_prob: 1.0
token_type: bpe
token_list: ./data/bpe_30/tokens.txt
bpemodel: ./data/bpe_30/bpe.model
parallel:
env: local
n_workers: 1
dataloader:
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
train:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: true
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
num_workers: 0
batches:
type: folded
shape_files:
- ./exp/stats/train/feats_shape
batch_size: 2
batch_bins: 200000
valid:
multiple_iterator: false
num_shards: 1
iter_factory:
_target_: espnet2.iterators.sequence_iter_factory.SequenceIterFactory
shuffle: false
collate_fn:
_target_: espnet2.train.collate_fn.CommonCollateFn
int_pad_value: -1
batches:
type: folded
shape_files:
- ./exp/stats/valid/feats_shape
batch_size: 2
batch_bins: 200000
optim:
_target_: torch.optim.Adam
lr: 0.001
weight_decay: 0.0
scheduler:
_target_: torch.optim.lr_scheduler.ReduceLROnPlateau
mode: min
factor: 0.5
patience: 1
val_scheduler_criterion:
- valid
- loss
best_model_criterion:
- - valid/acc
- 1
- max
trainer:
accelerator: auto
devices: 1
num_nodes: 1
accumulate_grad_batches: 1
check_val_every_n_epoch: 1
gradient_clip_val: 1.0
log_every_n_steps: 1
max_epochs: 1
limit_train_batches: 1
limit_val_batches: 1
precision: 32
logger:
- _target_: lightning.pytorch.loggers.TensorBoardLogger
save_dir: ./exp/train_asr_rnn_data_aug_debug/tensorboard
name: tb_logger
strategy: auto
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
text_builder:
func: src.tokenizer.gather_training_text
manifest_path: ./data/mini_an4/manifest/train_nodev.tsv
model:
vocab_size: 30
token_list: ./data/bpe_30/tokens.txt
encoder: vgg_rnn
encoder_conf:
num_layers: 1
hidden_size: 2
output_size: 2
decoder: rnn
decoder_conf:
hidden_size: 2
normalize: utterance_mvn
normalize_conf: {}
model_conf:
ctc_weight: 0.3
lsm_weight: 0.1
length_normalized_loss: false
frontend: default
frontend_conf:
n_fft: 512
win_length: 400
hop_length: 160
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:144 | Infer config content:
num_device: 1
num_nodes: 1
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
parallel:
env: local
n_workers: 1
model:
_target_: espnet2.bin.asr_inference.Speech2Text
asr_train_config: ./exp/mini_an4_asr_debug/config.yaml
asr_model_file: ./exp/mini_an4_asr_debug/last.ckpt
beam_size: 1
ctc_weight: 0.3
tokenizer:
vocab_size: 30
character_coverage: 1.0
model_type: bpe
save_path: ./data/bpe_30
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:148 | Measure config content:
recipe_dir: .
data_dir: ./data
exp_tag: mini_an4_asr_debug
exp_dir: ./exp/mini_an4_asr_debug
stats_dir: ./exp/stats
decode_dir: ./exp/mini_an4_asr_debug/decode
dataset_dir: ./data/mini_an4
dataset:
_target_: espnet3.components.data.data_organizer.DataOrganizer
test:
- name: test
dataset:
_target_: src.dataset.MiniAN4Dataset
manifest_path: ./data/mini_an4/manifest/test.tsv
metrics:
- metric:
_target_: espnet3.systems.asr.metrics.wer.WER
clean_types: null
- metric:
_target_: espnet3.systems.asr.metrics.cer.CER
clean_types: null
2026-01-08 19:24:38 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:49 | Initialized ASRSystem with train_config=True infer_config=True measure_config=True exp_dir=exp/train_asr_rnn_data_aug_debug
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:165 | System: ASRSystem
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:166 | Requested stages: ['all']
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/TEMPLATE/asr/run.py:167 | Resolved stages: ['create_dataset', 'train_tokenizer', 'collect_stats', 'train', 'infer', 'measure', 'publish']
2026-01-08 19:24:38 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: create_dataset ===
2026-01-08 19:24:38 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:54 | ASRSystem.create_dataset(): starting dataset creation process
2026-01-08 19:24:38 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:63 | Creating dataset with function src.create_dataset.create_dataset
2026-01-08 19:24:39 | INFO | src.create_dataset | /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/src/create_dataset.py:148 | Prepared Mini AN4 manifests under data/mini_an4/manifest
2026-01-08 19:24:39 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:65 | Dataset creation completed in 0.16s using src.create_dataset.create_dataset
2026-01-08 19:24:39 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: create_dataset (0.16s) ===
2026-01-08 19:24:39 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train_tokenizer ===
2026-01-08 19:24:39 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:115 | Tokenizer already exists. Skipping train_tokenizer().
2026-01-08 19:24:39 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: train_tokenizer (0.00s) ===
2026-01-08 19:24:39 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: collect_stats ===
2026-01-08 19:24:39 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:79 | Collecting stats | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 19:24:46 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:24:46 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 19:24:46 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:155 | GPU available: True (cuda), used: True
2026-01-08 19:24:46 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:158 | TPU available: False, using: 0 TPU cores
2026-01-08 19:24:46 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_train_batches=1)` was configured so 1 batch per epoch will be used.
2026-01-08 19:24:46 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_val_batches=1)` was configured so 1 batch will be used.
2026-01-08 19:24:47 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:24:48 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:24:48 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:70 | Collect stats finished in 9.26s | exp_dir=./exp/train_asr_rnn_data_aug_debug stats_dir=./exp/stats
2026-01-08 19:24:48 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:74 | === [DONE] stage: collect_stats (9.27s) ===
2026-01-08 19:24:48 | INFO | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:59 | === [START] stage: train ===
2026-01-08 19:24:48 | INFO | espnet3.systems.asr.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py:82 | ASRSystem.train(): starting training process
2026-01-08 19:24:48 | INFO | espnet3.systems.base.system | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py:94 | Training start | exp_dir=./exp/train_asr_rnn_data_aug_debug model=<unknown>
2026-01-08 19:24:48 | INFO | root | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/task.py:548 | Vocabulary size: 30
2026-01-08 19:24:48 | INFO | espnet3.systems.base.train | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py:33 | Model:
ESPnetASRModel(
(frontend): DefaultFrontend(
(stft): Stft(n_fft=512, win_length=400, hop_length=160, center=True, normalized=False, onesided=True)
(frontend): Frontend()
(logmel): LogMel(sr=16000, n_fft=512, n_mels=80, fmin=0, fmax=8000.0, htk=False)
)
(normalize): UtteranceMVN(norm_means=True, norm_vars=False)
(encoder): VGGRNNEncoder(
(enc): ModuleList(
(0): VGG2L(
(conv1_1): Conv2d(1, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv1_2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_1): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2_2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
(1): RNNP(
(birnn0): LSTM(2560, 2, batch_first=True, bidirectional=True)
(bt0): Linear(in_features=4, out_features=2, bias=True)
)
)
)
(decoder): RNNDecoder(
(embed): Embedding(30, 2)
(dropout_emb): Dropout(p=0.0, inplace=False)
(decoder): ModuleList(
(0): LSTMCell(4, 2)
)
(dropout_dec): ModuleList(
(0): Dropout(p=0.0, inplace=False)
)
(output): Linear(in_features=2, out_features=30, bias=True)
(att_list): ModuleList(
(0): AttLoc(
(mlp_enc): Linear(in_features=2, out_features=320, bias=True)
(mlp_dec): Linear(in_features=2, out_features=320, bias=False)
(mlp_att): Linear(in_features=10, out_features=320, bias=False)
(loc_conv): Conv2d(1, 10, kernel_size=(1, 201), stride=(1, 1), padding=(0, 100), bias=False)
(gvec): Linear(in_features=320, out_features=1, bias=True)
)
)
)
(criterion_att): LabelSmoothingLoss(
(criterion): KLDivLoss()
)
(ctc): CTC(
(ctc_lo): Linear(in_features=2, out_features=30, bias=True)
(ctc_loss): CTCLoss()
)
)
2026-01-08 19:24:48 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:155 | GPU available: True (cuda), used: True
2026-01-08 19:24:48 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:158 | TPU available: False, using: 0 TPU cores
2026-01-08 19:24:48 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_train_batches=1)` was configured so 1 batch per epoch will be used.
2026-01-08 19:24:48 | INFO | lightning.pytorch.utilities.rank_zero | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/setup.py:105 | `Trainer(limit_val_batches=1)` was configured so 1 batch will be used.
2026-01-08 19:24:52 | WARNING | py.warnings | /home/masao/.local/share/uv/python/cpython-3.11.13-linux-x86_64-gnu/lib/python3.11/warnings.py:110 | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/callbacks/model_checkpoint.py:751: Checkpoint directory /mnt/c/Users/might/Documents/git/espnets/espnet3/egs3/mini_an4/asr/exp/train_asr_rnn_data_aug_debug exists and is not empty.
2026-01-08 19:24:52 | INFO | lightning.pytorch.accelerators.cuda | /mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/accelerators/cuda.py:61 | LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0]
2026-01-08 19:24:52 | ERROR | espnet3 | /mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py:70 | Stage 'train' failed after 3.81s
Traceback (most recent call last):
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/utils/stages.py", line 61, in run_stages
fn()
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/asr/system.py", line 93, in train
return super().train()
^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/system.py", line 99, in train
return train(self.train_config)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/systems/base/train.py", line 101, in train
trainer.fit(**fit_kwargs)
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/training/trainer.py", line 175, in fit
self.trainer.fit(
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/trainer.py", line 560, in fit
call._call_and_handle_interrupt(
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/call.py", line 49, in _call_and_handle_interrupt
return trainer_fn(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/trainer.py", line 598, in _fit_impl
self._run(model, ckpt_path=ckpt_path)
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/trainer.py", line 987, in _run
self.strategy.setup(self)
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/strategies/strategy.py", line 159, in setup
self.setup_optimizers(trainer)
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/strategies/strategy.py", line 139, in setup_optimizers
self.optimizers, self.lr_scheduler_configs = _init_optimizers_and_lr_schedulers(self.lightning_module)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/core/optimizer.py", line 180, in _init_optimizers_and_lr_schedulers
optim_conf = call._call_lightning_module_hook(model.trainer, "configure_optimizers", pl_module=model)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/tools/.venv/lib/python3.11/site-packages/lightning/pytorch/trainer/call.py", line 177, in _call_lightning_module_hook
output = fn(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 367, in configure_optimizers
lr_scheduler = _build_lr_scheduler_config(scheduler)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 358, in _build_lr_scheduler_config
monitor = _get_val_scheduler_monitor()
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/c/Users/might/Documents/git/espnets/espnet3/espnet3/components/modeling/model.py", line 250, in _get_val_scheduler_monitor
raise ValueError("val_scheduler_criterion must be a string like 'valid/loss'")
ValueError: val_scheduler_criterion must be a string like 'valid/loss'