Tyl3rDrden's picture
Upload folder using huggingface_hub
a67cb12 verified
[NeMo W 2026-04-09 13:10:29 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/pydub/utils.py:170: RuntimeWarning: Couldn't find ffmpeg or avconv - defaulting to ffmpeg, but may not work
warn("Couldn't find ffmpeg or avconv - defaulting to ffmpeg, but may not work", RuntimeWarning)
[NeMo W 2026-04-09 13:10:30 exp_manager:1177] No version folders would be created under the log folder as 'resume_if_exists' is enabled.
[NeMo W 2026-04-09 13:10:30 exp_manager:1022] There were no checkpoints found in checkpoint_dir or no checkpoint folder at checkpoint_dir :/workspace/nemo_experiments/Speech_To_Text_Phase2/checkpoints. Training from scratch.
[NeMo W 2026-04-09 13:10:30 exp_manager:1413] The checkpoint callback was told to monitor a validation value and trainer's max_steps was set to 50000. Please ensure that max_steps will run for at least 1 epochs to ensure that checkpointing will not error out.
[NeMo W 2026-04-09 13:11:35 modelPT:188] If you intend to do training or fine-tuning, please call the ModelPT.setup_training_data() method and provide a valid configuration file to setup the train data loader.
Train config :
use_lhotse: true
skip_missing_manifest_entries: true
input_cfg: null
tarred_audio_filepaths: null
manifest_filepath: null
sample_rate: 16000
shuffle: true
num_workers: 2
pin_memory: true
max_duration: 10.0
min_duration: 1.0
text_field: answer
batch_duration: null
max_tps: null
use_bucketing: true
bucket_duration_bins: null
bucket_batch_size: null
num_buckets: 30
bucket_buffer_size: 20000
shuffle_buffer_size: 10000
[NeMo W 2026-04-09 13:11:35 modelPT:195] If you intend to do validation, please call the ModelPT.setup_validation_data() or ModelPT.setup_multiple_validation_data() method and provide a valid configuration file to setup the validation data loader(s).
Validation config :
use_lhotse: true
manifest_filepath: null
sample_rate: 16000
batch_size: 16
shuffle: false
max_duration: 40.0
min_duration: 0.1
num_workers: 2
pin_memory: true
text_field: answer
[NeMo W 2026-04-09 13:11:39 modelPT:300] You tried to register an artifact under config key=tokenizer.model_path but an artifact for it has already been registered.
[NeMo W 2026-04-09 13:11:39 modelPT:300] You tried to register an artifact under config key=tokenizer.vocab_path but an artifact for it has already been registered.
[NeMo W 2026-04-09 13:11:39 modelPT:300] You tried to register an artifact under config key=tokenizer.spe_tokenizer_vocab but an artifact for it has already been registered.
[NeMo W 2026-04-09 13:11:44 speech_to_text_finetune:167] The vocabulary size of the new tokenizer differs from that of the loaded model. As a result, finetuning will proceed with the new vocabulary, and the decoder will be reinitialized.
[NeMo W 2026-04-09 13:11:46 dataloader:826] The following configuration keys are ignored by Lhotse dataloader: is_tarred,allow_missing_data
[NeMo W 2026-04-09 13:11:48 dataloader:826] The following configuration keys are ignored by Lhotse dataloader: is_tarred,allow_missing_data
[NeMo W 2026-04-09 13:11:57 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: Grid size 4 will likely result in GPU under-utilization due to low occupancy.
warn(NumbaPerformanceWarning(msg))
[NeMo W 2026-04-09 13:11:57 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: Grid size 4 will likely result in GPU under-utilization due to low occupancy.
warn(NumbaPerformanceWarning(msg))
[NeMo W 2026-04-09 13:11:57 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: Grid size 1 will likely result in GPU under-utilization due to low occupancy.
warn(NumbaPerformanceWarning(msg))
[NeMo W 2026-04-09 13:11:58 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: Grid size 4 will likely result in GPU under-utilization due to low occupancy.
warn(NumbaPerformanceWarning(msg))
[NeMo W 2026-04-09 13:11:58 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: Grid size 1 will likely result in GPU under-utilization due to low occupancy.
warn(NumbaPerformanceWarning(msg))