Upload folder using huggingface_hub

a67cb12 verified 10 days ago

4.59 kB

	[NeMo W 2026-04-09 13:10:29 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/pydub/utils.py:170: RuntimeWarning: Couldn't find ffmpeg or avconv - defaulting to ffmpeg, but may not work
	warn("Couldn't find ffmpeg or avconv - defaulting to ffmpeg, but may not work", RuntimeWarning)

	[NeMo W 2026-04-09 13:10:30 exp_manager:1177] No version folders would be created under the log folder as 'resume_if_exists' is enabled.
	[NeMo W 2026-04-09 13:10:30 exp_manager:1022] There were no checkpoints found in checkpoint_dir or no checkpoint folder at checkpoint_dir :/workspace/nemo_experiments/Speech_To_Text_Phase2/checkpoints. Training from scratch.
	[NeMo W 2026-04-09 13:10:30 exp_manager:1413] The checkpoint callback was told to monitor a validation value and trainer's max_steps was set to 50000. Please ensure that max_steps will run for at least 1 epochs to ensure that checkpointing will not error out.
	[NeMo W 2026-04-09 13:11:35 modelPT:188] If you intend to do training or fine-tuning, please call the ModelPT.setup_training_data() method and provide a valid configuration file to setup the train data loader.
	Train config :
	use_lhotse: true
	skip_missing_manifest_entries: true
	input_cfg: null
	tarred_audio_filepaths: null
	manifest_filepath: null
	sample_rate: 16000
	shuffle: true
	num_workers: 2
	pin_memory: true
	max_duration: 10.0
	min_duration: 1.0
	text_field: answer
	batch_duration: null
	max_tps: null
	use_bucketing: true
	bucket_duration_bins: null
	bucket_batch_size: null
	num_buckets: 30
	bucket_buffer_size: 20000
	shuffle_buffer_size: 10000

	[NeMo W 2026-04-09 13:11:35 modelPT:195] If you intend to do validation, please call the ModelPT.setup_validation_data() or ModelPT.setup_multiple_validation_data() method and provide a valid configuration file to setup the validation data loader(s).
	Validation config :
	use_lhotse: true
	manifest_filepath: null
	sample_rate: 16000
	batch_size: 16
	shuffle: false
	max_duration: 40.0
	min_duration: 0.1
	num_workers: 2
	pin_memory: true
	text_field: answer

	[NeMo W 2026-04-09 13:11:39 modelPT:300] You tried to register an artifact under config key=tokenizer.model_path but an artifact for it has already been registered.
	[NeMo W 2026-04-09 13:11:39 modelPT:300] You tried to register an artifact under config key=tokenizer.vocab_path but an artifact for it has already been registered.
	[NeMo W 2026-04-09 13:11:39 modelPT:300] You tried to register an artifact under config key=tokenizer.spe_tokenizer_vocab but an artifact for it has already been registered.
	[NeMo W 2026-04-09 13:11:44 speech_to_text_finetune:167] The vocabulary size of the new tokenizer differs from that of the loaded model. As a result, finetuning will proceed with the new vocabulary, and the decoder will be reinitialized.
	[NeMo W 2026-04-09 13:11:46 dataloader:826] The following configuration keys are ignored by Lhotse dataloader: is_tarred,allow_missing_data
	[NeMo W 2026-04-09 13:11:48 dataloader:826] The following configuration keys are ignored by Lhotse dataloader: is_tarred,allow_missing_data
	[NeMo W 2026-04-09 13:11:57 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: [1mGrid size 4 will likely result in GPU under-utilization due to low occupancy.[0m
	warn(NumbaPerformanceWarning(msg))

	[NeMo W 2026-04-09 13:11:57 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: [1mGrid size 4 will likely result in GPU under-utilization due to low occupancy.[0m
	warn(NumbaPerformanceWarning(msg))

	[NeMo W 2026-04-09 13:11:57 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: [1mGrid size 1 will likely result in GPU under-utilization due to low occupancy.[0m
	warn(NumbaPerformanceWarning(msg))

	[NeMo W 2026-04-09 13:11:58 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: [1mGrid size 4 will likely result in GPU under-utilization due to low occupancy.[0m
	warn(NumbaPerformanceWarning(msg))

	[NeMo W 2026-04-09 13:11:58 nemo_logging:364] /opt/venv/lib/python3.12/site-packages/numba_cuda/numba/cuda/dispatcher.py:685: NumbaPerformanceWarning: [1mGrid size 1 will likely result in GPU under-utilization due to low occupancy.[0m
	warn(NumbaPerformanceWarning(msg))