Buckets:

hf-doc-build
/

doc-dev

Files

xet

hf-doc-build/doc-dev / optimum-neuron /pr_1097 /en /training_api /trainer.md

rtrm

about 1 month ago

preview code

download

raw

11.7 kB

	# NeuronTrainer

	Training classes for AWS Trainium accelerators.

	## NeuronTrainingArguments[[optimum.neuron.NeuronTrainingArguments]]

	#### optimum.neuron.NeuronTrainingArguments[[optimum.neuron.NeuronTrainingArguments]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/training_args.py#L49)

	get_process_log_leveloptimum.neuron.NeuronTrainingArguments.get_process_log_levelhttps://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/training_args.py#L761[]

	Returns the log level to be used depending on whether this process is the main process of node 0, main process
	of node non-0, or a non-main process.

	For the main process the log level defaults to the logging level set (`logging.WARNING` if you didn't do
	anything) unless overridden by `log_level` argument.

	For the replica processes the log level defaults to `logging.WARNING` unless overridden by `log_level_replica`
	argument.

	The choice between the main and replica process settings is made according to the return value of `should_log`.
	#### get_warmup_steps[[optimum.neuron.NeuronTrainingArguments.get_warmup_steps]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/training_args.py#L783)

	Get number of steps used for a linear warmup.
	#### to_dict[[optimum.neuron.NeuronTrainingArguments.to_dict]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/training_args.py#L803)

	Serializes this instance while replace `Enum` by their values (for JSON serialization support). It obfuscates
	the token values by removing their value.
	#### to_json_string[[optimum.neuron.NeuronTrainingArguments.to_json_string]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/training_args.py#L830)

	Serializes this instance to a JSON string.
	#### to_sanitized_dict[[optimum.neuron.NeuronTrainingArguments.to_sanitized_dict]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/training_args.py#L836)

	Sanitized serialization to use with TensorBoard’s hparams

	## NeuronTrainer[[optimum.neuron.NeuronTrainer]]

	#### optimum.neuron.NeuronTrainer[[optimum.neuron.NeuronTrainer]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L119)

	add_callbackoptimum.neuron.NeuronTrainer.add_callbackhttps://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L375[{"name": "callback", "val": ": typing.Union[typing.Type[transformers.trainer_callback.TrainerCallback], transformers.trainer_callback.TrainerCallback]"}]- callback (`Type[TrainerCallback] \| TrainerCallback`) --
	A `TrainerCallback` class or an instance of a `TrainerCallback`. In the
	first case, will instantiate a member of that class.0

	Add a callback to the current list of `TrainerCallback`.

	Parameters:

	callback (`Type[TrainerCallback] \| TrainerCallback`) : A `TrainerCallback` class or an instance of a `TrainerCallback`. In the first case, will instantiate a member of that class.
	#### autocast_smart_context_manager[[optimum.neuron.NeuronTrainer.autocast_smart_context_manager]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L743)

	A helper wrapper that creates an appropriate context manager for `autocast` while feeding it the desired
	arguments, depending on the situation.
	#### create_accelerator_and_postprocess[[optimum.neuron.NeuronTrainer.create_accelerator_and_postprocess]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L291)

	Creates NeuronAccelerator instance and prepares model for distributed training.
	#### create_optimizer[[optimum.neuron.NeuronTrainer.create_optimizer]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L569)

	Setup the optimizer.

	We provide a reasonable default that works well. If you want to use something else, you can pass a tuple in the
	NeuronTrainer's init through `optimizers`, or subclass and override this method in a subclass.
	#### create_optimizer_and_scheduler[[optimum.neuron.NeuronTrainer.create_optimizer_and_scheduler]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L547)

	Setup the optimizer and the learning rate scheduler.

	We provide a reasonable default that works well. If you want to use something else, you can pass a tuple in the
	NeuronTrainer's init through `optimizers`, or subclass and override this method (or `create_optimizer` and/or
	`create_scheduler`) in a subclass.
	#### create_scheduler[[optimum.neuron.NeuronTrainer.create_scheduler]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L683)

	Setup the scheduler. The optimizer of the trainer must have been set up either before this method is called or
	passed as an argument.

	Parameters:

	num_training_steps (int) : The number of training steps to do.
	#### get_decay_parameter_names[[optimum.neuron.NeuronTrainer.get_decay_parameter_names]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L558)

	Get all parameter names that weight decay will be applied to.

	This function filters out parameters in two ways:
	1. By layer type (instances of layers specified in ALL_LAYERNORM_LAYERS)
	2. By parameter name patterns (containing 'bias', 'layernorm', or 'rmsnorm')
	#### get_learning_rates[[optimum.neuron.NeuronTrainer.get_learning_rates]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L626)

	Returns the learning rate of each parameter from self.optimizer.
	#### get_num_trainable_parameters[[optimum.neuron.NeuronTrainer.get_num_trainable_parameters]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L620)

	Get the number of trainable parameters.
	#### get_optimizer_cls_and_kwargs[[optimum.neuron.NeuronTrainer.get_optimizer_cls_and_kwargs]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L650)

	Returns the optimizer class and optimizer parameters based on the training arguments.

	Parameters:

	args (`transformers.training_args.TrainingArguments`) : The training arguments for the training session.
	#### get_optimizer_group[[optimum.neuron.NeuronTrainer.get_optimizer_group]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L634)

	Returns optimizer group for a parameter if given, else returns all optimizer groups for params.

	Parameters:

	param (`str \| torch.nn.parameter.Parameter \| None`, defaults to `None`) : The parameter for which optimizer group needs to be returned.
	#### get_train_dataloader[[optimum.neuron.NeuronTrainer.get_train_dataloader]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L528)

	Returns the training DataLoader with appropriate sampler and batch size.
	#### is_local_process_zero[[optimum.neuron.NeuronTrainer.is_local_process_zero]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L1251)

	Whether or not this process is the local (e.g., on one machine if training in a distributed fashion on several
	machines) main process.
	#### is_world_process_zero[[optimum.neuron.NeuronTrainer.is_world_process_zero]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L1258)

	Whether or not this process is the global main process (when training in a distributed fashion on several
	machines, this is only going to be `True` for one process).
	#### log[[optimum.neuron.NeuronTrainer.log]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L1318)

	Log training metrics to the state history and callbacks.
	#### maybe_log_train_step_metrics[[optimum.neuron.NeuronTrainer.maybe_log_train_step_metrics]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L1028)

	Log training step metrics if logging is due.
	#### maybe_save_checkpoint[[optimum.neuron.NeuronTrainer.maybe_save_checkpoint]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L1080)

	Save checkpoint if saving is due.
	#### num_examples[[optimum.neuron.NeuronTrainer.num_examples]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L703)

	Helper to get number of samples in a `~torch.utils.data.DataLoader` by accessing its dataset. When
	dataloader.dataset does not exist or has no length, estimates as best it can
	#### num_tokens[[optimum.neuron.NeuronTrainer.num_tokens]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L717)

	Helper to get number of tokens in a `~torch.utils.data.DataLoader` by enumerating dataloader.
	#### pop_callback[[optimum.neuron.NeuronTrainer.pop_callback]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L386)

	Remove a callback from the current list of `TrainerCallback` and returns it.

	If the callback is not found, returns `None` (and no error is raised).

	Parameters:

	callback (`Type[TrainerCallback] \| TrainerCallback`) : A `TrainerCallback` class or an instance of a `TrainerCallback`. In the first case, will pop the first member of that class found in the list of callbacks.

	Returns:

	``TrainerCallback \| None``

	The callback removed, if found.
	#### remove_callback[[optimum.neuron.NeuronTrainer.remove_callback]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L402)

	Remove a callback from the current list of `TrainerCallback`.

	Parameters:

	callback (`Type[TrainerCallback] \| TrainerCallback`) : A `TrainerCallback` class or an instance of a `TrainerCallback`. In the first case, will remove the first member of that class found in the list of callbacks.
	#### report_and_save_summary_metrics[[optimum.neuron.NeuronTrainer.report_and_save_summary_metrics]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L1326)

	Report and save comprehensive training summary metrics at the end of training.
	#### set_initial_training_values[[optimum.neuron.NeuronTrainer.set_initial_training_values]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L754)

	Calculates and returns the following values:
	- `num_train_epochs`
	- `num_update_steps_per_epoch`
	- `num_examples`
	- `num_train_samples`
	- `epoch_based`
	- `len_dataloader`
	- `max_steps`
	#### setup_training[[optimum.neuron.NeuronTrainer.setup_training]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L818)

	Setup everything to prepare for the training loop.
	This methods does not return anything but initializes many attributes of the class for training.
	#### train[[optimum.neuron.NeuronTrainer.train]]

	[Source](https://github.com/huggingface/optimum-neuron/blob/vr_1097/optimum/neuron/trainers/transformers.py#L1240)

	Main training entry point.
	Wraps around `self._train()` to handle cache synchronization.

Xet Storage Details

Size:: 11.7 kB
Xet hash:: a021789aa106f29ce46ef8a773e02ada6a6d52c55d491dfcf8104e43c6b7a518

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.