| # DeepSpeed | |
| [DeepSpeed](https://github.com/deepspeedai/DeepSpeed), powered by Zero Redundancy Optimizer (ZeRO), is an optimization library for training and fitting very large models onto a GPU. It is available in several ZeRO stages, where each stage progressively saves more GPU memory by partitioning the optimizer state, gradients, parameters, and enabling offloading to a CPU or NVMe. DeepSpeed is integrated with the [Trainer](/docs/transformers/pr_26617/en/main_classes/trainer#transformers.Trainer) class and most of the setup is automatically taken care of for you. | |
| However, if you want to use DeepSpeed without the [Trainer](/docs/transformers/pr_26617/en/main_classes/trainer#transformers.Trainer), Transformers provides a `HfDeepSpeedConfig` class. | |
| Learn more about using DeepSpeed with [Trainer](/docs/transformers/pr_26617/en/main_classes/trainer#transformers.Trainer) in the [DeepSpeed](../deepspeed) guide. | |
| ## HfDeepSpeedConfig[[transformers.integrations.HfDeepSpeedConfig]] | |
| #### transformers.integrations.HfDeepSpeedConfig[[transformers.integrations.HfDeepSpeedConfig]] | |
| [Source](https://github.com/huggingface/transformers/blob/vr_26617/src/transformers/integrations/deepspeed.py#L57) | |
| This object contains a DeepSpeed configuration dictionary and can be quickly queried for things like the ZeRO stage. | |
| A `weakref` of this object is stored in the module's globals so that the config can be accessed from places where | |
| objects such as the Trainer are not available (e.g. `from_pretrained` and `_get_resized_embeddings`). Therefore | |
| it's important that this object remains alive while the program is still running. | |
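Why the object has to stay alive can be illustrated with a small standard-library sketch. `SketchConfig`, `_config_ref`, and `current_config` are hypothetical names made up for this example; they only imitate the weak-reference pattern described above and are not the Transformers implementation.

```python
import gc
import weakref

# Hypothetical module-level slot, standing in for the global mentioned above.
_config_ref = None


class SketchConfig:
    """Hypothetical stand-in for a config object; not the Transformers class."""

    def __init__(self, config):
        global _config_ref
        self.config = config
        # Only a weak reference is stored, so the global does not keep the object alive.
        _config_ref = weakref.ref(self)


def current_config():
    """Return the live config object, or None if it has been garbage collected."""
    return _config_ref() if _config_ref is not None else None


cfg = SketchConfig({"zero_optimization": {"stage": 3}})
alive_while_referenced = current_config() is cfg

del cfg        # drop the only strong reference
gc.collect()   # make collection explicit on interpreters without reference counting
gone_after_delete = current_config() is None
```

Once the last strong reference is dropped, the lookup returns `None`, which is why the config object should be bound to a variable that outlives any code that needs to see it.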
| [Trainer](/docs/transformers/pr_26617/en/main_classes/trainer#transformers.Trainer) uses the `HfTrainerDeepSpeedConfig` subclass instead. That subclass has logic to sync the configuration | |
| with values of [TrainingArguments](/docs/transformers/pr_26617/en/main_classes/trainer#transformers.TrainingArguments) by replacing the special placeholder value `"auto"`. Without this logic, | |
| the DeepSpeed configuration is not modified in any way. | |
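The placeholder substitution described above could look roughly like the following. This is a minimal sketch, not the subclass's actual code: `SketchArgs` and `fill_auto` are hypothetical, and the pairing of argument names with config keys is chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class SketchArgs:
    """Hypothetical stand-in for a few training arguments."""
    learning_rate: float = 3e-5
    per_device_train_batch_size: int = 8


def fill_auto(config, key_path, value):
    """Set config[key_path] to value only if it currently holds the "auto" placeholder."""
    *parents, leaf = key_path.split(".")
    node = config
    for name in parents:
        node = node.get(name)
        if not isinstance(node, dict):
            return  # path is absent from this config, so there is nothing to fill
    if node.get(leaf) == "auto":
        node[leaf] = value


ds_config = {
    "train_micro_batch_size_per_gpu": "auto",
    "optimizer": {"type": "AdamW", "params": {"lr": "auto", "weight_decay": 0.01}},
}
args = SketchArgs()
fill_auto(ds_config, "train_micro_batch_size_per_gpu", args.per_device_train_batch_size)
fill_auto(ds_config, "optimizer.params.lr", args.learning_rate)
fill_auto(ds_config, "optimizer.params.weight_decay", 0.1)  # explicit value is left alone
```

Values the user set explicitly are never overwritten; only entries equal to `"auto"` are replaced.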
| **Parameters:** | |
| config_file_or_dict (`Union[str, Dict]`) : Path to a DeepSpeed config file, or the configuration as a dict. | |
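To make the "path or dict" parameter concrete, here is a hedged sketch of how such an argument might be normalized and then queried for the ZeRO stage. `load_ds_config` and `get_value` are hypothetical helpers written for this example, and the sketch assumes the config file is JSON.

```python
import json
import os
import tempfile
from copy import deepcopy


def load_ds_config(config_file_or_dict):
    """Return a config dict from either a dict or a path to a JSON file."""
    if isinstance(config_file_or_dict, dict):
        # Copy so that later changes do not leak back into the caller's dict.
        return deepcopy(config_file_or_dict)
    with open(config_file_or_dict, encoding="utf-8") as f:
        return json.load(f)


def get_value(config, dotted_key, default=None):
    """Look up a nested value such as "zero_optimization.stage"."""
    node = config
    for name in dotted_key.split("."):
        if not isinstance(node, dict) or name not in node:
            return default
        node = node[name]
    return node


from_dict = load_ds_config({"zero_optimization": {"stage": 3}})

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "ds_config.json")
    with open(path, "w", encoding="utf-8") as f:
        json.dump({"zero_optimization": {"stage": 2}}, f)
    from_file = load_ds_config(path)

stage_from_dict = get_value(from_dict, "zero_optimization.stage")
stage_from_file = get_value(from_file, "zero_optimization.stage")
missing = get_value(from_dict, "fp16.enabled", default=False)
```

Both input forms end up as the same kind of dictionary, so the rest of the code can query it without caring where it came from.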