Buckets:
MergeModelCallback[[trl.experimental.merge_model_callback.MergeModelCallback]]
trl.experimental.merge_model_callback.MergeModelCallback[[trl.experimental.merge_model_callback.MergeModelCallback]]
A TrainerCallback that merges the policy model (the model being trained) with another model based on a merge configuration.
Example:
from trl.experimental.merge_model_callback import MergeConfig, MergeModelCallback
config = MergeConfig()
merge_callback = MergeModelCallback(config)
trainer = DPOTrainer(..., callbacks=[merge_callback])
Parameters:
merge_config (experimental.merge_model_callback.MergeConfig, optional) : Configuration used for the merging process. If not provided, the default MergeConfig is used.
merge_at_every_checkpoint (bool, optional, defaults to False) : Whether to merge the model at every checkpoint.
push_to_hub (bool, optional, defaults to False) : Whether to push the merged model to the Hub after merging.
Xet Storage Details
- Size:
- 1.19 kB
- Xet hash:
- 9b1c69228a8e0cde47d95d2899a51a84995490838929a25b5eba48916b073f7d
·
Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.