Buckets:
MergeModelCallback[[trl.experimental.merge_model_callback.MergeModelCallback]]
trl.experimental.merge_model_callback.MergeModelCallback[[trl.experimental.merge_model_callback.MergeModelCallback]]
A TrainerCallback that merges the policy model (the model being trained) with another model based on a merge configuration.
Example:
from trl.experimental.merge_model_callback import MergeConfig, MergeModelCallback
config = MergeConfig()
merge_callback = MergeModelCallback(config)
trainer = DPOTrainer(..., callbacks=[merge_callback])
Parameters:
merge_config (experimental.merge_model_callback.MergeConfig, optional) : Configuration used for the merging process. If not provided, the default MergeConfig is used.
merge_at_every_checkpoint (bool, optional, defaults to False) : Whether to merge the model at every checkpoint.
push_to_hub (bool, optional, defaults to False) : Whether to push the merged model to the Hub after merging.
Xet Storage Details
- Size:
- 1.19 kB
- Xet hash:
- 19ae3eca84d295d3532847942a3672f2c470850fad9802b9b68cd90692ce4ecb
·
Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.