FW-merged / README.md
hmarkc's picture
Add model card (#1)
17d746d verified
metadata
license: apache-2.0
library_name: transformers
pipeline_tag: text-classification

FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization

This repository contains the Roberta model checkpoints resulting from applying Frank-Wolfe merging, as described in FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization.

FW-Merging frames large-scale model merging as a constrained optimization problem. Fine-tuned checkpoints define the constraint set, while the objective dictates the desired properties of the merged model. It is designed to be robust to irrelevant models and effectively utilize relevant models for improved performance.

The merged model checkpoints can be found at: https://huggingface.co/hmarkc/FW-merged/tree/main/roberta

The code for merging the model and further details can be found at: https://github.com/hmarkc/FW-merged