YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Combined Task Vector Model

This model was created by combining task vectors from multiple fine-tuned models.

Task Vector Computation

t_1 = TaskVector("Qwen/Qwen2.5-Coder-7B-Instruct", "/lustre10/scratch/tkwang/SecSteer/axolotl-outputs/lora-merged/plus-step-90")
t_2 = TaskVector("Qwen/Qwen2.5-Coder-7B-Instruct", "/lustre10/scratch/tkwang/SecSteer/axolotl-outputs/lora-merged/minus-step-90")
t_2 = TaskVector("Qwen/Qwen2.5-Coder-7B-Instruct", "None")
t_combined = 1.0 * t_1 + -1.0 * t_2
new_model = t_combined.apply_to("Qwen/Qwen2.5-Coder-7B-Instruct", scaling_coef=0.1)

Models Used

Technical Details

  • Creation Script Git Hash: fb62f919e9796b294f1ffb6297b05d11fa945ac0
  • Task Vector Method: Additive combination
  • Args: { "pretrained_model": "Qwen/Qwen2.5-Coder-7B-Instruct", "finetuned_model1": "/lustre10/scratch/tkwang/SecSteer/axolotl-outputs/lora-merged/plus-step-90", "finetuned_model2": "/lustre10/scratch/tkwang/SecSteer/axolotl-outputs/lora-merged/minus-step-90", "finetuned_model3": null, "apply_to_diff_model_architecure": null, "output_model_name": "felixwangg/Qwen2.5-Coder-7B-Instruct-cpp-sec-step90-lam0.1", "output_dir": "/lustre10/scratch/tkwang/SecSteer/axolotl-outputs/weight-arithmetic/step-90/lambda-0.1", "scaling_coef": 0.1, "apply_line_scaling_t1": false, "apply_line_scaling_t2": false, "apply_line_scaling_t3": false, "scale_t1": 1.0, "scale_t2": -1.0, "scale_t3": null }
Downloads last month
1
Safetensors
Model size
8B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support