vkerkez/GitVac-R-14B

vkerkez committed on Mar 4, 2025

Commit 7eda14a · verified · 1 Parent(s): 0b37687

Update README.md

Files changed (1)

README.md +2 -2

@@ -1,5 +1,5 @@
 ---
-license: cc-by-nc-2.0
+license: apache-2.0
 ---
 # GitVac
 Don't forget to vacuum your git repo.
@@ -544,4 +544,4 @@ This does a few things:
 3. Updates the reasoning to further improve its reasoning
 With this dataset, we can fine-tune to get a base model. This model can then be further improved through RLHF (Reinforcement Learning from Human Feedback) and GRPO (Guided Reward Policy Optimization) training, where it will continuously learn from new datasets generated by the pipeline. This creates a virtuous cycle of improvement, with each iteration building on the knowledge gained from previous runs.
-I should probably write up a whole separate post on this extended pipeline someday. For now enjoy this repo!
+I should probably write up a whole separate post on this extended pipeline someday. For now enjoy this repo!