Thanks for your reply, I have added PR here https://github.com/huggingface/peft/pull/3342. Please feel free to provide your suggestions.
Komal Kumar
ItsMaxNorm
AI & ML interests
None yet
Recent Activity
commentedon an article 1 day ago
Beyond LoRA: Can you beat the most popular fine-tuning technique? commentedon an article 2 days ago
Beyond LoRA: Can you beat the most popular fine-tuning technique? updated a dataset 4 days ago
ItsMaxNorm/privasis-reasoning-qaOrganizations
commented on Beyond LoRA: Can you beat the most popular fine-tuning technique? 1 day ago
commented on Beyond LoRA: Can you beat the most popular fine-tuning technique? 2 days ago
Thanks for analysis. Very helpful.
What about DEFT?
It adapts a pre-trained weight matrix by decomposing its update into two components with two trainable matrices: (1) a projection onto the complement of a low-rank subspace spanned by a low-rank matrix, and (2) a low-rank update. The single trainable low-rank matrix defines the subspace, while the other trainable low-rank matrix enables flexible parameter adaptation within that subspace.
upvoted a paper about 1 month ago
upvoted a collection about 2 months ago
upvoted an article about 2 months ago
Article
Pallas for people who know JAX but not kernels yet
ariG23498
• • 21upvoted a paper 2 months ago