Finetuning using LoRA

#1
by titoghose - opened

Hello,

I was wondering if you were able to successfully finetune the Mamba model using LoRA? I've been trying to finetune the "state-spaces/mamba-130m-hf" model using LoRA, but the x_proj and out_proj layers do not train and always have `param.grad` set to `None`. Since I noticed that you've been trying to train the same model using LoRA, I was hoping to check whether you've found a solution to this issue?
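For reference, one way a `param.grad is None` symptom can arise is when a module's parameters are defined but never actually used in the forward pass (for example, if a fused kernel path reads weights directly instead of calling the wrapped submodule, so a LoRA adapter on it never enters the autograd graph). The toy model below is an illustrative sketch of that mechanism, not the actual Mamba code:

```python
import torch
import torch.nn as nn

class Toy(nn.Module):
    def __init__(self):
        super().__init__()
        self.used = nn.Linear(4, 4)
        # Defined but never invoked in forward(), standing in for a layer
        # that a fused code path bypasses.
        self.bypassed = nn.Linear(4, 4)

    def forward(self, x):
        return self.used(x)  # self.bypassed.forward never runs

m = Toy()
loss = m(torch.randn(2, 4)).sum()
loss.backward()

print(m.used.weight.grad is not None)   # gradient exists
print(m.bypassed.weight.grad is None)   # grad stays None, as in the issue
```

If that is what's happening, the adapters on x_proj/out_proj would receive no gradient no matter how the optimizer is configured, since they never participate in the loss computation.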

A detailed description of my issue with code and output is here.

Thanks!
