Supporting gradient checkpointing for QLORA

#16

by ospanbatyr - opened Mar 5, 2024

Mar 5, 2024

Hi everyone,

While trying to finetune OLMo-7B with QLORA, OLMoForCausalLM does not support gradient checkpointing error is thrown in the prepare_model_for_kbit_training(model) line. Traceback:

Traceback (most recent call last):
  File "/scratch/users/oince22/hpc_run/CartographyFT/src/driver.py", line 39, in
main
    run_main(P, logger)
  File "/scratch/users/oince22/hpc_run/CartographyFT/src/driver.py", line 68, in
run_main
    llm, tokenizer = P.get_lm()
                     ^^^^^^^^^^
  File "/scratch/users/oince22/hpc_run/CartographyFT/src/params.py", line 311, 
in get_lm
    model = prepare_model_for_kbit_training(model)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File 
"/kuacc/users/oince22/.conda/envs/icl/lib/python3.11/site-packages/peft/utils/ot
her.py", line 139, in prepare_model_for_kbit_training
    model.gradient_checkpointing_enable(**gc_enable_kwargs)
  File 
"/kuacc/users/oince22/.conda/envs/icl/lib/python3.11/site-packages/transformers/
modeling_utils.py", line 2092, in gradient_checkpointing_enable
    raise ValueError(f"{self.__class__.__name__} does not support gradient 
checkpointing.")
ValueError: OLMoForCausalLM does not support gradient checkpointing.

amanrangapur

Oct 17, 2024

Hello @ospanbatyr , did you resolve this issue? If yes, I will close this.

ospanbatyr

Oct 18, 2024

Hi @amanrangapur ,

Nope, I didn't bother with OLMo and used other open-source LLMs.

Best regards.

amanrangapur changed discussion status to closed Oct 18, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment