fine-tuning with autotrain-advanced

by KnutJaegersberg - opened Nov 3, 2023

Nov 3, 2023

Hi, I was trying to fine-tune this model with autotrain-advanced, but I hit this exception:
out, q, k, v, out_padded, softmax_lse, S_dmask, rng_state = flash_attn_cuda.fwd(
RuntimeError: FlashAttention only support fp16 and bf16 data type

this is my command:
autotrain llm --train --project_name deacon-34b --model /run/media/knut/HD2/Yi-34B/ --data_path . --train_batch_size 1 --num_train_epochs 5 --trainer sft --use_int4 --use_peft --merge_adapter --target_modules "q_proj,k_proj,v_proj,o_proj"

I tried to reinstall a few libs, but could not resolve it. Is it already possible to fine tune your model with hf autotrain-advanced?

silentriverg

01-ai org Nov 3, 2023

When loading the model, consider adding torch_dtype='auto' to the model_class.from_pretrained call.
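
For reference, the keyword in transformers is spelled `torch_dtype`. Below is a minimal sketch of what such a load call might look like. The helper names `build_load_kwargs` and `load_model` are hypothetical, and `trust_remote_code` is included only as an assumption for checkpoints that ship their own modeling code. Whether this alone resolves the FlashAttention error under int4/PEFT training is not confirmed in this thread.

```python
def build_load_kwargs(dtype="auto", trust_remote_code=False):
    """Collect keyword arguments for from_pretrained (hypothetical helper).

    torch_dtype="auto" asks transformers to pick the dtype recorded with the
    checkpoint, when available, instead of defaulting to float32.
    """
    return {"torch_dtype": dtype, "trust_remote_code": trust_remote_code}


def load_model(model_path, **overrides):
    # Imported lazily so the helper above can be used without transformers installed.
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        model_path, **build_load_kwargs(**overrides)
    )
```

With the path from the original command this would be called as `load_model("/run/media/knut/HD2/Yi-34B/")`. Passing an explicit dtype such as `torch.bfloat16` instead of `"auto"` is also accepted by `from_pretrained`.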

KnutJaegersberg

Nov 3, 2023

I'm trying this with 4-bit or 8-bit PEFT fine-tuning. I hacked around both in the model's Python files and in the __main__.py of autotrain-advanced, and tried to add torch_dtype='auto' wherever it looked like it might work. So far I have had no luck. https://github.com/huggingface/autotrain-advanced/blob/main/src/autotrain/trainers/clm/__main__.py
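
One way to check whether a dtype setting actually took effect is to count the parameter dtypes of the loaded model. This is only a diagnostic sketch: `count_param_dtypes` is a hypothetical helper, and it assumes nothing more than a model object exposing `.parameters()` the way a `torch.nn.Module` does.

```python
from collections import Counter


def count_param_dtypes(model):
    """Return a Counter mapping dtype name to the number of parameter tensors."""
    return Counter(str(p.dtype) for p in model.parameters())
```

Calling `print(count_param_dtypes(model))` after loading shows at a glance whether any float32 parameters remain. With 4-bit or 8-bit loading, the quantized weights typically appear under a packed integer dtype, so the float entries are the ones to look at.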

KnutJaegersberg

Nov 5, 2023

I will close this. Please help make peft fine tuning with hf transformers work.

KnutJaegersberg changed discussion status to closed Nov 5, 2023

FancyZhao

Nov 5, 2023

Please help make peft fine tuning with hf transformers work.

Will do!

And please watch our github repo for the latest progress. 🤗
