Continuous training pre-trained Bloom on custom biomedical dataset

#139

by Siddharth63 - opened Nov 11, 2022

Nov 11, 2022

I want to use an already pretrained bloom model and fine-tune (continue training) it on my custom biomedical dataset. Has anyone solved it and share a link to the script to do this finetuning?

TimeRobber

BigScience Workshop org Nov 14, 2022

Hi @Siddharth63 ! If you want to use Megatron-DeepSpeed, we were able to do it (typically that's how we built BLOOMZ, there's a README in the GH repo https://github.com/bigscience-workshop/xmtf). Otherwise I'd suggest looking at this: https://huggingface.co/bigscience/bloom/discussions/46

Closing as this seems to be a duplicate of https://huggingface.co/bigscience/bloom/discussions/46. Feel free to re-open if you think I mistakenly closed it.

TimeRobber changed discussion status to closed Nov 14, 2022

pe65374

May 23, 2023

I guess Siddharth63 prefer continuous pretraining instead of finetune? https://huggingface.co/bigscience/bloom/discussions/46 is more likely finetuning discussion thread.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment