How many GPUs are required to fine-tune bge-m3 on 1 million triplets?

#18

by wilfoderek - opened Feb 28, 2024

Feb 28, 2024

Congratulations to the whole BAAI team on the excellent work!
I am currently collecting 1 million triplets (query, list[pos], list[neg]). I wonder how many GPUs are required for the fine-tuning?
Any suggestions are welcome, friends.
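
For readers preparing similar data, here is a minimal sketch of how triplets in the (query, list[pos], list[neg]) layout could be validated and serialized as one JSON object per line. The field names `query`, `pos`, and `neg`, the helper names, and the sample texts are assumptions made for illustration; check the fine-tuning script you use for the exact schema it expects.

```python
import json

# Hypothetical sample records; the texts are made up for illustration only.
triplets = [
    {
        "query": "how many GPUs do I need to fine-tune an embedding model",
        "pos": ["Fine-tuning an embedding model on a multi-GPU node."],
        "neg": ["A recipe for sourdough bread.", "Train timetables for Madrid."],
    },
    {
        "query": "what is a hard negative",
        "pos": ["A hard negative is a passage that looks relevant but is not."],
        "neg": ["The weather in Beijing today."],
    },
]


def validate(record):
    """Raise ValueError if a record does not follow the assumed triplet layout."""
    if not isinstance(record.get("query"), str) or not record["query"].strip():
        raise ValueError("query must be a non-empty string")
    for key in ("pos", "neg"):
        values = record.get(key)
        if not isinstance(values, list) or not values:
            raise ValueError(f"{key} must be a non-empty list")
        if not all(isinstance(v, str) for v in values):
            raise ValueError(f"{key} must contain only strings")


def to_jsonl(records):
    """Validate every record and return the JSON Lines text (one object per line)."""
    lines = []
    for record in records:
        validate(record)
        lines.append(json.dumps(record, ensure_ascii=False))
    return "\n".join(lines)


jsonl_text = to_jsonl(triplets)
```

Validating before training catches empty positive or negative lists early, which is cheaper than discovering them partway through a multi-GPU run.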

Shitao

Beijing Academy of Artificial Intelligence org Feb 29, 2024

Thanks for your interest in our work! I think 8*A100 is enough.

dlitoria

Mar 22, 2024

@wilfoderek were you able to fine-tune the model? I fine-tuned it and it is now giving me a .9995 similarity score for everything, no matter what the string is. I must have goofed up the training process, I guess.
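
One way to confirm a symptom like this is to embed a handful of deliberately unrelated strings and look at their pairwise cosine similarities. The sketch below is an illustration only: it takes plain lists of floats as embeddings rather than calling the model, and the function names and the 0.99 threshold are assumptions, not part of any library.

```python
import math
from itertools import combinations


def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pairwise_stats(embeddings):
    """Return (lowest, mean) cosine similarity over all distinct pairs."""
    sims = [cosine(a, b) for a, b in combinations(embeddings, 2)]
    return min(sims), sum(sims) / len(sims)


def looks_collapsed(embeddings, threshold=0.99):
    """True if even the least similar pair scores at or above the threshold.

    The threshold is an assumed value for illustration; pick one that suits
    your model and data.
    """
    lowest, _ = pairwise_stats(embeddings)
    return lowest >= threshold


# Toy vectors standing in for the embeddings of unrelated texts.
healthy = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
collapsed = [[1.0, 1.001], [1.0, 1.0], [1.001, 1.0]]
```

Running the same check on the base model and on the fine-tuned checkpoint with identical inputs shows whether the collapse was introduced by training.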

wilfoderek

Jun 2, 2024

Still working on collecting data! But from what you describe, your problem might be related to overfitting.

nafi-ahmed

May 1, 2025

@dlitoria how much GPU VRAM did it take to fine tune it? While evaluating it, it took 20GB of my GPU, so I wonder if I'm doing anything wrong?
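
As a rough aid for questions like this, the sketch below estimates the memory taken by model weights and by Adam training state. It assumes about 568 million parameters for bge-m3 (an assumption; check the model card) and the common mixed-precision accounting of 16 bytes per parameter. Activation memory, which grows with batch size and sequence length, is not counted, so real usage will be higher.

```python
GIB = 1024 ** 3

# Assumed parameter count for bge-m3; verify against the model card.
ASSUMED_NUM_PARAMS = 568_000_000


def weights_gib(num_params, bytes_per_param):
    """Memory for the weights alone (2 bytes for fp16/bf16, 4 for fp32)."""
    return num_params * bytes_per_param / GIB


def adam_mixed_precision_gib(num_params):
    """Weights, gradients, and Adam state under mixed precision.

    Per parameter: 2 (fp16 weights) + 2 (fp16 gradients)
    + 4 (fp32 master weights) + 4 + 4 (two fp32 Adam moments) = 16 bytes.
    Activations and framework overhead are excluded.
    """
    return num_params * 16 / GIB


inference_fp16 = weights_gib(ASSUMED_NUM_PARAMS, 2)
training_state = adam_mixed_precision_gib(ASSUMED_NUM_PARAMS)
```

Under these assumptions the fp16 weights come to roughly 1 GiB and the training state to roughly 8.5 GiB, so a 20GB footprint during evaluation would point to activations (large batches or long inputs) rather than to the weights themselves.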
