Do you plan to open-source the training code?

by adol01 - opened Aug 2, 2024

Discussion

adol01

Aug 2, 2024

This model is really good; it would be great if it could be open-sourced.

thenlper

Alibaba-NLP org Aug 5, 2024

In the short term, we do not plan to open-source the training code. Our main focus remains on how to build better and more efficient models, which we will then open-source to the community.

izhx

Alibaba-NLP org Aug 5, 2024

The MLM pre-training code is adapted from Hugging Face code (run_mlm.py) to fit the large dataset, without too many additional modifications for optimization.

The contrastive learning code is similar to texttron/tevatron/, nomic-ai/contrastors, or FlagOpen/FlagEmbedding.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment