MNLP_M3_document_encoder_old

This repository hosts a document encoder model intended for the MNLP M3 project.

Important Update Regarding Model Compatibility

Reason for the recent repository update:

Initially, this repository contained a model that was pushed as a standard Hugging Face transformers model. When attempting to load this model using the sentence-transformers library, I encountered an error indicating that it was not recognized as a sentence-transformers compatible model.

This issue arose because the sentence-transformers library expects specific configuration files (such as modules.json and sentence_bert_config.json) within the model's directory. These files are crucial for defining the model's specific architecture, including how its token embeddings are converted into a single sentence or document embedding (e.g., via mean pooling, CLS token pooling, etc.). Without these configurations, sentence-transformers cannot properly interpret the model's structure.

The intended model for this repository has always been the pre-trained sentence-transformers/all-MiniLM-L12-v2 model. The previous upload method inadvertently created a transformers model structure that lacked the necessary sentence-transformers-specific configurations for direct loading.

What has been changed:

To rectify this, I have re-uploaded the original sentence-transformers/all-MiniLM-L12-v2 model directly to a new MNLP_M3_document_encoder repository. This was done using the sentence-transformers library's save_to_hub function, which ensures that all the correct and required sentence-transformers configuration files are present.

Impact of the change:

Please note that the underlying model weights and architecture have not changed from the originally intended sentence-transformers/all-MiniLM-L12-v2 model. The update purely addresses the repository's internal file structure to ensure proper loading and seamless compatibility with the sentence-transformers library.

You can verify the previous model content by checking the commit history of this repository.

I sincerely apologize for this initial oversight and any confusion it may have caused.

Downloads last month: -

Safetensors

Model size

33.4M params

Tensor type

F32

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support