Instructions to use nvidia/NV-Embed-v2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use nvidia/NV-Embed-v2 with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("feature-extraction", model="nvidia/NV-Embed-v2", trust_remote_code=True)
```

```python
# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("nvidia/NV-Embed-v2", trust_remote_code=True, dtype="auto")
```

- sentence-transformers
How to use nvidia/NV-Embed-v2 with sentence-transformers:
```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("nvidia/NV-Embed-v2", trust_remote_code=True)

sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)  # [3, 3]
```

- Notebooks
- Google Colab
- Kaggle
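By default, `model.similarity` in sentence-transformers computes pairwise cosine similarity between the two sets of embeddings, which is why the result above is a 3×3 matrix. As a rough sketch of what it returns, here is the same computation in plain numpy, using toy vectors in place of real NV-Embed-v2 embeddings:

```python
import numpy as np

def cosine_similarity_matrix(embeddings: np.ndarray) -> np.ndarray:
    """Pairwise cosine similarity, mirroring model.similarity(embeddings, embeddings)."""
    # Normalize each row to unit length, then take all pairwise dot products.
    normed = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    return normed @ normed.T

# Toy stand-in for sentence embeddings: 3 "sentences", 4-dimensional vectors.
emb = np.array([
    [0.1, 0.3, 0.5, 0.7],
    [0.2, 0.1, 0.4, 0.8],
    [0.9, 0.2, 0.1, 0.0],
])
sims = cosine_similarity_matrix(emb)
print(sims.shape)  # (3, 3)
```

The diagonal is all ones (each vector is maximally similar to itself) and the matrix is symmetric, just as with the real `model.similarity` output.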
max_seq_length seems not to be properly reported in sentence_bert_config.json
Hi authors, I was looking at the max_seq_length of this model and found that it is not written correctly in sentence_bert_config.json (https://huggingface.co/nvidia/NV-Embed-v2/blob/main/sentence_bert_config.json).
In the "Usage" section (https://huggingface.co/nvidia/NV-Embed-v2#usage-huggingface-transformers), the max_seq_length is 32768. Perhaps the 4096 in sentence_bert_config.json actually refers to the hidden size (i.e., the dimensionality of the output vector)?
Can you please check?
It would be awesome if @tomaarsen could check too.
Thank you.
Hello!
You're right, it should be 32768 in sentence_bert_config.json. The reason it doesn't affect the README snippet is that the 4096 is immediately overridden by model.max_seq_length = 32768. But we should probably update the config.
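For reference, the corrected sentence_bert_config.json might look like the following. This is a sketch based on the usual sentence-transformers config format (the do_lower_case field is shown as it typically appears in such files; check the actual file contents before committing):

```json
{
  "max_seq_length": 32768,
  "do_lower_case": false
}
```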
- Tom Aarsen
Hi @yjoonjang, thanks for the question. This issue has been resolved by merging your request: https://huggingface.co/nvidia/NV-Embed-v2/discussions/38.