missing files

by igprod - opened Aug 23, 2025

Aug 23, 2025

•

edited Aug 23, 2025

Hi HumbleWorth team,

Thanks a lot for releasing the TLD embedding model and the domain-MLM checkpoint.
We’re integrating your models in a domain appraisal pipeline
(Kaggle/Colab + GPU), and we’d like to load humbleworth/tld-embedding as a
drop-in SentenceTransformers model when possible.

Currently, the repo provides model.safetensors and tokenizer files, but it lacks
modules.json and config_sentence_transformers.json (404 on the Hub).
Also, the config has model_type: "dual_task_tld", which suggests a custom
architecture. Could you please provide one of the following?

A SentenceTransformers export (i.e., modules.json + config_sentence_transformers.json)
so we can load via SentenceTransformer(<local_dir>), or
The minimal custom class / loader code for "dual_task_tld" in Transformers,
and the expected MASK token / special tokens behavior to replicate your inference.

Repo links:

TLD embeddings: https://huggingface.co/humbleworth/tld-embedding

Many thanks for your great work!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment