jxm/cde-small-v2

Tags: Feature Extraction, sentence-transformers, Safetensors, Transformers, mteb, modernbert, custom_code, Eval Results (legacy)

Instructions for using jxm/cde-small-v2 with the libraries listed below.

  • Libraries
  • sentence-transformers

    How to use jxm/cde-small-v2 with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("jxm/cde-small-v2", trust_remote_code=True)
    
    sentences = [
        "The weather is lovely today.",
        "It's so sunny outside!",
        "He drove to the stadium."
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # torch.Size([3, 3])
  • Transformers

    How to use jxm/cde-small-v2 with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline
    
    pipe = pipeline("feature-extraction", model="jxm/cde-small-v2", trust_remote_code=True)
    
    # Alternatively, load the model directly
    from transformers import AutoModel
    
    model = AutoModel.from_pretrained("jxm/cde-small-v2", trust_remote_code=True, dtype="auto")
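
The sentence-transformers example above prints only the shape of the similarity matrix. As a rough illustration of what such a matrix contains, here is a minimal pure-Python sketch of pairwise cosine similarity. It assumes the model uses cosine similarity, which is the sentence-transformers default when no other similarity function is configured; the helper name `cosine_similarity_matrix` and the toy vectors are made up for illustration and are not part of this repository.

```python
import math

def cosine_similarity_matrix(embeddings):
    """Return the pairwise cosine similarity of a list of equal-length vectors."""
    # Euclidean norm of each vector, computed once up front.
    norms = [math.sqrt(sum(x * x for x in vec)) for vec in embeddings]
    # Entry [i][j] is dot(u_i, v_j) / (|u_i| * |v_j|).
    return [
        [
            sum(a * b for a, b in zip(u, v)) / (nu * nv)
            for v, nv in zip(embeddings, norms)
        ]
        for u, nu in zip(embeddings, norms)
    ]

# Toy 2-dimensional vectors standing in for real embeddings (hypothetical data).
toy_embeddings = [
    [1.0, 0.0],
    [1.0, 1.0],
    [0.0, 2.0],
]
similarities = cosine_similarity_matrix(toy_embeddings)
print(len(similarities), len(similarities[0]))  # one row and one column per input
```

Each diagonal entry is 1 (a vector compared with itself), and orthogonal vectors score 0, which is why three input sentences yield a 3 x 3 matrix.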
cde-small-v2 (1.23 GB)
  • 2 contributors
History: 26 commits
Latest commit: Fix tokenizer loading (#12) by jxm, 4e1d021 (verified), 12 months ago
  • .gitattributes (1.52 kB): initial commit, over 1 year ago
  • README.md (252 kB): Clean up README slightly (#7), over 1 year ago
  • config.json (1.16 kB): Update config.json, over 1 year ago
  • config_sentence_transformers.json (287 Bytes): Create config_sentence_transformers.json, over 1 year ago
  • misc.py (17.5 kB): Upload ContextualDocumentEmbeddingTransformer, over 1 year ago
  • model.py (41.5 kB): edit source, over 1 year ago
  • model.safetensors (1.22 GB, xet): Upload ContextualDocumentEmbeddingTransformer, over 1 year ago
  • modules.json (149 Bytes): Create modules.json, over 1 year ago
  • sentence_bert_config.json (2 Bytes): Create sentence_bert_config.json, over 1 year ago
  • sentence_transformers_impl.py (6.1 kB): Fix tokenizer loading (#12), 12 months ago
  • tokenizer.json (2.13 MB): Upload 2 files, about 1 year ago
  • tokenizer_config.json (20.8 kB): Upload 2 files, about 1 year ago