Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Fe2x
/
bge-base-financial-matryoshka

Sentence Similarity
sentence-transformers
Safetensors
English
bert
feature-extraction
Generated from Trainer
dataset_size:6300
loss:MatryoshkaLoss
loss:MultipleNegativesRankingLoss
Eval Results (legacy)
text-embeddings-inference
Model card Files Files and versions
xet
Community

Instructions to use Fe2x/bge-base-financial-matryoshka with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use Fe2x/bge-base-financial-matryoshka with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("Fe2x/bge-base-financial-matryoshka")
    
    sentences = [
        "What year do the patent families related to DARZALEX expire in the United States?",
        "Amortization for owned content predominantly monetized on an individual basis and accrued costs associated with participations and residuals payments are recorded using the individual film forecast computation method, which recognizes the costs in the same ratio as the associated ultimate revenue.",
        "The two patent families both expire in the United States in 2029.",
        "For the year ended December 31, 2022, net cash used in investing activities of $371.9 million was primarily from the purchase of $247.3 million marketable securities, net of sale and maturities, $62.2 million net cash used to acquire GreenCom, SolarLeadFactory and ClipperCreek, $46.4 million used in purchases of test and assembly equipment to expand our supply capacity, related facility improvements and information technology enhancements, including capitalized costs related to internal-use software and $16.0 million used to invest in private companies."
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [4, 4]
  • Notebooks
  • Google Colab
  • Kaggle
bge-base-financial-matryoshka
439 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
Fe2x's picture
Fe2x
Add new SentenceTransformer model
5619cce verified over 1 year ago
  • 1_Pooling
    Add new SentenceTransformer model over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    30 kB
    Add new SentenceTransformer model over 1 year ago
  • config.json
    740 Bytes
    Add new SentenceTransformer model over 1 year ago
  • config_sentence_transformers.json
    205 Bytes
    Add new SentenceTransformer model over 1 year ago
  • model.safetensors
    438 MB
    xet
    Add new SentenceTransformer model over 1 year ago
  • modules.json
    349 Bytes
    Add new SentenceTransformer model over 1 year ago
  • sentence_bert_config.json
    52 Bytes
    Add new SentenceTransformer model over 1 year ago
  • special_tokens_map.json
    695 Bytes
    Add new SentenceTransformer model over 1 year ago
  • tokenizer.json
    712 kB
    Add new SentenceTransformer model over 1 year ago
  • tokenizer_config.json
    1.27 kB
    Add new SentenceTransformer model over 1 year ago
  • vocab.txt
    232 kB
    Add new SentenceTransformer model over 1 year ago