bobox/RoBERTa-base-unsupervised-TSDAE

Sentence Similarity
sentence-transformers
PyTorch
roberta
feature-extraction
Generated from Trainer
dataset_size:300000
loss:DenoisingAutoEncoderLoss
Eval Results (legacy)
text-embeddings-inference

    How to use bobox/RoBERTa-base-unsupervised-TSDAE with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("bobox/RoBERTa-base-unsupervised-TSDAE")
    
    sentences = [
        "free in spain? Are Spain free Motorways toll-free Spain, renewing old concessions coming",
        "how to calculate weighted grade percentage in excel? To find the grade, multiply the grade for each assignment against the weight, and then add these totals all up. So for each cell (in the Total column) we will enter =SUM(Grade Cell * Weight Cell), so my first formula is =SUM(B2*C2), the next one would be =SUM(B3*C3) and so on.",
        "In Red Dead Redemption 2's story mode, players have to go to \"Story\" in the menu and then click the save icon from there. However, in Red Dead Online, there is no such option. On the contrary, players have no way to manually save their game, which is pretty much par for the course in an online multiplayer experience.",
        "are motorways free in spain? Are motorways in Spain free? Motorways are 90% toll-free in Spain. Since 2018, Spain isn't renewing old concessions coming to end."
    ]
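    # Encode each sentence into a fixed-size embedding vector.
    embeddings = model.encode(sentences)

    # Score every sentence against every other sentence.
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # torch.Size([4, 4])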
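
The 4 × 4 result of the snippet above holds one score per pair of input sentences. As a rough illustration of what such a matrix contains, here is a minimal pure-Python sketch that builds a pairwise cosine-similarity matrix. It assumes the model uses cosine similarity, which is the sentence-transformers default; this page does not confirm which similarity function this model is configured with. The helper name `cosine_similarity_matrix` and the toy vectors are hypothetical and are not output of the model.

```python
import math


def cosine_similarity_matrix(vectors):
    """Return the pairwise cosine-similarity matrix for a list of vectors."""
    # Pre-compute the Euclidean norm of every vector once.
    norms = [math.sqrt(sum(x * x for x in v)) for v in vectors]
    return [
        [
            sum(a * b for a, b in zip(u, v)) / (norm_u * norm_v)
            for v, norm_v in zip(vectors, norms)
        ]
        for u, norm_u in zip(vectors, norms)
    ]


# Toy 2-dimensional "embeddings" (illustrative values only).
toy = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
sims = cosine_similarity_matrix(toy)

for row in sims:
    print([round(score, 3) for score in row])
```

Each diagonal entry compares a vector with itself and is therefore 1 (up to floating-point rounding); off-diagonal entries shrink toward 0 as the vectors point in more different directions.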
RoBERTa-base-unsupervised-TSDAE
502 MB
  • 1 contributor
History: 2 commits
Latest commit: bobox, "trained on the initial 100k + 100k", 177db6c (verified), almost 2 years ago
  • 1_Pooling
    trained on the initial 100k + 100k almost 2 years ago
  • .gitattributes
    1.52 kB
    initial commit almost 2 years ago
  • README.md
    22.2 kB
    trained on the initial 100k + 100k almost 2 years ago
  • config.json
    677 Bytes
    trained on the initial 100k + 100k almost 2 years ago
  • config_sentence_transformers.json
    195 Bytes
    trained on the initial 100k + 100k almost 2 years ago
  • merges.txt
    456 kB
    trained on the initial 100k + 100k almost 2 years ago
  • modules.json
    229 Bytes
    trained on the initial 100k + 100k almost 2 years ago
  • pytorch_model.bin

    Detected Pickle imports (3)

    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict",
    • "torch.FloatStorage"

    499 MB
    xet
    trained on the initial 100k + 100k almost 2 years ago
  • sentence_bert_config.json
    53 Bytes
    trained on the initial 100k + 100k almost 2 years ago
  • special_tokens_map.json
    280 Bytes
    trained on the initial 100k + 100k almost 2 years ago
  • tokenizer.json
    2.11 MB
    trained on the initial 100k + 100k almost 2 years ago
  • tokenizer_config.json
    1.22 kB
    trained on the initial 100k + 100k almost 2 years ago
  • vocab.json
    798 kB
    trained on the initial 100k + 100k almost 2 years ago
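
The "Detected Pickle imports" entry for pytorch_model.bin above lists the globals that the pickle stream would import when loaded. How the Hub performs its scan is not shown on this page; the following is only a minimal sketch of the general idea, using the standard-library `pickletools` module to walk the opcodes of a raw pickle byte string without executing it. The function name `list_pickle_imports` is hypothetical, and the `STACK_GLOBAL` handling is a simplifying heuristic (it assumes the module and attribute names are the two most recently pushed strings, which does not hold when a pickler reuses memoized strings). A real checkpoint file may also wrap its pickle data in a container format that would need unpacking first.

```python
import collections
import io
import pickle
import pickletools

# Opcodes that push a text string onto the pickle stack.
_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data):
    """Return sorted 'module.name' globals referenced by a pickle stream.

    The stream is only parsed, never unpickled, so no code from it runs.
    """
    imports = set()
    strings = []  # string pushes seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name in ("GLOBAL", "INST"):
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name in _STRING_OPS:
            strings.append(arg)
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Heuristic: module and name were the last two strings pushed.
            imports.add(f"{strings[-2]}.{strings[-1]}")
    return sorted(imports)


# Small demonstration on an in-memory pickle (not on the real checkpoint).
demo = pickle.dumps(collections.OrderedDict(a=1), protocol=2, fix_imports=False)
print(list_pickle_imports(demo))
```

For untrusted checkpoints, inspecting the imports this way before loading is cheaper and safer than unpickling; any entry outside the expected tensor-rebuilding helpers deserves a closer look.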