Sentence Similarity
sentence-transformers
Safetensors
xlm-roberta
feature-extraction
Generated from Trainer
dataset_size:80
loss:CoSENTLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use Ouchbara/result_model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use Ouchbara/result_model with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("Ouchbara/result_model") sentences = [ "A man with blond-hair, and a brown shirt drinking out of a public water fountain.", "A blond man wearing a brown shirt is reading a book on a bench in the park", "The friends scowl at each other over a full dinner table.", "Two adults walk across a street." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
metadata
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:80
- loss:CoSENTLoss
base_model: abdeljalilELmajjodi/model
widget:
- source_sentence: >-
A man with blond-hair, and a brown shirt drinking out of a public water
fountain.
sentences:
- >-
A blond man wearing a brown shirt is reading a book on a bench in the
park
- The friends scowl at each other over a full dinner table.
- Two adults walk across a street.
- source_sentence: >-
An older man sits with his orange juice at a small table in a coffee shop
while employees in bright colored shirts smile in the background.
sentences:
- The woman and man are playing baseball together.
- >-
The friends have just met for the first time in 20 years, and have had a
great time catching up.
- >-
An older man drinks his juice as he waits for his daughter to get off
work.
- source_sentence: >-
Two adults, one female in white, with shades and one male, gray clothes,
walking across a street, away from a eatery with a blurred image of a dark
colored red shirted person in the foreground.
sentences:
- There are no women in the picture.
- A person eating.
- The woman is wearing green.
- source_sentence: A man, woman, and child enjoying themselves on a beach.
sentences:
- A family of three is at the beach.
- The mans briefcase is for work.
- A person is training his horse for a competition.
- source_sentence: Children smiling and waving at camera
sentences:
- The family is on vacation.
- Two groups of rival gang members flipped each other off.
- There are children present
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
- pearson_cosine
- spearman_cosine
model-index:
- name: SentenceTransformer based on abdeljalilELmajjodi/model
results:
- task:
type: semantic-similarity
name: Semantic Similarity
dataset:
name: pair score evaluator dev
type: pair-score-evaluator-dev
metrics:
- type: pearson_cosine
value: -0.1629711561999381
name: Pearson Cosine
- type: spearman_cosine
value: 0.01599191652998732
name: Spearman Cosine
SentenceTransformer based on abdeljalilELmajjodi/model
This is a sentence-transformers model finetuned from abdeljalilELmajjodi/model on the all-nli dataset. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for retrieval.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: abdeljalilELmajjodi/model
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 1024 dimensions
- Similarity Function: Cosine Similarity
- Supported Modality: Text
- Training Dataset:
- all-nli
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'transformer_task': 'feature-extraction', 'modality_config': {'text': {'method': 'forward', 'method_output_name': 'last_hidden_state'}}, 'module_output_name': 'token_embeddings', 'architecture': 'XLMRobertaModel'})
(1): Pooling({'embedding_dimension': 1024, 'pooling_mode': 'mean', 'include_prompt': True})
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
'Children smiling and waving at camera',
'There are children present',
'The family is on vacation.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.9857, 0.9845],
# [0.9857, 1.0000, 0.9931],
# [0.9845, 0.9931, 1.0000]])
Evaluation
Metrics
Semantic Similarity
- Dataset:
pair-score-evaluator-dev - Evaluated with
EmbeddingSimilarityEvaluator
| Metric | Value |
|---|---|
| pearson_cosine | -0.163 |
| spearman_cosine | 0.016 |
Training Details
Training Dataset
all-nli
- Dataset: all-nli
- Size: 80 training samples
- Columns:
sentence1,sentence2, andscore - Approximate statistics based on the first 80 samples:
sentence1 sentence2 score type string string float details - min: 10 tokens
- mean: 26.0 tokens
- max: 52 tokens
- min: 5 tokens
- mean: 11.95 tokens
- max: 29 tokens
- min: 0.0
- mean: 0.49
- max: 1.0
- Samples:
sentence1 sentence2 score A boy is jumping on skateboard in the middle of a red bridge.The boy is wearing safety equipment.0.5A Little League team tries to catch a runner sliding into a base in an afternoon game.A team is trying to score the games winning out.0.5Two blond women are hugging one another.The women are sleeping.0.0 - Loss:
CoSENTLosswith these parameters:{ "scale": 20.0, "similarity_fct": "pairwise_cos_sim" }
Evaluation Dataset
all-nli
- Dataset: all-nli
- Size: 20 evaluation samples
- Columns:
sentence1,sentence2, andscore - Approximate statistics based on the first 20 samples:
sentence1 sentence2 score type string string float details - min: 10 tokens
- mean: 24.65 tokens
- max: 52 tokens
- min: 6 tokens
- mean: 12.1 tokens
- max: 25 tokens
- min: 0.0
- mean: 0.55
- max: 1.0
- Samples:
sentence1 sentence2 score A woman is walking across the street eating a banana, while a man is following with his briefcase.The woman and man are playing baseball together.0.0A couple play in the tide with their young son.The family is on vacation.0.5Woman in white in foreground and a man slightly behind walking with a sign for John's Pizza and Gyro in the background.The woman and man are outdoors.1.0 - Loss:
CoSENTLosswith these parameters:{ "scale": 20.0, "similarity_fct": "pairwise_cos_sim" }
Training Hyperparameters
Non-Default Hyperparameters
num_train_epochs: 1warmup_steps: 0.05bf16: Truefp16_full_eval: Trueload_best_model_at_end: Truepush_to_hub: Truegradient_checkpointing: True
All Hyperparameters
Click to expand
do_predict: Falseprediction_loss_only: Trueper_device_train_batch_size: 8per_device_eval_batch_size: 8gradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 5e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 1max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: Nonewarmup_ratio: Nonewarmup_steps: 0.05log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Trueenable_jit_checkpoint: Falsesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseuse_cpu: Falseseed: 42data_seed: Nonebf16: Truefp16: Falsebf16_full_eval: Falsefp16_full_eval: Truetf32: Nonelocal_rank: -1ddp_backend: Nonedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonedisable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Trueignore_data_skip: Falsefsdp: []fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}parallelism_config: Nonedeepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torch_fusedoptim_args: Nonegroup_by_length: Falselength_column_name: lengthproject: huggingfacetrackio_space_id: trackioddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Truepush_to_hub: Trueresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsehub_revision: Nonegradient_checkpointing: Truegradient_checkpointing_kwargs: Noneinclude_for_metrics: []eval_do_concat_batches: Trueauto_find_batch_size: Falsefull_determinism: Falseddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_num_input_tokens_seen: noneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseliger_kernel_config: Noneeval_use_gather_object: Falseaverage_tokens_across_devices: Trueuse_cache: Falseprompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}
Training Logs
| Epoch | Step | Training Loss | Validation Loss | pair-score-evaluator-dev_spearman_cosine |
|---|---|---|---|---|
| 0.1 | 1 | 2.9349 | - | - |
| 0.5 | 5 | 3.0658 | - | - |
| 1.0 | 10 | 2.926 | 2.8427 | 0.016 |
- The bold row denotes the saved checkpoint.
Training Time
- Training: 22.5 minutes
Framework Versions
- Python: 3.12.13
- Sentence Transformers: 5.4.1
- Transformers: 5.0.0
- PyTorch: 2.10.0+cu128
- Accelerate: 1.13.0
- Datasets: 4.8.5
- Tokenizers: 0.22.2
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
CoSENTLoss
@article{10531646,
author={Huang, Xiang and Peng, Hao and Zou, Dongcheng and Liu, Zhiwei and Li, Jianxin and Liu, Kay and Wu, Jia and Su, Jianlin and Yu, Philip S.},
journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing},
title={CoSENT: Consistent Sentence Embedding via Similarity Ranking},
year={2024},
doi={10.1109/TASLP.2024.3402087}
}