Text Ranking
sentence-transformers
Safetensors
xlm-roberta
cross-encoder
reranker
Generated from Trainer
dataset_size:147
loss:BinaryCrossEntropyLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use pujithapsx/test with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use pujithapsx/test with sentence-transformers:
from sentence_transformers import CrossEncoder model = CrossEncoder("pujithapsx/test") query = "Which planet is known as the Red Planet?" passages = [ "Venus is often called Earth's twin because of its similar size and proximity.", "Mars, known for its reddish appearance, is often referred to as the Red Planet.", "Jupiter, the largest planet in our solar system, has a prominent red spot.", "Saturn, famous for its rings, is sometimes mistaken for the Red Planet." ] scores = model.predict([(query, passage) for passage in passages]) print(scores) - Notebooks
- Google Colab
- Kaggle
metadata
tags:
- sentence-transformers
- cross-encoder
- reranker
- generated_from_trainer
- dataset_size:147
- loss:BinaryCrossEntropyLoss
base_model: BAAI/bge-reranker-v2-m3
pipeline_tag: text-ranking
library_name: sentence-transformers
metrics:
- accuracy
- accuracy_threshold
- f1
- f1_threshold
- precision
- recall
- average_precision
model-index:
- name: CrossEncoder based on BAAI/bge-reranker-v2-m3
results:
- task:
type: cross-encoder-classification
name: Cross Encoder Classification
dataset:
name: entity matching
type: entity-matching
metrics:
- type: accuracy
value: 1
name: Accuracy
- type: accuracy_threshold
value: 0.5193085670471191
name: Accuracy Threshold
- type: f1
value: 1
name: F1
- type: f1_threshold
value: 0.5193085670471191
name: F1 Threshold
- type: precision
value: 1
name: Precision
- type: recall
value: 1
name: Recall
- type: average_precision
value: 1
name: Average Precision
CrossEncoder based on BAAI/bge-reranker-v2-m3
This is a Cross Encoder model finetuned from BAAI/bge-reranker-v2-m3 using the sentence-transformers library. It computes scores for pairs of texts, which can be used for text reranking and semantic search.
Model Details
Model Description
- Model Type: Cross Encoder
- Base model: BAAI/bge-reranker-v2-m3
- Maximum Sequence Length: 512 tokens
- Number of Output Labels: 1 label
Model Sources
- Documentation: Sentence Transformers Documentation
- Documentation: Cross Encoder Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Cross Encoders on Hugging Face
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import CrossEncoder
# Download from the 🤗 Hub
model = CrossEncoder("pujithapsx/test")
# Get scores for pairs of texts
pairs = [
['Name: Sneha Reddy , First: Sneha , Middle: , Last: Reddy , Gender: F , DOB: 1991-07-11 , Spouse: , Mother: , Father: , Company: FINANCE CORP INDIA , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: HOUSE 687, BLOCK A kolkata , City: KOLKATA , State: WEST BENGAL , ZIP: 700001 , Address1: HOUSE 687, BLOCK A kolkata , City1: KOLKATA , State1: WEST BENGAL , ZIP1: 700001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9454123990 , Phone1: , Phone2: , Phone3: , Phone4: , Email: snehareddy@gmail.com , Email1: , Email2: , Email3: , Email4: ', 'Name: Swati Gupta , First: Swati , Middle: , Last: Gupta , Gender: F , DOB: 2000-09-02 , Spouse: , Mother: , Father: , Company: RETAIL GIANT LTD , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: PLOT 91, NEAR TEMPLE lucknow , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: PLOT 91, NEAR TEMPLE lucknow , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9447112584 , Phone1: , Phone2: , Phone3: , Phone4: , Email: swatigupta@yahoo.com , Email1: , Email2: , Email3: , Email4: '],
['Name: Pradeep Kumar , First: Pradeep , Middle: , Last: Kumar , Gender: M , DOB: 1984-05-19 , Spouse: , Mother: , Father: , Company: MICROSOFT INDIA , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: 169 KONDHWA NEAR SCHOOL , City: BANGALORE , State: KARNATAKA , ZIP: 560066 , Address1: 169 KONDHWA NEAR SCHOOL , City1: BANGALORE , State1: KARNATAKA , ZIP1: 560066 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 8574585188 , Phone1: , Phone2: , Phone3: , Phone4: , Email: pradeepkumar@workmail.com , Email1: , Email2: , Email3: , Email4: ', 'Name: Kavya Sharma , First: Kavya , Middle: , Last: Sharma , Gender: F , DOB: 2002-02-20 , Spouse: , Mother: , Father: , Company: PAYTM , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: 477 LAJPAT NAGAR NEAR TEMPLE , City: MUMBAI , State: MAHARASHTRA , ZIP: 400017 , Address1: 477 LAJPAT NAGAR NEAR TEMPLE , City1: MUMBAI , State1: MAHARASHTRA , ZIP1: 400017 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 7168571579 , Phone1: , Phone2: , Phone3: , Phone4: , Email: kavyasharma@workmail.com , Email1: , Email2: , Email3: , Email4: '],
['Name: Priya Prasad Reddy , First: Priya , Middle: Prasad , Last: Reddy , Gender: F , DOB: 1983-07-03 , Spouse: , Mother: , Father: , Company: GLOBAL INFOTECH , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: FLAT 401, LAKE APT, 24 MAIN RD , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: FLAT 401, LAKE APT, 24 MAIN RD , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9149203558 , Phone1: , Phone2: , Phone3: , Phone4: , Email: priyareddy@gmail.com , Email1: priya.reddy@globalinfotech.com , Email2: , Email3: , Email4: ', 'Name: Priya Reddy , First: Priya , Middle: , Last: Reddy , Gender: F , DOB: 1983-07-03 , Spouse: , Mother: , Father: , Company: GLOBAL INFOTECH PVT LTD , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9208449460 , Phone1: , Phone2: , Phone3: , Phone4: , Email: priyareddy@gmail.com , Email1: priya.reddy@globalinfotech.com , Email2: , Email3: , Email4: '],
['Name: Divya Patel , First: Divya , Middle: , Last: Patel , Gender: F , DOB: 1985-02-01 , Spouse: , Mother: , Father: , Company: INFOSYS TECHNOLOGIES , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: 316 SECTOR 12 NEAR PARK , City: HYDERABAD , State: TELANGANA , ZIP: 500048 , Address1: 316 SECTOR 12 NEAR PARK , City1: HYDERABAD , State1: TELANGANA , ZIP1: 500048 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 8883181644 , Phone1: , Phone2: , Phone3: , Phone4: , Email: divyapatel@outlook.com , Email1: , Email2: , Email3: , Email4: ', 'Name: Nitin Rao , First: Nitin , Middle: , Last: Rao , Gender: M , DOB: 1987-12-02 , Spouse: , Mother: , Father: , Company: RELIANCE JIO , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: 612 BANJARA HILLS NEAR PARK , City: COIMBATORE , State: TAMIL NADU , ZIP: 641039 , Address1: 612 BANJARA HILLS NEAR PARK , City1: COIMBATORE , State1: TAMIL NADU , ZIP1: 641039 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9358550137 , Phone1: , Phone2: , Phone3: , Phone4: , Email: nitinrao@gmail.com , Email1: , Email2: , Email3: , Email4: '],
['Name: Rahul Iyer , First: Rahul , Middle: , Last: Iyer , Gender: M , DOB: 1980-02-25 , Spouse: , Mother: , Father: , Company: MANUFACTURING CO , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: FLAT 271, PARK APT, 32 MAIN RD , City: BANGALORE , State: KARNATAKA , ZIP: 560001 , Address1: FLAT 271, PARK APT, 32 MAIN RD , City1: BANGALORE , State1: KARNATAKA , ZIP1: 560001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9162959284 , Phone1: , Phone2: , Phone3: , Phone4: , Email: rahuliyer@gmail.com , Email1: rahul.iyer@manufacturingco.com , Email2: , Email3: , Email4: ', 'Name: Rahul Iyer , First: Rahul , Middle: , Last: Iyer , Gender: M , DOB: 1980-02-25 , Spouse: , Mother: , Father: , Company: MANUFACTURING CO PVT LTD , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: FLAT 271, PARK APT, 32 MAIN RD , City: BANGALORE , State: KARNATAKA , ZIP: 560001 , Address1: FLAT 271, PARK APT, 32 MAIN RD , City1: BANGALORE , State1: KARNATAKA , ZIP1: 560001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9162959284 , Phone1: , Phone2: , Phone3: , Phone4: , Email: rahuliyer@gmail.com , Email1: rahul.iyer@manufacturingco.com , Email2: , Email3: , Email4: '],
]
scores = model.predict(pairs)
print(scores.shape)
# (5,)
# Or rank different texts based on similarity to a single text
ranks = model.rank(
'Name: Sneha Reddy , First: Sneha , Middle: , Last: Reddy , Gender: F , DOB: 1991-07-11 , Spouse: , Mother: , Father: , Company: FINANCE CORP INDIA , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: HOUSE 687, BLOCK A kolkata , City: KOLKATA , State: WEST BENGAL , ZIP: 700001 , Address1: HOUSE 687, BLOCK A kolkata , City1: KOLKATA , State1: WEST BENGAL , ZIP1: 700001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9454123990 , Phone1: , Phone2: , Phone3: , Phone4: , Email: snehareddy@gmail.com , Email1: , Email2: , Email3: , Email4: ',
[
'Name: Swati Gupta , First: Swati , Middle: , Last: Gupta , Gender: F , DOB: 2000-09-02 , Spouse: , Mother: , Father: , Company: RETAIL GIANT LTD , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: PLOT 91, NEAR TEMPLE lucknow , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: PLOT 91, NEAR TEMPLE lucknow , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9447112584 , Phone1: , Phone2: , Phone3: , Phone4: , Email: swatigupta@yahoo.com , Email1: , Email2: , Email3: , Email4: ',
'Name: Kavya Sharma , First: Kavya , Middle: , Last: Sharma , Gender: F , DOB: 2002-02-20 , Spouse: , Mother: , Father: , Company: PAYTM , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: 477 LAJPAT NAGAR NEAR TEMPLE , City: MUMBAI , State: MAHARASHTRA , ZIP: 400017 , Address1: 477 LAJPAT NAGAR NEAR TEMPLE , City1: MUMBAI , State1: MAHARASHTRA , ZIP1: 400017 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 7168571579 , Phone1: , Phone2: , Phone3: , Phone4: , Email: kavyasharma@workmail.com , Email1: , Email2: , Email3: , Email4: ',
'Name: Priya Reddy , First: Priya , Middle: , Last: Reddy , Gender: F , DOB: 1983-07-03 , Spouse: , Mother: , Father: , Company: GLOBAL INFOTECH PVT LTD , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9208449460 , Phone1: , Phone2: , Phone3: , Phone4: , Email: priyareddy@gmail.com , Email1: priya.reddy@globalinfotech.com , Email2: , Email3: , Email4: ',
'Name: Nitin Rao , First: Nitin , Middle: , Last: Rao , Gender: M , DOB: 1987-12-02 , Spouse: , Mother: , Father: , Company: RELIANCE JIO , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: 612 BANJARA HILLS NEAR PARK , City: COIMBATORE , State: TAMIL NADU , ZIP: 641039 , Address1: 612 BANJARA HILLS NEAR PARK , City1: COIMBATORE , State1: TAMIL NADU , ZIP1: 641039 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9358550137 , Phone1: , Phone2: , Phone3: , Phone4: , Email: nitinrao@gmail.com , Email1: , Email2: , Email3: , Email4: ',
'Name: Rahul Iyer , First: Rahul , Middle: , Last: Iyer , Gender: M , DOB: 1980-02-25 , Spouse: , Mother: , Father: , Company: MANUFACTURING CO PVT LTD , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: FLAT 271, PARK APT, 32 MAIN RD , City: BANGALORE , State: KARNATAKA , ZIP: 560001 , Address1: FLAT 271, PARK APT, 32 MAIN RD , City1: BANGALORE , State1: KARNATAKA , ZIP1: 560001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9162959284 , Phone1: , Phone2: , Phone3: , Phone4: , Email: rahuliyer@gmail.com , Email1: rahul.iyer@manufacturingco.com , Email2: , Email3: , Email4: ',
]
)
# [{'corpus_id': ..., 'score': ...}, {'corpus_id': ..., 'score': ...}, ...]
Evaluation
Metrics
Cross Encoder Classification
- Dataset:
entity-matching - Evaluated with
CrossEncoderClassificationEvaluator
| Metric | Value |
|---|---|
| accuracy | 1.0 |
| accuracy_threshold | 0.5193 |
| f1 | 1.0 |
| f1_threshold | 0.5193 |
| precision | 1.0 |
| recall | 1.0 |
| average_precision | 1.0 |
Training Details
Training Dataset
Unnamed Dataset
- Size: 147 training samples
- Columns:
sentence1,sentence2, andlabel - Approximate statistics based on the first 147 samples:
sentence1 sentence2 label type string string int details - min: 620 characters
- mean: 659.54 characters
- max: 715 characters
- min: 627 characters
- mean: 662.53 characters
- max: 731 characters
- 0: ~53.74%
- 1: ~46.26%
- Samples:
sentence1 sentence2 label Name: Riya Singh , First: Riya , Middle: , Last: Singh , Gender: F , DOB: 1992-01-11 , Spouse: , Mother: , Father: , Company: EDU SERVICES , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: HOUSE 821, GALI NO 5 chennai , City: CHENNAI , State: TAMIL NADU , ZIP: 600001 , Address1: HOUSE 821, GALI NO 5 chennai , City1: CHENNAI , State1: TAMIL NADU , ZIP1: 600001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9993381017 , Phone1: , Phone2: , Phone3: , Phone4: , Email: riyasingh@gmail.com , Email1: , Email2: , Email3: , Email4:Name: Sneha Rao , First: Sneha , Middle: , Last: Rao , Gender: F , DOB: 1981-04-14 , Spouse: , Mother: , Father: , Company: MANUFACTURING CO , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: PLOT 537, PHASE 2 ahmedabad , City: AHMEDABAD , State: GUJARAT , ZIP: 380001 , Address1: PLOT 537, PHASE 2 ahmedabad , City1: AHMEDABAD , State1: GUJARAT , ZIP1: 380001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9433154770 , Phone1: , Phone2: , Phone3: , Phone4: , Email: sneharao@yahoo.com , Email1: , Email2: , Email3: , Email4:0Name: Pooja Sharma , First: Pooja , Middle: , Last: Sharma , Gender: F , DOB: 1993-01-13 , Spouse: , Mother: , Father: , Company: MANUFACTURING CO , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: HOUSE 383, BLOCK A lucknow , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: HOUSE 383, BLOCK A lucknow , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9187623171 , Phone1: , Phone2: , Phone3: , Phone4: , Email: poojasharma@gmail.com , Email1: , Email2: , Email3: , Email4:Name: Meena Sharma , First: Meena , Middle: , Last: Sharma , Gender: F , DOB: 1977-05-27 , Spouse: , Mother: , Father: , Company: AUTO PARTS PVT , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: PLOT 713, PHASE 2 ahmedabad , City: AHMEDABAD , State: GUJARAT , ZIP: 380001 , Address1: PLOT 713, PHASE 2 ahmedabad , City1: AHMEDABAD , State1: GUJARAT , ZIP1: 380001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9581682167 , Phone1: , Phone2: , Phone3: , Phone4: , Email: meenasharma@yahoo.com , Email1: , Email2: , Email3: , Email4:0Name: Swati Iyer , First: Swati , Middle: , Last: Iyer , Gender: F , DOB: 1996-06-14 , Spouse: , Mother: , Father: , Company: EDU SERVICES , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: HOUSE 339, SECTOR 12 lucknow , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: HOUSE 339, SECTOR 12 lucknow , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9390894296 , Phone1: , Phone2: , Phone3: , Phone4: , Email: swatiiyer@gmail.com , Email1: , Email2: , Email3: , Email4:Name: Riya Verma , First: Riya , Middle: , Last: Verma , Gender: F , DOB: 1982-12-07 , Spouse: , Mother: , Father: , Company: ENERGY SOLUTIONS , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: PLOT 123, OLD TOWN kolkata , City: KOLKATA , State: WEST BENGAL , ZIP: 700001 , Address1: PLOT 123, OLD TOWN kolkata , City1: KOLKATA , State1: WEST BENGAL , ZIP1: 700001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9918394863 , Phone1: , Phone2: , Phone3: , Phone4: , Email: riyaverma@yahoo.com , Email1: , Email2: , Email3: , Email4:0 - Loss:
BinaryCrossEntropyLosswith these parameters:{ "activation_fn": "torch.nn.modules.linear.Identity", "pos_weight": null }
Evaluation Dataset
Unnamed Dataset
- Size: 32 evaluation samples
- Columns:
sentence1,sentence2, andlabel - Approximate statistics based on the first 32 samples:
sentence1 sentence2 label type string string int details - min: 622 characters
- mean: 662.03 characters
- max: 708 characters
- min: 629 characters
- mean: 666.31 characters
- max: 725 characters
- 0: ~53.12%
- 1: ~46.88%
- Samples:
sentence1 sentence2 label Name: Sneha Reddy , First: Sneha , Middle: , Last: Reddy , Gender: F , DOB: 1991-07-11 , Spouse: , Mother: , Father: , Company: FINANCE CORP INDIA , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: HOUSE 687, BLOCK A kolkata , City: KOLKATA , State: WEST BENGAL , ZIP: 700001 , Address1: HOUSE 687, BLOCK A kolkata , City1: KOLKATA , State1: WEST BENGAL , ZIP1: 700001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9454123990 , Phone1: , Phone2: , Phone3: , Phone4: , Email: snehareddy@gmail.com , Email1: , Email2: , Email3: , Email4:Name: Swati Gupta , First: Swati , Middle: , Last: Gupta , Gender: F , DOB: 2000-09-02 , Spouse: , Mother: , Father: , Company: RETAIL GIANT LTD , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: PLOT 91, NEAR TEMPLE lucknow , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: PLOT 91, NEAR TEMPLE lucknow , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9447112584 , Phone1: , Phone2: , Phone3: , Phone4: , Email: swatigupta@yahoo.com , Email1: , Email2: , Email3: , Email4:0Name: Pradeep Kumar , First: Pradeep , Middle: , Last: Kumar , Gender: M , DOB: 1984-05-19 , Spouse: , Mother: , Father: , Company: MICROSOFT INDIA , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: 169 KONDHWA NEAR SCHOOL , City: BANGALORE , State: KARNATAKA , ZIP: 560066 , Address1: 169 KONDHWA NEAR SCHOOL , City1: BANGALORE , State1: KARNATAKA , ZIP1: 560066 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 8574585188 , Phone1: , Phone2: , Phone3: , Phone4: , Email: pradeepkumar@workmail.com , Email1: , Email2: , Email3: , Email4:Name: Kavya Sharma , First: Kavya , Middle: , Last: Sharma , Gender: F , DOB: 2002-02-20 , Spouse: , Mother: , Father: , Company: PAYTM , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: 477 LAJPAT NAGAR NEAR TEMPLE , City: MUMBAI , State: MAHARASHTRA , ZIP: 400017 , Address1: 477 LAJPAT NAGAR NEAR TEMPLE , City1: MUMBAI , State1: MAHARASHTRA , ZIP1: 400017 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 7168571579 , Phone1: , Phone2: , Phone3: , Phone4: , Email: kavyasharma@workmail.com , Email1: , Email2: , Email3: , Email4:0Name: Priya Prasad Reddy , First: Priya , Middle: Prasad , Last: Reddy , Gender: F , DOB: 1983-07-03 , Spouse: , Mother: , Father: , Company: GLOBAL INFOTECH , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: FLAT 401, LAKE APT, 24 MAIN RD , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: FLAT 401, LAKE APT, 24 MAIN RD , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9149203558 , Phone1: , Phone2: , Phone3: , Phone4: , Email: priyareddy@gmail.com , Email1: priya.reddy@globalinfotech.com , Email2: , Email3: , Email4:Name: Priya Reddy , First: Priya , Middle: , Last: Reddy , Gender: F , DOB: 1983-07-03 , Spouse: , Mother: , Father: , Company: GLOBAL INFOTECH PVT LTD , ParentCompany: , TaxID: , LicenseID: , PassportID: , Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD , City: LUCKNOW , State: UTTAR PRADESH , ZIP: 226001 , Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD , City1: LUCKNOW , State1: UTTAR PRADESH , ZIP1: 226001 , Address2: , City2: , State2: , ZIP2: , Address3: , City3: , State3: , ZIP3: , Address4: , City4: , State4: , ZIP4: , Phone: 9208449460 , Phone1: , Phone2: , Phone3: , Phone4: , Email: priyareddy@gmail.com , Email1: priya.reddy@globalinfotech.com , Email2: , Email3: , Email4:1 - Loss:
BinaryCrossEntropyLosswith these parameters:{ "activation_fn": "torch.nn.modules.linear.Identity", "pos_weight": null }
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy: stepsper_device_train_batch_size: 64per_device_eval_batch_size: 64learning_rate: 2e-05weight_decay: 0.01num_train_epochs: 1warmup_ratio: 0.1use_cpu: Trueload_best_model_at_end: Truedataloader_pin_memory: False
All Hyperparameters
Click to expand
overwrite_output_dir: Falsedo_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 64per_device_eval_batch_size: 64per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 2e-05weight_decay: 0.01adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 1max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: Nonewarmup_ratio: 0.1warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Trueuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falsebf16: Falsefp16: Falsefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Trueignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}parallelism_config: Nonedeepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torch_fusedoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthproject: huggingfacetrackio_space_id: trackioddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Falsedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsehub_revision: Nonegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters:auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: noneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseliger_kernel_config: Noneeval_use_gather_object: Falseaverage_tokens_across_devices: Trueprompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}
Training Logs
| Epoch | Step | Validation Loss | entity-matching_average_precision |
|---|---|---|---|
| 0.3333 | 1 | 0.0162 | 1.0 |
| 0.6667 | 2 | 0.0148 | 1.0 |
| 1.0 | 3 | 0.0162 | 1.0 |
- The bold row denotes the saved checkpoint.
Framework Versions
- Python: 3.10.12
- Sentence Transformers: 5.3.0
- Transformers: 4.57.6
- PyTorch: 2.10.0+cu128
- Accelerate: 1.13.0
- Datasets: 4.8.4
- Tokenizers: 0.22.2
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}