---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- dense
- generated_from_trainer
- dataset_size:3465
- loss:MultipleNegativesRankingLoss
base_model: BAAI/bge-base-en-v1.5
widget:
- source_sentence: CMU walls (WB epoxy topcoat) - scrubber dump walls
  sentences:
  - CMU walls WB epoxy topcoat scrubber dump walls
  - digital mCP sErCo mOuntEd By gc eLectRicIAN
  - MV 5witchgear 15KV MV Meta1 Clad Lead time 52 weeks
- source_sentence: 'Install hydronic heat sys: EA; incl. boiler, piping, & dist. comp.'
  sentences:
  - sitE SCreen wAll AccEnt painting wainSCotS/bANds
  - 1nterior columns H5S - Safety Yellow To 12' AFF
  - 'Hydronic heat sys install: EA; incl. boiler, piping, & dist. comp.'
- source_sentence: Paint trash enclosure
  sentences:
  - Trash Enclosures
  - estImated rEImburSables BillED at cOst pluS 10 percent
  - Smoke detectors or interlock
- source_sentence: mobIlizATiON
  sentences:
  - Provide and install wood deck construction, complete with necessary framing, decking
    boards, and railings per CSI 06 10 53.
  - CP @ 10,000 SF; incl. P&I
  - mobiLIzation 1 moB
- source_sentence: Furnish and construct an earthen berm to ensure proper drainage
    and support landscaping in accordance with Section 31 22 00.
  sentences:
  - HVAC filter replacement, EA
  - SIte undErGround 10" Fire loOp Design bASed on 10" undergRound fiRe looP with
    10" Lead-inS brought to 8" afF
  - Excavate and place 100 cubic yards of earthen berm.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
---
# SentenceTransformer based on BAAI/bge-base-en-v1.5
This is a [sentence-transformers](https://www.SBERT.net) model fine-tuned from [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5). It maps sentences and paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5)
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 768 dimensions
- **Similarity Function:** Cosine Similarity
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': True, 'architecture': 'BertModel'})
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
```
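The pooling and normalization stages listed above can be illustrated without the library. The following is a minimal sketch in plain Python, assuming toy 4-dimensional token vectors in place of the real 768-dimensional transformer output; the helper names `cls_pool` and `l2_normalize` are hypothetical and are not part of the sentence-transformers API.

```python
import math


def cls_pool(token_embeddings):
    """Return the first token's vector ([CLS]), as CLS-token pooling does."""
    return list(token_embeddings[0])


def l2_normalize(vector):
    """Scale a vector to unit length, mirroring what a Normalize() stage does."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector] if norm > 0 else list(vector)


# Toy "transformer output": 3 tokens x 4 dimensions (assumed values for illustration).
token_embeddings = [
    [3.0, 0.0, 4.0, 0.0],  # [CLS]
    [1.0, 2.0, 3.0, 4.0],  # a word-piece token
    [0.5, 0.5, 0.5, 0.5],  # [SEP]
]

sentence_embedding = l2_normalize(cls_pool(token_embeddings))
print(sentence_embedding)  # unit-length vector derived from the [CLS] row only
```

Because the output is unit-length, the dot product of two such embeddings equals their cosine similarity, which matches the similarity function listed in the model description.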
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")

# Run inference
sentences = [
    'Furnish and construct an earthen berm to ensure proper drainage and support landscaping in accordance with Section 31 22 00.',
    'Excavate and place 100 cubic yards of earthen berm.',
    'SIte undErGround 10" Fire loOp Design bASed on 10" undergRound fiRe looP with 10" Lead-inS brought to 8" afF',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# (3, 768)

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.6402, 0.0300],
#         [0.6402, 1.0000, 0.1313],
#         [0.0300, 0.1313, 1.0000]])
```
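A similarity matrix like the one printed in the example is typically used to pick the closest match for a query line item. Here is a small sketch of that step, using the scores from the example output as a plain nested list so it runs without downloading the model; `best_match` is a hypothetical helper, not a library function.

```python
# Similarity scores copied from the example output in this section.
similarities = [
    [1.0000, 0.6402, 0.0300],
    [0.6402, 1.0000, 0.1313],
    [0.0300, 0.1313, 1.0000],
]


def best_match(scores, query_index):
    """Return (index, score) of the highest-scoring sentence other than the query."""
    candidates = [
        (i, score)
        for i, score in enumerate(scores[query_index])
        if i != query_index  # skip the trivial self-match on the diagonal
    ]
    return max(candidates, key=lambda pair: pair[1])


match_index, match_score = best_match(similarities, query_index=0)
print(match_index, match_score)  # 1 0.6402
```

For the berm specification (index 0), the closest match is the berm excavation item (index 1), while the unrelated fire loop item scores near zero.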
## Training Details
### Training Dataset
#### Unnamed Dataset
* Size: 3,465 training samples
* Columns: sentence_0, sentence_1, and label
* Approximate statistics based on the first 1000 samples:
| | sentence_0 | sentence_1 | label |
|:--------|:-----------|:-----------|:------|
| type | string | string | float |
* Samples:
| sentence_0 | sentence_1 | label |
|:-----------|:-----------|:------|
| Finish selected interior surfaces with a gloss paint application in accordance with specification standards. | Apply gloss finish paint, SF; interior surfaces only. | 0.0 |
| 9x10 Doors | Exterior Sectional Overhead Dock Door Amarr 9x10 2742 2" Thick (2) Rectangular Cut Insulated Windows 20K Cycle Springs Black Standard color white on exterior and white interior Motor operated | 1.0 |
| painT Hm doors And fraMEs | hm doORs/FRamEs | 0.0 |
* Loss: [MultipleNegativesRankingLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
```json
{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
```
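To make the `scale` and `cos_sim` parameters concrete, here is a minimal pure-Python sketch of the in-batch-negatives objective that MultipleNegativesRankingLoss is based on: for each anchor, the scaled cosine similarities to every candidate in the batch are treated as logits, and cross-entropy is taken against the anchor's own pair. The 2-D vectors and the function name `mnr_loss` are assumptions for illustration; the library's implementation operates on batched tensors.

```python
import math


def cos_sim(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def mnr_loss(anchors, positives, scale=20.0):
    """Mean cross-entropy where positives[i] is the target for anchors[i]
    and all other positives in the batch act as negatives."""
    losses = []
    for i, anchor in enumerate(anchors):
        logits = [scale * cos_sim(anchor, p) for p in positives]
        # Numerically stable log-sum-exp over the batch.
        max_logit = max(logits)
        log_sum_exp = max_logit + math.log(sum(math.exp(l - max_logit) for l in logits))
        losses.append(log_sum_exp - logits[i])
    return sum(losses) / len(losses)


# Toy 2-D embeddings (assumed values): each anchor is close to its own pair.
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]
swapped = [[0.1, 0.9], [0.9, 0.1]]  # pairs deliberately mismatched

print(mnr_loss(anchors, aligned))  # near zero: correct pairs already rank first
print(mnr_loss(anchors, swapped))  # large: the wrong candidate ranks first
```

A larger `scale` sharpens the softmax, so ranking mistakes are penalized more heavily and correctly ranked pairs contribute almost nothing.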
### Training Hyperparameters
#### Non-Default Hyperparameters
- `per_device_train_batch_size`: 16
- `per_device_eval_batch_size`: 16
- `max_grad_norm`: 0.5
- `num_train_epochs`: 5
- `multi_dataset_batch_sampler`: round_robin
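As a rough guide to what these settings imply for the training schedule, the arithmetic below estimates the number of optimizer steps. It is a sketch under stated assumptions: a single device, no gradient accumulation, and the final partial batch kept. None of these is confirmed by the values listed here.

```python
import math

dataset_size = 3465         # training samples, from the dataset section
per_device_batch_size = 16  # per_device_train_batch_size
num_epochs = 5              # num_train_epochs
num_devices = 1             # assumption: device count is not stated in the card

# Batches per epoch, keeping the last partial batch (assumption).
steps_per_epoch = math.ceil(dataset_size / (per_device_batch_size * num_devices))
total_steps = steps_per_epoch * num_epochs

# With in-batch negatives, each anchor is contrasted with the other pairs in its batch.
in_batch_negatives = per_device_batch_size - 1

print(steps_per_epoch, total_steps, in_batch_negatives)  # 217 1085 15
```

Under these assumptions, each anchor is compared against at most 15 in-batch negatives per step, which is why the batch size directly affects how hard the ranking task is.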
#### All Hyperparameters