sileod committed
Commit a98d39f · verified · 1 Parent(s): 58de7c2

Add new SentenceTransformer model

Files changed (2):
  1. README.md +152 -143
  2. model.safetensors +1 -1
README.md CHANGED
@@ -7,7 +7,7 @@ tags:
  - feature-extraction
  - dense
  - generated_from_trainer
- - dataset_size:6331245
  - loss:AnglELoss
  - loss:CoSENTLoss
  - loss:CachedMultipleNegativesRankingLoss
@@ -37,39 +37,41 @@ widget:
  \ pediatrician, or paediatrician. The word pediatrics and its cognates mean healer\
  \ of children; they derive from two Greek words: παῖς (pais child)\
  \ and ἰατρός (iatros doctor, healer)."
- - source_sentence: These ancient rites are rarely performed in contemporary Sri Lanka
- , but the conserved songs are still performed by folk musicians .
  sentences:
- - In 1971 , a main campus was completed in 33 MacDonnell Road for the new school
  .
- - These ancient rites are still performed in contemporary Sri Lanka , but the preserved
- songs are rarely performed by folk musicians .
- - After May 4 , 2012 , Gordon M. Snow was replaced by Joseph M. Demarest and then
- Michael S. Welch with limited formal announcement .
- - source_sentence: A woman is playing the flute.
  sentences:
- - A boy is playing the trumpet.
- - A man tries to read the paper.
- - A man is playing the guitar.
- - source_sentence: Interference now on all our scans.
  sentences:
- - Would you permit me to explain this Polly?
- - All Ourscans are jammed.
- - The aircraft family was first introduced at the Paris Air Show in 1999.
- - source_sentence: why has chs invested in da?
  sentences:
- - In order to renew the strategic road map to CHS's growth, CHS partnered with DA
- to improve outcomes rather than increasing its size. Most of DA's capacity was
- used to provide tools in order to support CHS-affiliated hospitals in delivering
- best-in-class healthcare to patients.
- - You can in theory add every enchantment that is compatible with a tool/weapon/armor
- onto the same item. The bow can have these 7 enchantments, though mending and
- infinity are mutually exclusive. So you can have up to 6 different enchantments
- on a bow using an anvil.
- - 'Clean up is a phrasal verb which means: to make (a room or space) clean and orderly.
- ... Clean out is a phrasal verb which means something such as a cupboard, room,
- or container, you take everything out of it and clean the inside of it thoroughly.
- Secondly, "clean"is a simple word which is often used in our daily life.'
  datasets:
  - google-research-datasets/paws
  - nyu-mll/glue
@@ -151,12 +153,12 @@ from sentence_transformers import SentenceTransformer
  model = SentenceTransformer("tasksource/ettin-32m-embed")
  # Run inference
  queries = [
- "why has chs invested in da?",
  ]
  documents = [
- "In order to renew the strategic road map to CHS's growth, CHS partnered with DA to improve outcomes rather than increasing its size. Most of DA's capacity was used to provide tools in order to support CHS-affiliated hospitals in delivering best-in-class healthcare to patients.",
- 'You can in theory add every enchantment that is compatible with a tool/weapon/armor onto the same item. The bow can have these 7 enchantments, though mending and infinity are mutually exclusive. So you can have up to 6 different enchantments on a bow using an anvil.',
- 'Clean up is a phrasal verb which means: to make (a room or space) clean and orderly. ... Clean out is a phrasal verb which means something such as a cupboard, room, or container, you take everything out of it and clean the inside of it thoroughly. Secondly, "clean"is a simple word which is often used in our daily life.',
  ]
  query_embeddings = model.encode_query(queries)
  document_embeddings = model.encode_document(documents)
@@ -166,7 +168,7 @@ print(query_embeddings.shape, document_embeddings.shape)
  # Get the similarity scores for the embeddings
  similarities = model.similarity(query_embeddings, document_embeddings)
  print(similarities)
- # tensor([[ 0.5738, 0.0240, -0.0787]])
  ```

  <!--
@@ -213,19 +215,19 @@ You can finetune this model on your own dataset.
  #### paws/labeled_final

  * Dataset: [paws/labeled_final](https://huggingface.co/datasets/paws) at [161ece9](https://huggingface.co/datasets/paws/tree/161ece9501cf0a11f3e48bd356eaa82de46d6a09)
- * Size: 49,401 training samples
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
  * Approximate statistics based on the first 1000 samples:
- |         | sentence1 | sentence2 | label |
- |:--------|:----------|:----------|:------|
- | type    | string | string | int |
- | details | <ul><li>min: 10 tokens</li><li>mean: 27.44 tokens</li><li>max: 51 tokens</li></ul> | <ul><li>min: 8 tokens</li><li>mean: 27.44 tokens</li><li>max: 51 tokens</li></ul> | <ul><li>0: ~55.60%</li><li>1: ~44.40%</li></ul> |
  * Samples:
- | sentence1 | sentence2 | label |
- |:----------|:----------|:------|
- | <code>In Paris , in October 1560 , he secretly met the English ambassador , Nicolas Throckmorton , asking him for a passport to return to England through Scotland .</code> | <code>In October 1560 , he secretly met with the English ambassador , Nicolas Throckmorton , in Paris , and asked him for a passport to return to Scotland through England .</code> | <code>0</code> |
- | <code>The NBA season of 1975 -- 76 was the 30th season of the National Basketball Association .</code> | <code>The 1975 -- 76 season of the National Basketball Association was the 30th season of the NBA .</code> | <code>1</code> |
- | <code>There are also specific discussions , public profile debates and project discussions .</code> | <code>There are also public discussions , profile specific discussions , and project discussions .</code> | <code>0</code> |
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
  ```json
  {
@@ -239,19 +241,19 @@ You can finetune this model on your own dataset.
  #### glue/mrpc

  * Dataset: [glue/mrpc](https://huggingface.co/datasets/glue) at [bcdcba7](https://huggingface.co/datasets/glue/tree/bcdcba79d07bc864c1c254ccfcedcce55bcc9a8c)
- * Size: 3,668 training samples
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
  * Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | int |
- | details | <ul><li>min: 10 tokens</li><li>mean: 27.55 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>min: 12 tokens</li><li>mean: 27.25 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>0: ~33.70%</li><li>1: ~66.30%</li></ul> |
  * Samples:
- | sentence1 | sentence2 | label |
- |:----------|:----------|:------|
- | <code>Amrozi accused his brother , whom he called " the witness " , of deliberately distorting his evidence .</code> | <code>Referring to him as only " the witness " , Amrozi accused his brother of deliberately distorting his evidence .</code> | <code>1</code> |
- | <code>Yucaipa owned Dominick 's before selling the chain to Safeway in 1998 for $ 2.5 billion .</code> | <code>Yucaipa bought Dominick 's in 1995 for $ 693 million and sold it to Safeway for $ 1.8 billion in 1998 .</code> | <code>0</code> |
- | <code>They had published an advertisement on the Internet on June 10 , offering the cargo for sale , he added .</code> | <code>On June 10 , the ship 's owners had published an advertisement on the Internet , offering the explosives for sale .</code> | <code>1</code> |
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
  ```json
  {
@@ -265,19 +267,19 @@ You can finetune this model on your own dataset.
  #### fever-evidence-related

  * Dataset: [fever-evidence-related](https://huggingface.co/datasets/mwong/fever-evidence-related) at [14aba00](https://huggingface.co/datasets/mwong/fever-evidence-related/tree/14aba009b5fcd97b1a9ee6f3e3b0da0e308cf7cb)
- * Size: 403,218 training samples
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
  * Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | int |
- | details | <ul><li>min: 6 tokens</li><li>mean: 13.92 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>min: 33 tokens</li><li>mean: 316.81 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>0: ~29.20%</li><li>1: ~70.80%</li></ul> |
  * Samples:
- | sentence1 | sentence2 | label |
- |:----------|:----------|:------|
- | <code>Nikolaj Coster-Waldau worked with the Fox Broadcasting Company.</code> | <code>Nikolaj Coster-Waldau -LRB- -LSB- neɡolaɪ kʰʌsdɐ ˈʋaldɑʊ -RSB- ; born 27 July 1970 -RRB- is a Danish actor , producer and screenwriter .. He graduated from Danish National School of Theatre in Copenhagen in 1993 .. Danish National School of Theatre. Danish National School of Theatre and Contemporary Dance. Copenhagen. Copenhagen. Coster-Waldau 's breakthrough performance in Denmark was his role in the film Nightwatch -LRB- 1994 -RRB- .. Nightwatch. Nightwatch ( 1994 film ). Since then he has appeared in numerous films in his native Scandinavia and Europe in general , including Headhunters -LRB- 2011 -RRB- and A Thousand Times Good Night -LRB- 2013 -RRB- .. Headhunters. Headhunters ( film ). A Thousand Times Good Night. A Thousand Times Good Night. In the United States , his debut film role was in the war film Black Hawk Down -LRB- 2001 -RRB- , playing Medal of Honor recipient Gary Gordon .. Black Hawk Down. Black Hawk Down ( film ). Gary Gordon. Gary Gordon. He then played Detective Jo...</code> | <code>0</code> |
- | <code>Nikolaj Coster-Waldau worked with the Fox Broadcasting Company.</code> | <code>Majboor -LRB- Hindi : मजबर , English : Compulsed -RRB- is a 1974 Indian Hindi crime-thriller film directed by Ravi Tandon .. Ravi Tandon. Ravi Tandon. Hindi. Hindi. crime. crime film. thriller film. thriller film. Music is by Laxmikant Pyarelal and lyrics by Anand Bakshi .. Laxmikant Pyarelal. Laxmikant Pyarelal. Anand Bakshi. Anand Bakshi. The film was written by Salim-Javed .. Salim-Javed. Salim-Javed. The movie stars Amitabh Bachchan , Parveen Babi , Pran , Madan Puri , Rehman and Farida Jalal .. Amitabh Bachchan. Amitabh Bachchan. Parveen Babi. Parveen Babi. Pran. Pran ( actor ). Farida Jalal. Farida Jalal. Madan Puri. Madan Puri. Rehman. Rehman ( actor ). It is a remake of an American film titled Zig Zag -LRB- 1970 film -RRB- starring George Kennedy The film was later remade in Telugu by director K. Raghavendra Rao as Raja -LRB- 1976 -RRB- starring Shobhan Babu and Jayasudha .. George Kennedy. George Kennedy. Telugu. Telugu language. K. Raghavendra Rao. K. Raghavendra Rao. Raja....</code> | <code>1</code> |
- | <code>Nikolaj Coster-Waldau worked with the Fox Broadcasting Company.</code> | <code>The small snakehead ' -LRB- Channa asiatica -RRB- is a species of snakehead .. Channa. Channa. snakehead. Channidae. It is one of four species of the genus Channa '' native to China .. Channa. Channa. China. China. It also can be found in Taiwan and southern Japan , to which it migrated -LRB- or was introduced -RRB- .. Taiwan. Taiwan. Japan. Japan. It is a medium-sized snakehead which is a nestbuilder -LRB- as opposed to the Indian mouthbrooder dwarf snakeheads -RRB- .. snakehead. Channidae. mouthbrooder. mouthbrooder</code> | <code>1</code> |
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
  ```json
  {
@@ -291,19 +293,19 @@ You can finetune this model on your own dataset.
  #### parade

  * Dataset: [parade](https://huggingface.co/datasets/tasksource/parade) at [466978f](https://huggingface.co/datasets/tasksource/parade/tree/466978f31aebf4d052287f32ea3ae393f178f386)
- * Size: 7,550 training samples
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
  * Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | int |
- | details | <ul><li>min: 6 tokens</li><li>mean: 21.97 tokens</li><li>max: 61 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 21.81 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>0: ~57.10%</li><li>1: ~42.90%</li></ul> |
  * Samples:
- | sentence1 | sentence2 | label |
- |:----------|:----------|:------|
- | <code>predictive models are involved with predicting a value based on other values in the dataset. the process of training a predictive model is known as supervised learning.</code> | <code>predict a value based on other values in the dataset. process of training a pred model is supervised learning.</code> | <code>1</code> |
- | <code>predict a value based on other values in the dataset. process of training a pred model is supervised learning.</code> | <code>involved with predicting a value based on other values in the dataset; process of training this type of model is known as supervised learning</code> | <code>1</code> |
- | <code>predicting one value (the target variable) using other values</code> | <code>predictive models are involved with predicting a value based on other values in the dataset.</code> | <code>1</code> |
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
  ```json
  {
@@ -317,19 +319,19 @@ You can finetune this model on your own dataset.
  #### apt

  * Dataset: [apt](https://huggingface.co/datasets/tasksource/apt) at [f6c07f6](https://huggingface.co/datasets/tasksource/apt/tree/f6c07f66d3eccebd36418885ce10aff295d436dd)
- * Size: 3,349 training samples
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
  * Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | int |
- | details | <ul><li>min: 4 tokens</li><li>mean: 17.28 tokens</li><li>max: 124 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 16.99 tokens</li><li>max: 121 tokens</li></ul> | <ul><li>0: ~35.90%</li><li>1: ~64.10%</li></ul> |
  * Samples:
- | sentence1 | sentence2 | label |
- |:----------|:----------|:------|
- | <code>Come on.</code> | <code>Come on</code> | <code>1</code> |
- | <code>In Washington, the federal government remained closed for a second day.</code> | <code>The federal government in Washington was closed for a second day running.</code> | <code>1</code> |
- | <code>The findings appear in next Friday's Physical Review Letters.</code> | <code>Results published next Friday</code> | <code>0</code> |
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
  ```json
  {
@@ -343,19 +345,19 @@ You can finetune this model on your own dataset.
  #### glue/stsb

  * Dataset: [glue/stsb](https://huggingface.co/datasets/glue) at [bcdcba7](https://huggingface.co/datasets/glue/tree/bcdcba79d07bc864c1c254ccfcedcce55bcc9a8c)
- * Size: 5,749 training samples
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
  * Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | float |
- | details | <ul><li>min: 6 tokens</li><li>mean: 10.16 tokens</li><li>max: 28 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 10.12 tokens</li><li>max: 25 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 2.23</li><li>max: 5.0</li></ul> |
  * Samples:
- | sentence1 | sentence2 | label |
- |:----------|:----------|:------|
- | <code>A plane is taking off.</code> | <code>An air plane is taking off.</code> | <code>5.0</code> |
- | <code>A man is playing a large flute.</code> | <code>A man is playing a flute.</code> | <code>3.799999952316284</code> |
- | <code>A man is spreading shreded cheese on a pizza.</code> | <code>A man is spreading shredded cheese on an uncooked pizza.</code> | <code>3.799999952316284</code> |
  * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
  ```json
  {
@@ -369,19 +371,19 @@ You can finetune this model on your own dataset.
  #### sick/relatedness

  * Dataset: sick/relatedness
- * Size: 4,439 training samples
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
  * Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | float |
- | details | <ul><li>min: 6 tokens</li><li>mean: 12.66 tokens</li><li>max: 28 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 12.46 tokens</li><li>max: 38 tokens</li></ul> | <ul><li>min: 1.0</li><li>mean: 3.41</li><li>max: 5.0</li></ul> |
  * Samples:
- | sentence1 | sentence2 | label |
- |:----------|:----------|:------|
- | <code>A group of kids is playing in a yard and an old man is standing in the background</code> | <code>A group of boys in a yard is playing and a man is standing in the background</code> | <code>4.5</code> |
- | <code>A group of children is playing in the house and there is no man standing in the background</code> | <code>A group of kids is playing in a yard and an old man is standing in the background</code> | <code>3.200000047683716</code> |
- | <code>The young boys are playing outdoors and the man is smiling nearby</code> | <code>The kids are playing outdoors near a man with a smile</code> | <code>4.699999809265137</code> |
  * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
  ```json
  {
@@ -395,19 +397,19 @@ You can finetune this model on your own dataset.
  #### sts-companion

  * Dataset: [sts-companion](https://huggingface.co/datasets/tasksource/sts-companion) at [fd8beff](https://huggingface.co/datasets/tasksource/sts-companion/tree/fd8beffb788df5f6673bc688e6dcbe3690a3acc6)
- * Size: 4,760 training samples
  * Columns: <code>label</code>, <code>sentence1</code>, and <code>sentence2</code>
  * Approximate statistics based on the first 1000 samples:
- |         | label | sentence1 | sentence2 |
- |:--------|:------|:----------|:----------|
- | type    | float | string | string |
- | details | <ul><li>min: 0.0</li><li>mean: 3.09</li><li>max: 5.0</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 18.91 tokens</li><li>max: 91 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 17.28 tokens</li><li>max: 83 tokens</li></ul> |
  * Samples:
- | label | sentence1 | sentence2 |
- |:------|:----------|:----------|
- | <code>1.6</code> | <code>this lus in this frame refer to biological entities labeled by the fe organism. an organism is described as something that can be alive, or have naturally occuring biological processes and functions, however the concept of life is often used metaphorically for non-organic entities which resemble or act as if they have organic life.</code> | <code>living things collectively;</code> |
- | <code>3.8</code> | <code>Washington's Economic Boom, Financed by You Real life "Hunger Games"</code> | <code>Washington?s Economic Boom, Financed by You</code> |
- | <code>4.4</code> | <code>Knowledge of foreign languages is accepted as a necessary precursor to mobility.</code> | <code>It is accepted that knowledge of foreign languages is a necessary precondition to mobility.</code> |
  * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
  ```json
  {
@@ -421,19 +423,19 @@ You can finetune this model on your own dataset.
  #### zero-shot-label-nli

  * Dataset: [zero-shot-label-nli](https://huggingface.co/datasets/tasksource/zero-shot-label-nli) at [ee693db](https://huggingface.co/datasets/tasksource/zero-shot-label-nli/tree/ee693dba923b5d5484aa9232b7357c5e45dd39b8)
- * Size: 800,000 training samples
  * Columns: <code>label</code>, <code>sentence1</code>, and <code>sentence2</code>
  * Approximate statistics based on the first 1000 samples:
  |         | label | sentence1 | sentence2 |
  |:--------|:------|:----------|:----------|
  | type    | int | string | string |
- | details | <ul><li>0: ~51.20%</li><li>1: ~48.80%</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 62.72 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>min: 7 tokens</li><li>mean: 8.01 tokens</li><li>max: 16 tokens</li></ul> |
  * Samples:
- | label | sentence1 | sentence2 |
- |:------|:----------|:----------|
- | <code>0</code> | <code>How can you build website like Facebook?<br>How do you make a site like Facebook?</code> | <code>This example is not_duplicate.</code> |
- | <code>0</code> | <code>Warren Buffet was born on August 30 , 1932 .<br>Warren Edward Buffett -LRB- -LSB- ˈbʌfᵻt -RSB- born August 30 , 1930 -RRB- is an American business magnate , investor , and philanthropist .</code> | <code>This example is SUPPORTS.</code> |
- | <code>0</code> | <code>raise : Raise a siege. :<br>raise : The President raised several million dollars for his college. :</code> | <code>This example is True.</code> |
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
  ```json
  {
@@ -726,11 +728,11 @@ You can finetune this model on your own dataset.
  ### Training Hyperparameters
  #### Non-Default Hyperparameters

- - `per_device_train_batch_size`: 384
- - `learning_rate`: 0.0001
- - `weight_decay`: 1e-06
  - `num_train_epochs`: 1
- - `warmup_ratio`: 0.1
  - `fp16`: True
  - `gradient_checkpointing`: True
  - `torch_compile`: True
@@ -743,15 +745,15 @@ You can finetune this model on your own dataset.
  - `do_predict`: False
  - `eval_strategy`: no
  - `prediction_loss_only`: True
- - `per_device_train_batch_size`: 384
  - `per_device_eval_batch_size`: 8
  - `per_gpu_train_batch_size`: None
  - `per_gpu_eval_batch_size`: None
  - `gradient_accumulation_steps`: 1
  - `eval_accumulation_steps`: None
  - `torch_empty_cache_steps`: None
- - `learning_rate`: 0.0001
- - `weight_decay`: 1e-06
  - `adam_beta1`: 0.9
  - `adam_beta2`: 0.999
  - `adam_epsilon`: 1e-08
@@ -760,7 +762,7 @@ You can finetune this model on your own dataset.
  - `max_steps`: -1
  - `lr_scheduler_type`: linear
  - `lr_scheduler_kwargs`: {}
- - `warmup_ratio`: 0.1
  - `warmup_steps`: 0
  - `log_level`: passive
  - `log_level_replica`: warning
@@ -864,38 +866,45 @@ You can finetune this model on your own dataset.
  ### Training Logs
  | Epoch | Step | Training Loss |
  |:------:|:-----:|:-------------:|
- | 0.0303 | 500 | 4.8473 |
- | 0.0606 | 1000 | 2.6754 |
- | 0.0909 | 1500 | 2.6358 |
- | 0.1212 | 2000 | 2.619 |
- | 0.1515 | 2500 | 2.8342 |
- | 0.1818 | 3000 | 2.2872 |
- | 0.2121 | 3500 | 2.2727 |
- | 0.2424 | 4000 | 2.3469 |
- | 0.2727 | 4500 | 2.1085 |
- | 0.3030 | 5000 | 2.2076 |
- | 0.3334 | 5500 | 2.1161 |
- | 0.3637 | 6000 | 2.2332 |
- | 0.3940 | 6500 | 2.1574 |
- | 0.4243 | 7000 | 2.1012 |
- | 0.4546 | 7500 | 1.946 |
- | 0.4849 | 8000 | 1.7233 |
- | 0.5152 | 8500 | 2.4444 |
- | 0.5455 | 9000 | 2.1055 |
- | 0.5758 | 9500 | 1.9107 |
- | 0.6061 | 10000 | 2.0212 |
- | 0.6364 | 10500 | 2.1029 |
- | 0.6667 | 11000 | 1.8484 |
- | 0.6970 | 11500 | 2.1658 |
- | 0.7273 | 12000 | 2.1007 |
- | 0.7576 | 12500 | 1.9194 |
- | 0.7879 | 13000 | 1.6709 |
- | 0.8182 | 13500 | 1.7653 |
- | 0.8485 | 14000 | 1.952 |
- | 0.8788 | 14500 | 1.8437 |
- | 0.9091 | 15000 | 1.6667 |
- | 0.9395 | 15500 | 1.7433 |
- | 0.9698 | 16000 | 1.7623 |

  ### Framework Versions
 
  - feature-extraction
  - dense
  - generated_from_trainer
+ - dataset_size:7176192
  - loss:AnglELoss
  - loss:CoSENTLoss
  - loss:CachedMultipleNegativesRankingLoss

  \ pediatrician, or paediatrician. The word pediatrics and its cognates mean healer\
  \ of children; they derive from two Greek words: παῖς (pais child)\
  \ and ἰατρός (iatros doctor, healer)."
+ - source_sentence: Creek Township borders Elsinboro Township , Pennsville Township
+ and Salem .
  sentences:
+ - Today , Galesburg-Augusta Community Schools consists of a primary school and a
+ high school in Galesburg and a middle school in Augusta .
+ - Elsinboro Township borders with the Lower Alloways Creek Township , Pennsville
+ Township and Salem .
+ - In 1953 , he married the actress Gilda Neeltje , sister of the actress Diane Holland
  .
+ - source_sentence: A man is riding on one wheel on a motorcycle.
  sentences:
+ - A person is performing tricks on a motorcycle.
+ - A boy jumping in the air on the beach.
+ - A woman is pouring ingredients into a frying pan.
+ - source_sentence: '''Why don''t you find out?'
  sentences:
+ - He is suggesting that the lack of effort focusing on the concept is making it
+ seem unrealistic.
+ - The military stated that the 244th Engineer Battalion has been handling the construction
+ of playgrounds, cleaning up the rubble and restoring irrigation services in Iraq.
+ - Why you haven't find out?.
+ - source_sentence: what are the three subatomic particles called?
  sentences:
+ - Subatomic particles include electrons, the negatively charged, almost massless
+ particles that nevertheless account for most of the size of the atom, and they
+ include the heavier building blocks of the small but very dense nucleus of the
+ atom, the positively charged protons and the electrically neutral neutrons.
+ - Your body needs cholesterol to build healthy cells, but high levels of cholesterol
+ can increase your risk of heart disease. With high cholesterol, you can develop
+ fatty deposits in your blood vessels. Eventually, these deposits grow, making
+ it difficult for enough blood to flow through your arteries.
+ - 'If you experience any of the following symptoms, stop taking ibuprofen and call
+ your doctor: stomach pain, heartburn, vomit that is bloody or looks like coffee
+ grounds, blood in the stool, or black and tarry stools. Keep all appointments
+ with your doctor and the laboratory.'
  datasets:
  - google-research-datasets/paws
  - nyu-mll/glue

  model = SentenceTransformer("tasksource/ettin-32m-embed")
  # Run inference
  queries = [
+ "what are the three subatomic particles called?",
  ]
  documents = [
+ 'Subatomic particles include electrons, the negatively charged, almost massless particles that nevertheless account for most of the size of the atom, and they include the heavier building blocks of the small but very dense nucleus of the atom, the positively charged protons and the electrically neutral neutrons.',
+ 'Your body needs cholesterol to build healthy cells, but high levels of cholesterol can increase your risk of heart disease. With high cholesterol, you can develop fatty deposits in your blood vessels. Eventually, these deposits grow, making it difficult for enough blood to flow through your arteries.',
+ 'If you experience any of the following symptoms, stop taking ibuprofen and call your doctor: stomach pain, heartburn, vomit that is bloody or looks like coffee grounds, blood in the stool, or black and tarry stools. Keep all appointments with your doctor and the laboratory.',
  ]
  query_embeddings = model.encode_query(queries)
  document_embeddings = model.encode_document(documents)

  # Get the similarity scores for the embeddings
  similarities = model.similarity(query_embeddings, document_embeddings)
170
  print(similarities)
171
+ # tensor([[ 0.6600, -0.0148, 0.0229]])
172
  ```
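By default, `model.similarity` computes cosine similarity between the two sets of embeddings. A minimal sketch of that computation on toy vectors (not the model's actual embeddings; the helper name `cosine` is ours):

```python
import math

def cosine(u, v):
    """Cosine similarity: dot product divided by the product of L2 norms."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# A toy query vector and two toy document vectors.
query = [1.0, 0.0, 1.0]
docs = [[1.0, 0.0, 1.0],   # identical direction -> similarity 1.0
        [0.0, 1.0, 0.0]]   # orthogonal         -> similarity 0.0
print([round(cosine(query, d), 4) for d in docs])  # [1.0, 0.0]
```

Ranking documents for a query then reduces to sorting by this score, which is what the similarity matrix above encodes row by row.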
173
 
174
  <!--
 
215
  #### paws/labeled_final
216
 
217
  * Dataset: [paws/labeled_final](https://huggingface.co/datasets/paws) at [161ece9](https://huggingface.co/datasets/paws/tree/161ece9501cf0a11f3e48bd356eaa82de46d6a09)
218
+ * Size: 148,203 training samples
219
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
220
  * Approximate statistics based on the first 1000 samples:
221
+ | | sentence1 | sentence2 | label |
222
+ |:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:------------------------------------------------|
223
+ | type | string | string | int |
224
+ | details | <ul><li>min: 11 tokens</li><li>mean: 27.65 tokens</li><li>max: 57 tokens</li></ul> | <ul><li>min: 11 tokens</li><li>mean: 27.73 tokens</li><li>max: 57 tokens</li></ul> | <ul><li>0: ~57.50%</li><li>1: ~42.50%</li></ul> |
225
  * Samples:
226
+ | sentence1 | sentence2 | label |
227
+ |:------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
228
+ | <code>Ceremonial music ( `` rokon fada '' ) is listed as a status symbol , and musicians are generally chosen for political reasons as opposed to musical ones .</code> | <code>Ceremonial music ( `` rokon fada '' ) is performed as a status symbol , and musicians are generally chosen for musical reasons as opposed to political ones .</code> | <code>0</code> |
229
+ | <code>In 1989 he travelled to South Africa , Johannesburg and Angola , Mozambique on a peace-seeking mission .</code> | <code>In 1989 , he traveled to Mozambique , Johannesburg , and Angola , South Africa on a peace-seeking mission .</code> | <code>1</code> |
230
+ | <code>In this way , the Nestorian faith was established in the East under tragic signs .</code> | <code>In this way , under Nestorian auspices , the tragic faith was established in the East .</code> | <code>0</code> |
231
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
232
  ```json
233
  {
 
241
  #### glue/mrpc
242
 
243
  * Dataset: [glue/mrpc](https://huggingface.co/datasets/glue) at [bcdcba7](https://huggingface.co/datasets/glue/tree/bcdcba79d07bc864c1c254ccfcedcce55bcc9a8c)
244
+ * Size: 11,004 training samples
245
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
246
  * Approximate statistics based on the first 1000 samples:
247
  | | sentence1 | sentence2 | label |
248
  |:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:------------------------------------------------|
249
  | type | string | string | int |
250
+ | details | <ul><li>min: 11 tokens</li><li>mean: 27.23 tokens</li><li>max: 52 tokens</li></ul> | <ul><li>min: 11 tokens</li><li>mean: 27.29 tokens</li><li>max: 53 tokens</li></ul> | <ul><li>0: ~33.10%</li><li>1: ~66.90%</li></ul> |
251
  * Samples:
252
+ | sentence1 | sentence2 | label |
253
+ |:------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
254
+ | <code>Tony Blair has taken a hardline stance arguing nothing should be done to lessen the pressure on Mugabe at the gathering in the capital Abuja .</code> | <code>The Prime Minister has taken a hardline stance arguing nothing should be done to lessen the pressure on Mugabe .</code> | <code>0</code> |
255
+ | <code>The identical rovers will act as robotic geologists , searching for evidence of past water .</code> | <code>The rovers act as robotic geologists , moving on six wheels .</code> | <code>0</code> |
256
+ | <code>" We make no apologies for finding every legal way possible to protect the American public from further terrorist attack , " Barbara Comstock said .</code> | <code>" We make no apologies for finding every legal way possible to protect the American public from further terrorist attacks , " said Barbara Comstock , Ashcroft 's press secretary .</code> | <code>1</code> |
257
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
258
  ```json
259
  {
 
267
  #### fever-evidence-related
268
 
269
  * Dataset: [fever-evidence-related](https://huggingface.co/datasets/mwong/fever-evidence-related) at [14aba00](https://huggingface.co/datasets/mwong/fever-evidence-related/tree/14aba009b5fcd97b1a9ee6f3e3b0da0e308cf7cb)
270
+ * Size: 800,000 training samples
271
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
272
  * Approximate statistics based on the first 1000 samples:
273
  | | sentence1 | sentence2 | label |
274
  |:--------|:----------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|:------------------------------------------------|
275
  | type | string | string | int |
276
+ | details | <ul><li>min: 7 tokens</li><li>mean: 13.65 tokens</li><li>max: 28 tokens</li></ul> | <ul><li>min: 28 tokens</li><li>mean: 318.06 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>0: ~30.20%</li><li>1: ~69.80%</li></ul> |
277
  * Samples:
278
+ | sentence1 | sentence2 | label |
279
+ |:-----------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
280
+ | <code>Batman: The Killing Joke features characters.</code> | <code>notice. Cantonese Pinyin -LRB- , also known as 教院式拼音方案 -RRB- is a romanization system for Cantonese developed by Rev. Yu Ping Chiu in 1971 , and subsequently modified by the Education Department -LRB- merged into the Education and Manpower Bureau since 2003 -RRB- of Hong Kong and Prof. Zhan Bohui of the Chinese Dialects Research Centre of the Jinan University , Guangdong , PRC , and honorary professor of the School of Chinese , University of Hong Kong .. romanization. romanization. Cantonese. Cantonese. Education and Manpower Bureau. Education and Manpower Bureau. Zhan Bohui. Zhan Bohui. It is the only romanization system accepted by Education and Manpower Bureau of Hong Kong and Hong Kong Examinations and Assessment Authority .. romanization. romanization. Education and Manpower Bureau. Education and Manpower Bureau. Hong Kong Examinations and Assessment Authority. Hong Kong Examinations and Assessment Authority. The formal and short forms of the system 's Chinese names mean respectiv...</code> | <code>1</code> |
281
+ | <code>Jon Snow is played by a person.</code> | <code>Cao'an is a temple in Jinjiang , Fujian .. Originally constructed by Chinese Manicheans , it was viewed by later worshipers as a Buddhist temple .. Manicheans. Manichaeism. This `` Manichean temple in Buddhist disguise ''. is seen by modern experts on Manichaeism as `` the only extant Manichean temple in China '' , or `` the only Manichean building which has survived intact '' .</code> | <code>1</code> |
282
+ | <code>Scotland includes islands.</code> | <code>Scotland -LRB- -LSB- ˈskɒt.lənd -RSB- Scots : -LSB- - scoˈskɔt.lənd -RSB- Alba -LSB- ˈalˠapə -RSB- -RRB- is a country that is part of the United Kingdom and covers the northern third of the island of Great Britain .. Scots. Scots language. Scotland. Scots Law. Alba. Alba. country. country. part. Countries of the United Kingdom. United Kingdom. United Kingdom. Great Britain. Great Britain. It shares a border with England to the south , and is otherwise surrounded by the Atlantic Ocean , with the North Sea to the east and the North Channel and Irish Sea to the south-west .. England. England. Atlantic Ocean. Atlantic Ocean. North Sea. North Sea. North Channel. North Channel ( British Isles ). Irish Sea. Irish Sea. In addition to the mainland , the country is made up of more than 790 islands , including the Northern Isles and the Hebrides .. country. country. Northern Isles. Northern Isles. Hebrides. Hebrides. The Kingdom of Scotland emerged as an independent sovereign state in the Early ...</code> | <code>0</code> |
283
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
284
  ```json
285
  {
 
293
  #### parade
294
 
295
  * Dataset: [parade](https://huggingface.co/datasets/tasksource/parade) at [466978f](https://huggingface.co/datasets/tasksource/parade/tree/466978f31aebf4d052287f32ea3ae393f178f386)
296
+ * Size: 22,650 training samples
297
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
298
  * Approximate statistics based on the first 1000 samples:
299
  | | sentence1 | sentence2 | label |
300
  |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:------------------------------------------------|
301
  | type | string | string | int |
302
+ | details | <ul><li>min: 6 tokens</li><li>mean: 22.21 tokens</li><li>max: 61 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 21.48 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>0: ~54.80%</li><li>1: ~45.20%</li></ul> |
303
  * Samples:
304
+ | sentence1 | sentence2 | label |
305
+ |:---------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
306
+ | <code>access to device itself application specific data (network services, dns, html, http, etc)</code> | <code>(upper layer data)facilitates communication between such programs and lower-layer network services. high-level apis, including resource sharing, remote file access.</code> | <code>0</code> |
307
+ | <code>an important element of information management, but it is just one part of a larger whole</code> | <code>converting facts and figures into useful information</code> | <code>0</code> |
308
+ | <code>web site that has a field for you to type in a search query, as it will search the internet for you using your search criteria.</code> | <code>web-based search tool that locates a web page using a keyword</code> | <code>1</code> |
309
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
310
  ```json
311
  {
 
319
  #### apt
320
 
321
  * Dataset: [apt](https://huggingface.co/datasets/tasksource/apt) at [f6c07f6](https://huggingface.co/datasets/tasksource/apt/tree/f6c07f66d3eccebd36418885ce10aff295d436dd)
322
+ * Size: 10,047 training samples
323
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
324
  * Approximate statistics based on the first 1000 samples:
325
  | | sentence1 | sentence2 | label |
326
  |:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:------------------------------------------------|
327
  | type | string | string | int |
328
+ | details | <ul><li>min: 4 tokens</li><li>mean: 17.32 tokens</li><li>max: 213 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 16.46 tokens</li><li>max: 121 tokens</li></ul> | <ul><li>0: ~35.80%</li><li>1: ~64.20%</li></ul> |
329
  * Samples:
330
+ | sentence1 | sentence2 | label |
331
+ |:------------------------------------------------------------------|:-------------------------------------------------------------------------|:---------------|
332
+ | <code>Watch out.</code> | <code>U.S. Bank</code> | <code>0</code> |
333
+ | <code>Oh! we spent all night, used all the fancy machines.</code> | <code>We spent all night using the luxurious equipment.</code> | <code>1</code> |
334
+ | <code>I'm willing to give you all this information...</code> | <code>This information, all of it, I'm inclined to provide you...</code> | <code>1</code> |
335
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
336
  ```json
337
  {
 
345
  #### glue/stsb
346
 
347
  * Dataset: [glue/stsb](https://huggingface.co/datasets/glue) at [bcdcba7](https://huggingface.co/datasets/glue/tree/bcdcba79d07bc864c1c254ccfcedcce55bcc9a8c)
348
+ * Size: 17,247 training samples
349
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
350
  * Approximate statistics based on the first 1000 samples:
351
  | | sentence1 | sentence2 | label |
352
  |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------|
353
  | type | string | string | float |
354
+ | details | <ul><li>min: 6 tokens</li><li>mean: 14.68 tokens</li><li>max: 57 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 14.84 tokens</li><li>max: 68 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 2.64</li><li>max: 5.0</li></ul> |
355
  * Samples:
356
+ | sentence1 | sentence2 | label |
357
+ |:----------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------|:--------------------------------|
358
+ | <code>Mandela's condition has 'improved'</code> | <code>Mandela's condition has 'worsened over past 48 hours'</code> | <code>1.0</code> |
359
+ | <code>the cfe is very important for european security.</code> | <code>the cfe is a cornerstone of european security.</code> | <code>5.0</code> |
360
+ | <code>The Nasdaq fell about 1.3% for the month, snapping a seven-month winning streak.</code> | <code>The Nasdaq is down roughly 0.4 percent for the month, on track to snap a 7-month streak of gains.</code> | <code>2.4000000953674316</code> |
361
  * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
362
  ```json
363
  {
 
371
  #### sick/relatedness
372
 
373
  * Dataset: sick/relatedness
374
+ * Size: 13,317 training samples
375
  * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
376
  * Approximate statistics based on the first 1000 samples:
377
  | | sentence1 | sentence2 | label |
378
  |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------|
379
  | type | string | string | float |
380
+ | details | <ul><li>min: 6 tokens</li><li>mean: 12.25 tokens</li><li>max: 28 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 12.11 tokens</li><li>max: 38 tokens</li></ul> | <ul><li>min: 1.0</li><li>mean: 3.51</li><li>max: 5.0</li></ul> |
381
  * Samples:
382
+ | sentence1 | sentence2 | label |
383
+ |:------------------------------------------------------|:------------------------------------------------------------------------------------|:--------------------------------|
384
+ | <code>A cold cyclist is celebrating</code> | <code>A bike is being held over his head by a bicyclist in a group of people</code> | <code>2.299999952316284</code> |
385
+ | <code>Nobody is cutting a capsicum into pieces</code> | <code>The person is slicing a clove of garlic into pieces</code> | <code>3.0999999046325684</code> |
386
+ | <code>A woman is not cutting shrimps</code> | <code>A man is chopping butter into a container</code> | <code>1.7999999523162842</code> |
387
  * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
388
  ```json
389
  {
 
397
  #### sts-companion
398
 
399
  * Dataset: [sts-companion](https://huggingface.co/datasets/tasksource/sts-companion) at [fd8beff](https://huggingface.co/datasets/tasksource/sts-companion/tree/fd8beffb788df5f6673bc688e6dcbe3690a3acc6)
400
+ * Size: 14,280 training samples
401
  * Columns: <code>label</code>, <code>sentence1</code>, and <code>sentence2</code>
402
  * Approximate statistics based on the first 1000 samples:
403
+ | | label | sentence1 | sentence2 |
404
+ |:--------|:---------------------------------------------------------------|:----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|
405
+ | type | float | string | string |
406
+ | details | <ul><li>min: 0.0</li><li>mean: 3.13</li><li>max: 5.0</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 18.95 tokens</li><li>max: 91 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 17.55 tokens</li><li>max: 269 tokens</li></ul> |
407
  * Samples:
408
+ | label | sentence1 | sentence2 |
409
+ |:-----------------|:-----------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------|
410
+ | <code>4.2</code> | <code>I am calling BS!!! NYTimes: Morsi Says His Slurs of Jews Were Taken Out of Context</code> | <code>Morsi Says Slurs of Jews Were Taken Out of Context</code> |
411
+ | <code>3.0</code> | <code>The driver of the coach tried to avoid it by swerving hard, but still grazed the right side of the lorry.</code> | <code>The driver of the last to try to avoid it through a sudden move, but he fell short by his right side.</code> |
412
+ | <code>5.0</code> | <code>create a mess or disorder</code> | <code>make a mess of or create disorder in.</code> |
413
  * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
414
  ```json
415
  {
 
423
  #### zero-shot-label-nli
424
 
425
  * Dataset: [zero-shot-label-nli](https://huggingface.co/datasets/tasksource/zero-shot-label-nli) at [ee693db](https://huggingface.co/datasets/tasksource/zero-shot-label-nli/tree/ee693dba923b5d5484aa9232b7357c5e45dd39b8)
426
+ * Size: 1,090,333 training samples
427
  * Columns: <code>label</code>, <code>sentence1</code>, and <code>sentence2</code>
428
  * Approximate statistics based on the first 1000 samples:
429
  | | label | sentence1 | sentence2 |
430
  |:--------|:------------------------------------------------|:------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|
431
  | type | int | string | string |
432
+ | details | <ul><li>0: ~50.70%</li><li>1: ~49.30%</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 68.51 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>min: 7 tokens</li><li>mean: 7.95 tokens</li><li>max: 17 tokens</li></ul> |
433
  * Samples:
434
+ | label | sentence1 | sentence2 |
435
+ |:---------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------|
436
+ | <code>0</code> | <code>Amrozi accused his brother , whom he called " the witness " , of deliberately distorting his evidence .<br>Referring to him as only " the witness " , Amrozi accused his brother of deliberately distorting his evidence .</code> | <code>This example is not_equivalent.</code> |
437
+ | <code>1</code> | <code>Do science and religion conflict with each other?<br>Does science conflict with the Bible?</code> | <code>This example is not_duplicate.</code> |
438
+ | <code>0</code> | <code>do iran and afghanistan speak the same language</code> | <code>This example is False.</code> |
439
  * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
440
  ```json
441
  {
 
728
  ### Training Hyperparameters
729
  #### Non-Default Hyperparameters
730
 
731
+ - `per_device_train_batch_size`: 360
732
+ - `learning_rate`: 8e-05
733
+ - `weight_decay`: 5e-05
734
  - `num_train_epochs`: 1
735
+ - `warmup_ratio`: 0.03
736
  - `fp16`: True
737
  - `gradient_checkpointing`: True
738
  - `torch_compile`: True
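The non-default values listed above can be reproduced when retraining with `SentenceTransformerTrainingArguments` (a sketch only; the output directory name is a placeholder of ours, and this card does not include the original training script):

```python
from sentence_transformers import SentenceTransformerTrainingArguments

# Non-default hyperparameters from this card; everything else keeps
# the library defaults. "output" is a hypothetical directory name.
args = SentenceTransformerTrainingArguments(
    output_dir="output",
    per_device_train_batch_size=360,
    learning_rate=8e-05,
    weight_decay=5e-05,
    num_train_epochs=1,
    warmup_ratio=0.03,
    fp16=True,
    gradient_checkpointing=True,
    torch_compile=True,
)
```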
 
745
  - `do_predict`: False
746
  - `eval_strategy`: no
747
  - `prediction_loss_only`: True
748
+ - `per_device_train_batch_size`: 360
749
  - `per_device_eval_batch_size`: 8
750
  - `per_gpu_train_batch_size`: None
751
  - `per_gpu_eval_batch_size`: None
752
  - `gradient_accumulation_steps`: 1
753
  - `eval_accumulation_steps`: None
754
  - `torch_empty_cache_steps`: None
755
+ - `learning_rate`: 8e-05
756
+ - `weight_decay`: 5e-05
757
  - `adam_beta1`: 0.9
758
  - `adam_beta2`: 0.999
759
  - `adam_epsilon`: 1e-08
 
762
  - `max_steps`: -1
763
  - `lr_scheduler_type`: linear
764
  - `lr_scheduler_kwargs`: {}
765
+ - `warmup_ratio`: 0.03
766
  - `warmup_steps`: 0
767
  - `log_level`: passive
768
  - `log_level_replica`: warning
 
866
  ### Training Logs
867
  | Epoch | Step | Training Loss |
868
  |:------:|:-----:|:-------------:|
869
+ | 0.0251 | 500 | 5.0537 |
870
+ | 0.0501 | 1000 | 3.6206 |
871
+ | 0.0752 | 1500 | 3.249 |
872
+ | 0.1003 | 2000 | 3.5885 |
873
+ | 0.1254 | 2500 | 3.2479 |
874
+ | 0.1504 | 3000 | 3.2033 |
875
+ | 0.1755 | 3500 | 2.7123 |
876
+ | 0.2006 | 4000 | 2.8247 |
877
+ | 0.2257 | 4500 | 2.7694 |
878
+ | 0.2507 | 5000 | 3.0215 |
879
+ | 0.2758 | 5500 | 2.6723 |
880
+ | 0.3009 | 6000 | 2.8297 |
881
+ | 0.3259 | 6500 | 2.4046 |
882
+ | 0.3510 | 7000 | 2.2289 |
883
+ | 0.3761 | 7500 | 2.4628 |
884
+ | 0.4012 | 8000 | 2.4032 |
885
+ | 0.4262 | 8500 | 2.5024 |
886
+ | 0.4513 | 9000 | 2.0948 |
887
+ | 0.4764 | 9500 | 2.4389 |
888
+ | 0.5015 | 10000 | 2.4771 |
889
+ | 0.5265 | 10500 | 2.6465 |
890
+ | 0.5516 | 11000 | 2.5892 |
891
+ | 0.5767 | 11500 | 2.3557 |
892
+ | 0.6017 | 12000 | 2.2359 |
893
+ | 0.6268 | 12500 | 2.5839 |
894
+ | 0.6519 | 13000 | 2.4216 |
895
+ | 0.6770 | 13500 | 2.3211 |
896
+ | 0.7020 | 14000 | 2.1171 |
897
+ | 0.7271 | 14500 | 2.1206 |
898
+ | 0.7522 | 15000 | 2.2557 |
899
+ | 0.7773 | 15500 | 2.2815 |
900
+ | 0.8023 | 16000 | 2.0951 |
901
+ | 0.8274 | 16500 | 2.3415 |
902
+ | 0.8525 | 17000 | 2.2792 |
903
+ | 0.8775 | 17500 | 2.3113 |
904
+ | 0.9026 | 18000 | 2.1932 |
905
+ | 0.9277 | 18500 | 2.1134 |
906
+ | 0.9528 | 19000 | 1.9995 |
907
+ | 0.9778 | 19500 | 1.8916 |
908
 
909
 
910
  ### Framework Versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7658290cf36da3d18ee7ebfc328f9c40bd49d23c22c9bf0cd9cb101c1c526c40
+ oid sha256:3d9729ed5a375cb33fdfe9941bf4032235f8e37c6b27fa88b752ff736b85616b
3
  size 127538496