suhwan3 commited on
Commit
9208a43
·
verified ·
1 Parent(s): e682361

Upload fine-tuned model

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 1024,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,642 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:23003
8
+ - loss:TripletLoss
9
+ base_model: intfloat/multilingual-e5-large-instruct
10
+ widget:
11
+ - source_sentence: The Merlyn.AI SectorSurfer Momentum ETF is designed to dynamically
12
+ shift its investment strategy based on market conditions, tracking an index that
13
+ utilizes an algorithmic Bull/Bear indicator assessing U.S. equity markets for
14
+ advancing trends or elevated decline risk using factors like price-trend, momentum,
15
+ value sentiment, and volatility. In Bull markets, it targets approximately a 70/30
16
+ domestic/foreign aggressive equity allocation by selecting six thematic ETFs (four
17
+ sectors, two geopolitical), while in Bear markets, it seeks safety by choosing
18
+ at least four momentum-leading bond, treasury, and gold safe-harbor ETFs, explicitly
19
+ avoiding inverse and leveraged funds. The index is typically evaluated monthly,
20
+ though the indicator can trigger strategy changes anytime during excessive market
21
+ volatility. Under normal circumstances, at least 80% of the fund's assets are
22
+ invested in the index's component securities; the fund is non-diversified. Please
23
+ be aware this fund is closing, with its last day of trading scheduled for November
24
+ 10, 2023.
25
+ sentences:
26
+ - The BlackRock Future Climate and Sustainable Economy ETF (BECO) is an actively
27
+ managed equity fund focused on the transition to a lower carbon economy and future
28
+ climate themes. It seeks a relatively concentrated, non-diversified portfolio
29
+ of globally-listed companies of any market capitalization, investing across multiple
30
+ subthemes such as sustainable energy, resource efficiency, future transport, sustainable
31
+ nutrition, and biodiversity. The fund utilizes proprietary environmental criteria,
32
+ including carbon metrics, and aims to align with the Paris Climate Agreement goals
33
+ for net-zero emissions by 2050, while excluding certain high-emission industries
34
+ and companies violating the UN Global Compact. It also attempts to achieve a better
35
+ aggregate environmental and ESG score than its benchmark, the MSCI ACWI Multiple
36
+ Industries Select Index. Note that BECO is being delisted, with its last day of
37
+ trading on an exchange scheduled for August 12, 2024.
38
+ - The Direxion Daily Semiconductor Bull 3X Shares (SOXL) seeks daily investment
39
+ results, before fees and expenses, of 300% of the daily performance of the ICE
40
+ Semiconductor Index. To achieve this bullish, leveraged exposure, the fund invests
41
+ at least 80% of its net assets in financial instruments, such as swap agreements,
42
+ securities of the index, and ETFs that track the index. The underlying ICE Semiconductor
43
+ Index is a rules-based, modified float-adjusted market capitalization-weighted
44
+ index that tracks the performance of the thirty largest U.S. listed semiconductor
45
+ companies. As a daily leveraged fund, SOXL rebalances daily, meaning results over
46
+ periods longer than one day can differ significantly from 300% of the index's
47
+ performance due to the effects of compounding; the fund is also non-diversified.
48
+ - The KraneShares Trust ETF seeks investment results corresponding generally to
49
+ the price and yield performance of the Solactive Global Luxury Index. Under normal
50
+ circumstances, the fund invests at least 80% of its net assets in instruments
51
+ in the underlying index or those with similar economic characteristics. This index
52
+ is a modified, free float adjusted market capitalization weighted index designed
53
+ to measure the equity performance of companies from global luxury-related sectors,
54
+ such as travel & leisure, premium ware, and apparel, located in developed markets.
55
+ The index selects the top 25 companies based on criteria including size, trading
56
+ volume, and country of listing, applying a modified weighting approach where the
57
+ top 5 securities receive higher allocations (with the largest capped at 10%) while
58
+ others are capped at 4.5%. The index is rebalanced semi-annually. The fund is
59
+ non-diversified and while targeting US investments, it maintains at least 40%
60
+ of its assets in foreign entities or those with significant business activities
61
+ outside the United States.
62
+ - source_sentence: The Xtrackers MSCI Emerging Markets Climate Selection ETF seeks
63
+ to track an emerging markets index focused on companies meeting specific climate
64
+ criteria. Derived from the MSCI ACWI Select Climate 500 methodology, the underlying
65
+ index selects eligible emerging market stocks using an optimization process designed
66
+ to reduce greenhouse gas emission intensity (targeting 10% revenue-related and
67
+ 7% financing-related reductions) and increase exposure to companies with SBTi-approved
68
+ targets. The strategy also excludes controversial companies and evaluates companies
69
+ based on broader ESG considerations. The fund is non-diversified and invests at
70
+ least 80% of its assets in the component securities of this climate-focused emerging
71
+ markets index.
72
+ sentences:
73
+ - The First Trust Indxx NextG UCITS ETF seeks investment results that generally
74
+ correspond to the price and yield of the Indxx 5G & NextG Thematic Index. This
75
+ tiered-weighted index of global mid- and large-cap equities tracks companies dedicating
76
+ significant resources to the research, development, and application of fifth generation
77
+ (5G) and emerging next generation digital cellular technologies. The fund normally
78
+ invests at least 90% of its net assets in the index's securities, which are primarily
79
+ drawn from themes including 5G infrastructure and hardware (such as data/cell
80
+ tower REITs and equipment manufacturers) and telecommunication service providers
81
+ operating relevant cellular and wireless networks.
82
+ - The iPath S&P MLP ETN tracks an S&P Dow Jones index designed to provide exposure
83
+ to leading partnerships listed on major U.S. exchanges. Comprising master limited
84
+ partnerships (MLPs) and similar publicly traded limited liability companies, these
85
+ constituents are primarily classified within the GICS Energy Sector and GICS Gas
86
+ Utilities Industry.
87
+ - The First Trust NASDAQ ABA Community Bank Index Fund (QABA) seeks investment results
88
+ corresponding generally to the NASDAQ OMX® ABA Community Bank TM Index, normally
89
+ investing at least 90% of its net assets in the index's securities. The index
90
+ tracks NASDAQ-listed US banks and thrifts of small, mid, and large capitalization,
91
+ designed to capture the community banking industry. Uniquely, it deliberately
92
+ excludes the 50 largest banks by asset size, banks with significant international
93
+ operations, and those specializing in credit cards, specifically targeting true
94
+ community banks and avoiding larger "mega-money centers." The index is market-cap-weighted
95
+ and undergoes regular rebalancing and reconstitution, subject to certain issuer
96
+ weight caps.
97
+ - source_sentence: The VanEck Morningstar Wide Moat ETF (MOAT) seeks to replicate
98
+ the performance of the Morningstar® Wide Moat Focus IndexSM by investing at least
99
+ 80% of its assets in the index's securities. The fund targets US companies that
100
+ Morningstar identifies as having sustainable competitive advantages ("wide moat
101
+ companies") based on a proprietary methodology considering quantitative and qualitative
102
+ factors. Specifically, the index focuses on companies determined to have the highest
103
+ fair value among these wide moat firms. MOAT holds a concentrated, equal-weighted
104
+ portfolio, which typically involves around 40 names but can hold more, featuring
105
+ a staggered rebalance schedule and potential sector biases. The fund is non-diversified
106
+ and employs caps on turnover and sector exposure, resulting in a strategy that
107
+ can significantly diverge from broader market coverage despite its focus on established
108
+ companies with competitive advantages.
109
+ sentences:
110
+ - The Fidelity MSCI Industrials Index ETF (FIDU) aims to match the performance of
111
+ the MSCI USA IMI Industrials 25/25 Index, which represents the broad U.S. industrial
112
+ sector using a market-cap-weighted approach with a 25/25 capping methodology.
113
+ The fund, launched in October 2013, provides plain-vanilla exposure and invests
114
+ at least 80% of its assets in securities found within this index. It uses a representative
115
+ sampling strategy rather than replicating the entire index, and the underlying
116
+ index is rebalanced quarterly.
117
+ - The KraneShares Electric Vehicles and Future Mobility Index ETF (KARS) seeks to
118
+ track the price and yield performance of the Bloomberg Electric Vehicles Index
119
+ by investing at least 80% of its net assets in corresponding instruments or those
120
+ with similar economic characteristics. The underlying index is designed to measure
121
+ the equity market performance of globally-listed companies significantly involved
122
+ in the production of electric vehicles, components, or other initiatives enhancing
123
+ future mobility, including areas like energy storage, autonomous navigation technology,
124
+ lithium and copper mining, and hydrogen fuel cells. KARS holds a concentrated
125
+ portfolio, typically around 32 companies, weighted by market capitalization subject
126
+ to specific position caps, and is reconstituted and rebalanced quarterly.
127
+ - The iPath S&P MLP ETN tracks an S&P Dow Jones index designed to provide exposure
128
+ to leading partnerships listed on major U.S. exchanges. Comprising master limited
129
+ partnerships (MLPs) and similar publicly traded limited liability companies, these
130
+ constituents are primarily classified within the GICS Energy Sector and GICS Gas
131
+ Utilities Industry.
132
+ - source_sentence: The Global X Clean Water ETF (AQWA) seeks to provide exposure to
133
+ the global water industry by tracking the Solactive Global Clean Water Industry
134
+ Index. The fund invests at least 80% of its assets in securities of this index,
135
+ which targets companies deriving a significant portion (at least 50%) of their
136
+ revenue from water infrastructure, equipment, and services, including treatment,
137
+ purification, conservation, and management. The index selection process uses proprietary
138
+ technology like NLP to identify eligible firms, incorporates minimum ESG standards
139
+ based on UN Global Compact principles, and includes the 40 highest-ranking companies,
140
+ weighted by market capitalization with specific caps. Reconstituted and rebalanced
141
+ semi-annually, the fund is considered non-diversified.
142
+ sentences:
143
+ - The First Trust Nasdaq Transportation ETF aims to track the Nasdaq US Smart Transportation
144
+ TM Index, investing at least 90% of its net assets in the index's securities.
145
+ This non-diversified fund provides exposure to a concentrated portfolio of approximately
146
+ 30 highly liquid U.S. transportation companies across various segments such as
147
+ delivery, shipping, marine, railroads, trucking, airports, airlines, bridges,
148
+ tunnels, and automobiles. The index selects companies based on liquidity and then
149
+ ranks and weights them according to factors reflecting growth (price returns),
150
+ value (cash flow-to-price), and low volatility, ensuring no single constituent
151
+ exceeds 8%. The index undergoes annual reconstitution and quarterly rebalancing.
152
+ - The Direxion Daily Healthcare Bull 3X Shares (CURE) is an ETF that seeks daily
153
+ investment results, before fees and expenses, of 300% (3X) of the daily performance
154
+ of the Health Care Select Sector Index. It invests at least 80% of its net assets
155
+ in financial instruments designed to provide this 3X daily leveraged exposure.
156
+ The underlying index tracks US listed healthcare companies, including pharmaceuticals,
157
+ health care equipment and supplies, providers and services, biotechnology, life
158
+ sciences tools, and health care technology, covering major large-cap names. CURE
159
+ is non-diversified and intended strictly as a short-term tactical instrument,
160
+ as it delivers its stated 3X exposure only for a single day, and returns over
161
+ longer periods can significantly differ from three times the index's performance.
162
+ - The BlackRock Future Climate and Sustainable Economy ETF (BECO) is an actively
163
+ managed equity fund focused on the transition to a lower carbon economy and future
164
+ climate themes. It seeks a relatively concentrated, non-diversified portfolio
165
+ of globally-listed companies of any market capitalization, investing across multiple
166
+ subthemes such as sustainable energy, resource efficiency, future transport, sustainable
167
+ nutrition, and biodiversity. The fund utilizes proprietary environmental criteria,
168
+ including carbon metrics, and aims to align with the Paris Climate Agreement goals
169
+ for net-zero emissions by 2050, while excluding certain high-emission industries
170
+ and companies violating the UN Global Compact. It also attempts to achieve a better
171
+ aggregate environmental and ESG score than its benchmark, the MSCI ACWI Multiple
172
+ Industries Select Index. Note that BECO is being delisted, with its last day of
173
+ trading on an exchange scheduled for August 12, 2024.
174
+ - source_sentence: The Horizon Kinetics Medical ETF (MEDX) is an actively-managed,
175
+ non-diversified fund aiming for long-term capital growth by investing primarily
176
+ in global companies (U.S. and foreign) within the medical research, pharmaceuticals,
177
+ medical technology, and related industries. The fund typically focuses on companies
178
+ generating at least 50% of their revenue from these areas and may include companies
179
+ of any market capitalization, with an emphasis on those involved in cancer research
180
+ and treatment. Under normal circumstances, at least 80% of assets are invested
181
+ in equity securities, convertibles, and warrants of such companies. Portfolio
182
+ selection and weighting are based on the adviser's evaluation and discretion.
183
+ The fund may also temporarily invest up to 100% in US short-term debt or invest
184
+ in non-convertible high-yield bonds.
185
+ sentences:
186
+ - The Fidelity MSCI Health Care Index ETF (FHLC) seeks to track the performance
187
+ of the MSCI USA IMI Health Care 25/50 Index, which represents the broad U.S. health
188
+ care sector. The ETF invests at least 80% of its assets in securities included
189
+ in this market-cap-weighted index, which captures large, mid, and small-cap companies
190
+ across over 10 subsectors. Employing a representative sampling strategy, the fund
191
+ aims to correspond to the index's performance. The index incorporates a 25/50
192
+ capping methodology, is rebalanced quarterly, and its broad reach offers diversification
193
+ across cap sizes and subsectors, potentially reducing concentration in dominant
194
+ large pharma names and increasing exposure to areas like drug retailers and insurance.
195
+ The fund is classified as non-diversified.
196
+ - The SPDR S&P Oil & Gas Equipment & Services ETF (XES) seeks investment results
197
+ corresponding generally to the total return performance of the S&P Oil & Gas Equipment
198
+ & Services Select Industry Index. This index represents companies in the oil and
199
+ gas equipment and services segment of the broad U.S. S&P Total Market Index (S&P
200
+ TMI), including those involved in activities like wildcatting, drilling hardware,
201
+ and related services. The index utilizes an equal-weighting methodology for its
202
+ constituent companies, which are selected based on market capitalization and liquidity
203
+ requirements and undergo quarterly rebalancing. The fund itself employs a sampling
204
+ strategy, aiming to invest at least 80% of its total assets in the securities
205
+ that comprise its benchmark index.
206
+ - The VanEck Biotech ETF (BBH) seeks to replicate the performance of the MVIS® US
207
+ Listed Biotech 25 Index, which provides exposure to approximately 25 of the largest
208
+ or leading U.S.-listed companies in the biotechnology industry. The fund normally
209
+ invests at least 80% of its assets in securities comprising this market-cap-weighted
210
+ index. The underlying index includes common stocks and depositary receipts of
211
+ firms involved in the research, development, production, marketing, and sale of
212
+ drugs based on genetic analysis and diagnostic equipment. While focusing on U.S.-listed
213
+ companies, it may include foreign firms listed domestically, and medium-capitalization
214
+ companies can be included. Reflecting the index's concentration, the fund is non-diversified
215
+ and may have a top-heavy portfolio. The index is reviewed semi-annually.
216
+ datasets:
217
+ - hobbang/stage1-triplet-dataset-selected
218
+ pipeline_tag: sentence-similarity
219
+ library_name: sentence-transformers
220
+ ---
221
+
222
+ # SentenceTransformer based on intfloat/multilingual-e5-large-instruct
223
+
224
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [intfloat/multilingual-e5-large-instruct](https://huggingface.co/intfloat/multilingual-e5-large-instruct) on the [stage1-triplet-dataset-selected](https://huggingface.co/datasets/hobbang/stage1-triplet-dataset-selected) dataset. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
225
+
226
+ ## Model Details
227
+
228
+ ### Model Description
229
+ - **Model Type:** Sentence Transformer
230
+ - **Base model:** [intfloat/multilingual-e5-large-instruct](https://huggingface.co/intfloat/multilingual-e5-large-instruct) <!-- at revision 84344a23ee1820ac951bc365f1e91d094a911763 -->
231
+ - **Maximum Sequence Length:** 512 tokens
232
+ - **Output Dimensionality:** 1024 dimensions
233
+ - **Similarity Function:** Cosine Similarity
234
+ - **Training Dataset:**
235
+ - [stage1-triplet-dataset-selected](https://huggingface.co/datasets/hobbang/stage1-triplet-dataset-selected)
236
+ <!-- - **Language:** Unknown -->
237
+ <!-- - **License:** Unknown -->
238
+
239
+ ### Model Sources
240
+
241
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
242
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
243
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
244
+
245
+ ### Full Model Architecture
246
+
247
+ ```
248
+ SentenceTransformer(
249
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: XLMRobertaModel
250
+ (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
251
+ (2): Normalize()
252
+ )
253
+ ```
254
+
255
+ ## Usage
256
+
257
+ ### Direct Usage (Sentence Transformers)
258
+
259
+ First install the Sentence Transformers library:
260
+
261
+ ```bash
262
+ pip install -U sentence-transformers
263
+ ```
264
+
265
+ Then you can load this model and run inference.
266
+ ```python
267
+ from sentence_transformers import SentenceTransformer
268
+
269
+ # Download from the 🤗 Hub
270
+ model = SentenceTransformer("sentence_transformers_model_id")
271
+ # Run inference
272
+ sentences = [
273
+ "The Horizon Kinetics Medical ETF (MEDX) is an actively-managed, non-diversified fund aiming for long-term capital growth by investing primarily in global companies (U.S. and foreign) within the medical research, pharmaceuticals, medical technology, and related industries. The fund typically focuses on companies generating at least 50% of their revenue from these areas and may include companies of any market capitalization, with an emphasis on those involved in cancer research and treatment. Under normal circumstances, at least 80% of assets are invested in equity securities, convertibles, and warrants of such companies. Portfolio selection and weighting are based on the adviser's evaluation and discretion. The fund may also temporarily invest up to 100% in US short-term debt or invest in non-convertible high-yield bonds.",
274
+ "The VanEck Biotech ETF (BBH) seeks to replicate the performance of the MVIS® US Listed Biotech 25 Index, which provides exposure to approximately 25 of the largest or leading U.S.-listed companies in the biotechnology industry. The fund normally invests at least 80% of its assets in securities comprising this market-cap-weighted index. The underlying index includes common stocks and depositary receipts of firms involved in the research, development, production, marketing, and sale of drugs based on genetic analysis and diagnostic equipment. While focusing on U.S.-listed companies, it may include foreign firms listed domestically, and medium-capitalization companies can be included. Reflecting the index's concentration, the fund is non-diversified and may have a top-heavy portfolio. The index is reviewed semi-annually.",
275
+ 'The SPDR S&P Oil & Gas Equipment & Services ETF (XES) seeks investment results corresponding generally to the total return performance of the S&P Oil & Gas Equipment & Services Select Industry Index. This index represents companies in the oil and gas equipment and services segment of the broad U.S. S&P Total Market Index (S&P TMI), including those involved in activities like wildcatting, drilling hardware, and related services. The index utilizes an equal-weighting methodology for its constituent companies, which are selected based on market capitalization and liquidity requirements and undergo quarterly rebalancing. The fund itself employs a sampling strategy, aiming to invest at least 80% of its total assets in the securities that comprise its benchmark index.',
276
+ ]
277
+ embeddings = model.encode(sentences)
278
+ print(embeddings.shape)
279
+ # [3, 1024]
280
+
281
+ # Get the similarity scores for the embeddings
282
+ similarities = model.similarity(embeddings, embeddings)
283
+ print(similarities.shape)
284
+ # [3, 3]
285
+ ```
286
+
287
+ <!--
288
+ ### Direct Usage (Transformers)
289
+
290
+ <details><summary>Click to see the direct usage in Transformers</summary>
291
+
292
+ </details>
293
+ -->
294
+
295
+ <!--
296
+ ### Downstream Usage (Sentence Transformers)
297
+
298
+ You can finetune this model on your own dataset.
299
+
300
+ <details><summary>Click to expand</summary>
301
+
302
+ </details>
303
+ -->
304
+
305
+ <!--
306
+ ### Out-of-Scope Use
307
+
308
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
309
+ -->
310
+
311
+ <!--
312
+ ## Bias, Risks and Limitations
313
+
314
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
315
+ -->
316
+
317
+ <!--
318
+ ### Recommendations
319
+
320
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
321
+ -->
322
+
323
+ ## Training Details
324
+
325
+ ### Training Dataset
326
+
327
+ #### stage1-triplet-dataset-selected
328
+
329
+ * Dataset: [stage1-triplet-dataset-selected](https://huggingface.co/datasets/hobbang/stage1-triplet-dataset-selected) at [18e0423](https://huggingface.co/datasets/hobbang/stage1-triplet-dataset-selected/tree/18e0423399bc6678e814264ca8c8acdf02dfce97)
330
+ * Size: 23,003 training samples
331
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
332
+ * Approximate statistics based on the first 1000 samples:
333
+ | | anchor | positive | negative |
334
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
335
+ | type | string | string | string |
336
+ | details | <ul><li>min: 94 tokens</li><li>mean: 170.87 tokens</li><li>max: 224 tokens</li></ul> | <ul><li>min: 29 tokens</li><li>mean: 174.15 tokens</li><li>max: 261 tokens</li></ul> | <ul><li>min: 72 tokens</li><li>mean: 174.89 tokens</li><li>max: 261 tokens</li></ul> |
337
+ * Samples:
338
+ | anchor | positive | negative |
339
+ |:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
340
+ | <code>The Invesco Financial Preferred ETF (PGF) seeks to track the ICE Exchange-Listed Fixed Rate Financial Preferred Securities Index, primarily by investing at least 90% of its total assets in the securities comprising the index. The underlying index is market capitalization weighted and designed to track the performance of exchange-listed, fixed rate, U.S. dollar denominated preferred securities, including functionally equivalent instruments, issued by U.S. financial companies. PGF provides a concentrated portfolio exclusively focused on financial-sector preferred securities and is considered non-diversified, holding both investment- and non-investment-grade securities within this focus.</code> | <code>The FlexShares ESG & Climate Investment Grade Corporate Core Index Fund (FEIG) is a passively managed ETF designed to provide broad-market, core exposure to USD-denominated investment-grade corporate bonds. It seeks to track the performance of the Northern Trust ESG & Climate Investment Grade U.S. Corporate Core IndexSM, which selects bonds from a universe of USD-denominated, investment-grade corporate debt with maturities of at least one year. The index employs an optimization process to increase the aggregate ESG score and reduce aggregate climate-related risk among constituent companies, involving ranking firms on material ESG metrics, governance, and carbon risks, while excluding controversial companies and international initiative violators. Weights are also optimized to minimize systematic risk, and the index is rebalanced monthly. Under normal circumstances, the fund invests at least 80% of its assets in the index's securities.</code> | <code>The Viridi Bitcoin Miners ETF primarily invests in companies engaged in Bitcoin mining, aiming to allocate at least 80% of its net assets, plus borrowings for investment purposes, to securities of such companies under normal circumstances. The fund focuses on U.S. and non-U.S. equity securities in developed markets, which may include investments via depositary receipts. It also specifically targets common stock from newly listed IPOs, shares derived from SPAC IPOs, and securities resulting from reverse mergers. This ETF is non-diversified.</code> |
341
+ | <code>The Invesco Financial Preferred ETF (PGF) seeks to track the ICE Exchange-Listed Fixed Rate Financial Preferred Securities Index, primarily by investing at least 90% of its total assets in the securities comprising the index. The underlying index is market capitalization weighted and designed to track the performance of exchange-listed, fixed rate, U.S. dollar denominated preferred securities, including functionally equivalent instruments, issued by U.S. financial companies. PGF provides a concentrated portfolio exclusively focused on financial-sector preferred securities and is considered non-diversified, holding both investment- and non-investment-grade securities within this focus.</code> | <code>The Fidelity Sustainable High Yield ETF (FSYD) is an actively managed fund primarily seeking high income, and potentially capital growth, by investing at least 80% of its assets in global high-yield (below investment grade) debt securities. The fund focuses on issuers demonstrating proven or improving sustainability practices based on an evaluation of their individual environmental, social, and governance (ESG) profiles using a proprietary rating process. Its comprehensive selection approach also incorporates a multi-factor quantitative screening model and fundamental analysis of issuers, aiming to identify value and quality within the high-yield universe.</code> | <code>The ETFMG Prime Mobile Payments ETF seeks to track the performance of the Nasdaq CTA Global Digital Payments Index, which identifies companies engaged in the global digital payments industry across categories like card networks, infrastructure, software, processors, and solutions. Under normal circumstances, the fund invests at least 80% of its net assets in common stocks (including ADRs and GDRs) of these Mobile Payments Companies. It typically holds a narrow portfolio expected to contain up to 50 companies, weighted using a theme-adjusted market capitalization scheme, and is considered non-diversified.</code> |
342
+ | <code>The Invesco Financial Preferred ETF (PGF) seeks to track the ICE Exchange-Listed Fixed Rate Financial Preferred Securities Index, primarily by investing at least 90% of its total assets in the securities comprising the index. The underlying index is market capitalization weighted and designed to track the performance of exchange-listed, fixed rate, U.S. dollar denominated preferred securities, including functionally equivalent instruments, issued by U.S. financial companies. PGF provides a concentrated portfolio exclusively focused on financial-sector preferred securities and is considered non-diversified, holding both investment- and non-investment-grade securities within this focus.</code> | <code>The First Trust TCW Securitized Plus ETF (DEED) is an actively-managed fund focused on U.S. securitized debt securities, aiming to maximize long-term total return and outperform the Bloomberg US Mortgage-Backed Securities Index. Under normal market conditions, the fund allocates at least 80% of its net assets to securitized debt, including asset-backed securities, residential and commercial mortgage-backed securities, and collateralized loan obligations (CLOs). At least 50% of total assets are invested in securities issued or guaranteed by the U.S. government, its agencies, or government-sponsored entities, while the balance may include non-government and privately-issued securitized debt. The fund invests across various maturities and credit qualities (junk and investment-grade), using proprietary research to identify undervalued securities, and may utilize OTC derivatives for up to 25% of the portfolio.</code> | <code>The First Trust Growth Strength UCITS ETF aims to track the price and yield of The Growth Strength Index. Passively managed, the fund normally invests at least 80% of its assets in the index's common stocks and REIT components. The index selects 50 equal-weighted, well-capitalized, large-cap US companies from the top 500 US securities by market capitalization based on fundamental criteria such as return on equity, long-term debt levels, liquidity, positive shareholder equity, and a composite ranking based on 3-year revenue and cash flow growth. The resulting portfolio is non-diversified and rebalanced quarterly.</code> |
343
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
344
+ ```json
345
+ {
346
+ "distance_metric": "TripletDistanceMetric.COSINE",
347
+ "triplet_margin": 0.05
348
+ }
349
+ ```
350
+
351
+ ### Evaluation Dataset
352
+
353
+ #### stage1-triplet-dataset-selected
354
+
355
+ * Dataset: [stage1-triplet-dataset-selected](https://huggingface.co/datasets/hobbang/stage1-triplet-dataset-selected) at [18e0423](https://huggingface.co/datasets/hobbang/stage1-triplet-dataset-selected/tree/18e0423399bc6678e814264ca8c8acdf02dfce97)
356
+ * Size: 388 evaluation samples
357
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
358
+ * Approximate statistics based on the first 388 samples:
359
+ | | anchor | positive | negative |
360
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
361
+ | type | string | string | string |
362
+ | details | <ul><li>min: 85 tokens</li><li>mean: 176.98 tokens</li><li>max: 271 tokens</li></ul> | <ul><li>min: 85 tokens</li><li>mean: 176.83 tokens</li><li>max: 271 tokens</li></ul> | <ul><li>min: 85 tokens</li><li>mean: 175.41 tokens</li><li>max: 271 tokens</li></ul> |
363
+ * Samples:
364
+ | anchor | positive | negative |
365
+ |:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
366
+ | <code>The Global X S&P 500 Risk Managed Income ETF seeks to track the Cboe S&P 500 Risk Managed Income Index by investing at least 80% of its assets in index securities. The index's strategy involves holding the underlying stocks of the S&P 500 Index while applying an options collar, specifically selling at-the-money covered call options and buying monthly 5% out-of-the-money put options corresponding to the portfolio's value. This approach aims to generate income, ideally resulting in a net credit from the options premiums, and provide risk management, though selling at-the-money calls inherently caps the fund's potential for upside participation.</code> | <code>The U.S. Global Technology and Aerospace & Defense ETF is an actively managed ETF seeking capital appreciation by investing in equity securities of companies expected to benefit from national defense efforts. These efforts include technological innovations and the development of products and services related to aerospace, physical, and cybersecurity defense, often in preparation for or in response to domestic, regional, or global conflicts. The fund is non-diversified.</code> | <code>The KraneShares Global Carbon Offset Strategy ETF (KSET) was the first US-listed ETF providing exposure to the global voluntary carbon market. It achieved this by investing primarily in liquid carbon offset credit futures, including CME-traded Global Emissions Offsets (GEOs) and Nature-Based Global Emission Offsets (N-GEOs), which are designed to help businesses meet greenhouse gas reduction goals. Tracking an index that weighted eligible futures based on liquidity, the fund sought exposure to the same carbon offset credit futures, typically those maturing within two years. The ETF was considered non-diversified and utilized a Cayman Island subsidiary. However, the fund was delisted, with its last day of trading on an exchange being March 14, 2024.</code> |
367
+ | <code>The Global X S&P 500 Risk Managed Income ETF seeks to track the Cboe S&P 500 Risk Managed Income Index by investing at least 80% of its assets in index securities. The index's strategy involves holding the underlying stocks of the S&P 500 Index while applying an options collar, specifically selling at-the-money covered call options and buying monthly 5% out-of-the-money put options corresponding to the portfolio's value. This approach aims to generate income, ideally resulting in a net credit from the options premiums, and provide risk management, though selling at-the-money calls inherently caps the fund's potential for upside participation.</code> | <code>The JPMorgan Social Advancement ETF (UPWD) is an actively managed, non-diversified fund that seeks to invest globally in companies facilitating social and economic advancements and empowerment across the socioeconomic spectrum. Primarily holding common stocks, depositary receipts, and REITs, the fund targets themes including essential amenities, affordable housing, healthcare, education, attainable financing, and the digital ecosystem, potentially investing in companies of various sizes, including small-caps, across U.S., foreign, and emerging markets with possible concentration in specific sectors. Security selection follows a proprietary three-step process involving exclusions, thematic ranking using a ThemeBot, and a sustainable investment inclusion process combined with fundamental research. Please note that this security is being delisted, with its last day of trading scheduled for December 15, 2023.</code> | <code>The Direxion Daily Gold Miners Index Bull 2X Shares (NUGT) is designed to provide 200% of the daily performance of the NYSE Arca Gold Miners Index, before fees and expenses. This market-cap-weighted index comprises publicly traded global companies, primarily involved in gold mining and to a lesser extent silver mining, operating in both developed and emerging markets. NUGT achieves its objective by investing at least 80% of its net assets in financial instruments providing 2X daily leveraged exposure to the index. As a leveraged fund intended for daily results, NUGT is designed for short-term trading, typically held for only one trading day, and holding it for longer periods can lead to performance results that differ significantly from the stated daily target due to the effects of compounding. The fund is also non-diversified.</code> |
368
+ | <code>The Global X S&P 500 Risk Managed Income ETF seeks to track the Cboe S&P 500 Risk Managed Income Index by investing at least 80% of its assets in index securities. The index's strategy involves holding the underlying stocks of the S&P 500 Index while applying an options collar, specifically selling at-the-money covered call options and buying monthly 5% out-of-the-money put options corresponding to the portfolio's value. This approach aims to generate income, ideally resulting in a net credit from the options premiums, and provide risk management, though selling at-the-money calls inherently caps the fund's potential for upside participation.</code> | <code>The Xtrackers MSCI Emerging Markets ESG Leaders Equity ETF tracks an index of large- and mid-cap emerging market stocks that emphasize strong environmental, social, and governance (ESG) characteristics. The index first excludes companies involved in specific controversial industries. From the remaining universe, it ranks stocks based on MSCI ESG scores, including a controversy component, to identify and select the highest-ranking ESG leaders, effectively screening out ESG laggards. To maintain market-like country and sector weights, the index selects the top ESG-scoring stocks within each sector until a specified market capitalization threshold is reached. Selected stocks are then weighted by market capitalization within their respective sectors. The fund typically invests over 80% of its assets in the securities of this underlying index.</code> | <code>The BlackRock Future Climate and Sustainable Economy ETF (BECO) is an actively managed equity fund focused on the transition to a lower carbon economy and future climate themes. It seeks a relatively concentrated, non-diversified portfolio of globally-listed companies of any market capitalization, investing across multiple subthemes such as sustainable energy, resource efficiency, future transport, sustainable nutrition, and biodiversity. The fund utilizes proprietary environmental criteria, including carbon metrics, and aims to align with the Paris Climate Agreement goals for net-zero emissions by 2050, while excluding certain high-emission industries and companies violating the UN Global Compact. It also attempts to achieve a better aggregate environmental and ESG score than its benchmark, the MSCI ACWI Multiple Industries Select Index. Note that BECO is being delisted, with its last day of trading on an exchange scheduled for August 12, 2024.</code> |
369
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
370
+ ```json
371
+ {
372
+ "distance_metric": "TripletDistanceMetric.COSINE",
373
+ "triplet_margin": 0.05
374
+ }
375
+ ```
376
+
377
+ ### Training Hyperparameters
378
+ #### Non-Default Hyperparameters
379
+
380
+ - `eval_strategy`: steps
381
+ - `per_device_train_batch_size`: 32
382
+ - `per_device_eval_batch_size`: 16
383
+ - `learning_rate`: 2e-06
384
+ - `num_train_epochs`: 1
385
+ - `warmup_ratio`: 0.1
386
+ - `bf16`: True
387
+ - `dataloader_drop_last`: True
388
+ - `load_best_model_at_end`: True
389
+ - `batch_sampler`: no_duplicates
390
+
391
+ #### All Hyperparameters
392
+ <details><summary>Click to expand</summary>
393
+
394
+ - `overwrite_output_dir`: False
395
+ - `do_predict`: False
396
+ - `eval_strategy`: steps
397
+ - `prediction_loss_only`: True
398
+ - `per_device_train_batch_size`: 32
399
+ - `per_device_eval_batch_size`: 16
400
+ - `per_gpu_train_batch_size`: None
401
+ - `per_gpu_eval_batch_size`: None
402
+ - `gradient_accumulation_steps`: 1
403
+ - `eval_accumulation_steps`: None
404
+ - `torch_empty_cache_steps`: None
405
+ - `learning_rate`: 2e-06
406
+ - `weight_decay`: 0.0
407
+ - `adam_beta1`: 0.9
408
+ - `adam_beta2`: 0.999
409
+ - `adam_epsilon`: 1e-08
410
+ - `max_grad_norm`: 1.0
411
+ - `num_train_epochs`: 1
412
+ - `max_steps`: -1
413
+ - `lr_scheduler_type`: linear
414
+ - `lr_scheduler_kwargs`: {}
415
+ - `warmup_ratio`: 0.1
416
+ - `warmup_steps`: 0
417
+ - `log_level`: passive
418
+ - `log_level_replica`: warning
419
+ - `log_on_each_node`: True
420
+ - `logging_nan_inf_filter`: True
421
+ - `save_safetensors`: True
422
+ - `save_on_each_node`: False
423
+ - `save_only_model`: False
424
+ - `restore_callback_states_from_checkpoint`: False
425
+ - `no_cuda`: False
426
+ - `use_cpu`: False
427
+ - `use_mps_device`: False
428
+ - `seed`: 42
429
+ - `data_seed`: None
430
+ - `jit_mode_eval`: False
431
+ - `use_ipex`: False
432
+ - `bf16`: True
433
+ - `fp16`: False
434
+ - `fp16_opt_level`: O1
435
+ - `half_precision_backend`: auto
436
+ - `bf16_full_eval`: False
437
+ - `fp16_full_eval`: False
438
+ - `tf32`: None
439
+ - `local_rank`: 0
440
+ - `ddp_backend`: None
441
+ - `tpu_num_cores`: None
442
+ - `tpu_metrics_debug`: False
443
+ - `debug`: []
444
+ - `dataloader_drop_last`: True
445
+ - `dataloader_num_workers`: 0
446
+ - `dataloader_prefetch_factor`: None
447
+ - `past_index`: -1
448
+ - `disable_tqdm`: False
449
+ - `remove_unused_columns`: True
450
+ - `label_names`: None
451
+ - `load_best_model_at_end`: True
452
+ - `ignore_data_skip`: False
453
+ - `fsdp`: []
454
+ - `fsdp_min_num_params`: 0
455
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
456
+ - `tp_size`: 0
457
+ - `fsdp_transformer_layer_cls_to_wrap`: None
458
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
459
+ - `deepspeed`: None
460
+ - `label_smoothing_factor`: 0.0
461
+ - `optim`: adamw_torch
462
+ - `optim_args`: None
463
+ - `adafactor`: False
464
+ - `group_by_length`: False
465
+ - `length_column_name`: length
466
+ - `ddp_find_unused_parameters`: None
467
+ - `ddp_bucket_cap_mb`: None
468
+ - `ddp_broadcast_buffers`: False
469
+ - `dataloader_pin_memory`: True
470
+ - `dataloader_persistent_workers`: False
471
+ - `skip_memory_metrics`: True
472
+ - `use_legacy_prediction_loop`: False
473
+ - `push_to_hub`: False
474
+ - `resume_from_checkpoint`: None
475
+ - `hub_model_id`: None
476
+ - `hub_strategy`: every_save
477
+ - `hub_private_repo`: None
478
+ - `hub_always_push`: False
479
+ - `gradient_checkpointing`: False
480
+ - `gradient_checkpointing_kwargs`: None
481
+ - `include_inputs_for_metrics`: False
482
+ - `include_for_metrics`: []
483
+ - `eval_do_concat_batches`: True
484
+ - `fp16_backend`: auto
485
+ - `push_to_hub_model_id`: None
486
+ - `push_to_hub_organization`: None
487
+ - `mp_parameters`:
488
+ - `auto_find_batch_size`: False
489
+ - `full_determinism`: False
490
+ - `torchdynamo`: None
491
+ - `ray_scope`: last
492
+ - `ddp_timeout`: 1800
493
+ - `torch_compile`: False
494
+ - `torch_compile_backend`: None
495
+ - `torch_compile_mode`: None
496
+ - `include_tokens_per_second`: False
497
+ - `include_num_input_tokens_seen`: False
498
+ - `neftune_noise_alpha`: None
499
+ - `optim_target_modules`: None
500
+ - `batch_eval_metrics`: False
501
+ - `eval_on_start`: False
502
+ - `use_liger_kernel`: False
503
+ - `eval_use_gather_object`: False
504
+ - `average_tokens_across_devices`: False
505
+ - `prompts`: None
506
+ - `batch_sampler`: no_duplicates
507
+ - `multi_dataset_batch_sampler`: proportional
508
+
509
+ </details>
510
+
511
+ ### Training Logs
512
+ | Epoch | Step | Training Loss | Validation Loss |
513
+ |:----------:|:-------:|:-------------:|:---------------:|
514
+ | 0.0139 | 10 | 0.0367 | - |
515
+ | 0.0279 | 20 | 0.0378 | - |
516
+ | 0.0418 | 30 | 0.0346 | - |
517
+ | 0.0557 | 40 | 0.0337 | - |
518
+ | 0.0696 | 50 | 0.0328 | - |
519
+ | 0.0836 | 60 | 0.0291 | - |
520
+ | 0.0975 | 70 | 0.0257 | - |
521
+ | 0.1114 | 80 | 0.0206 | - |
522
+ | 0.1253 | 90 | 0.0201 | - |
523
+ | 0.1393 | 100 | 0.0208 | 0.0132 |
524
+ | 0.1532 | 110 | 0.0167 | - |
525
+ | 0.1671 | 120 | 0.0167 | - |
526
+ | 0.1811 | 130 | 0.0156 | - |
527
+ | 0.1950 | 140 | 0.0153 | - |
528
+ | 0.2089 | 150 | 0.0125 | - |
529
+ | 0.2228 | 160 | 0.0141 | - |
530
+ | 0.2368 | 170 | 0.0153 | - |
531
+ | 0.2507 | 180 | 0.0142 | - |
532
+ | 0.2646 | 190 | 0.0095 | - |
533
+ | 0.2786 | 200 | 0.0144 | 0.0111 |
534
+ | 0.2925 | 210 | 0.0132 | - |
535
+ | 0.3064 | 220 | 0.0107 | - |
536
+ | 0.3203 | 230 | 0.0116 | - |
537
+ | 0.3343 | 240 | 0.0134 | - |
538
+ | 0.3482 | 250 | 0.0112 | - |
539
+ | 0.3621 | 260 | 0.0115 | - |
540
+ | 0.3760 | 270 | 0.0124 | - |
541
+ | 0.3900 | 280 | 0.0126 | - |
542
+ | 0.4039 | 290 | 0.0105 | - |
543
+ | 0.4178 | 300 | 0.0111 | 0.0109 |
544
+ | 0.4318 | 310 | 0.0136 | - |
545
+ | 0.4457 | 320 | 0.0123 | - |
546
+ | 0.4596 | 330 | 0.0113 | - |
547
+ | 0.4735 | 340 | 0.0125 | - |
548
+ | 0.4875 | 350 | 0.0082 | - |
549
+ | 0.5014 | 360 | 0.0102 | - |
550
+ | 0.5153 | 370 | 0.0081 | - |
551
+ | 0.5292 | 380 | 0.0115 | - |
552
+ | 0.5432 | 390 | 0.0107 | - |
553
+ | 0.5571 | 400 | 0.012 | 0.0106 |
554
+ | 0.5710 | 410 | 0.0094 | - |
555
+ | 0.5850 | 420 | 0.0099 | - |
556
+ | 0.5989 | 430 | 0.0105 | - |
557
+ | 0.6128 | 440 | 0.0101 | - |
558
+ | 0.6267 | 450 | 0.0099 | - |
559
+ | 0.6407 | 460 | 0.0106 | - |
560
+ | 0.6546 | 470 | 0.0099 | - |
561
+ | 0.6685 | 480 | 0.0108 | - |
562
+ | 0.6825 | 490 | 0.01 | - |
563
+ | **0.6964** | **500** | **0.0084** | **0.0102** |
564
+ | 0.7103 | 510 | 0.0092 | - |
565
+ | 0.7242 | 520 | 0.0084 | - |
566
+ | 0.7382 | 530 | 0.0077 | - |
567
+ | 0.7521 | 540 | 0.0096 | - |
568
+ | 0.7660 | 550 | 0.0099 | - |
569
+ | 0.7799 | 560 | 0.0103 | - |
570
+ | 0.7939 | 570 | 0.0082 | - |
571
+ | 0.8078 | 580 | 0.009 | - |
572
+ | 0.8217 | 590 | 0.0078 | - |
573
+ | 0.8357 | 600 | 0.0091 | 0.0104 |
574
+ | 0.8496 | 610 | 0.0088 | - |
575
+ | 0.8635 | 620 | 0.0103 | - |
576
+ | 0.8774 | 630 | 0.0109 | - |
577
+ | 0.8914 | 640 | 0.0072 | - |
578
+ | 0.9053 | 650 | 0.0084 | - |
579
+ | 0.9192 | 660 | 0.0099 | - |
580
+ | 0.9331 | 670 | 0.008 | - |
581
+ | 0.9471 | 680 | 0.0081 | - |
582
+ | 0.9610 | 690 | 0.0075 | - |
583
+ | 0.9749 | 700 | 0.0096 | 0.0103 |
584
+ | 0.9889 | 710 | 0.0089 | - |
585
+
586
+ * The bold row denotes the saved checkpoint.
587
+
588
+ ### Framework Versions
589
+ - Python: 3.10.12
590
+ - Sentence Transformers: 4.1.0
591
+ - Transformers: 4.51.3
592
+ - PyTorch: 2.1.0+cu118
593
+ - Accelerate: 1.6.0
594
+ - Datasets: 3.5.0
595
+ - Tokenizers: 0.21.1
596
+
597
+ ## Citation
598
+
599
+ ### BibTeX
600
+
601
+ #### Sentence Transformers
602
+ ```bibtex
603
+ @inproceedings{reimers-2019-sentence-bert,
604
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
605
+ author = "Reimers, Nils and Gurevych, Iryna",
606
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
607
+ month = "11",
608
+ year = "2019",
609
+ publisher = "Association for Computational Linguistics",
610
+ url = "https://arxiv.org/abs/1908.10084",
611
+ }
612
+ ```
613
+
614
+ #### TripletLoss
615
+ ```bibtex
616
+ @misc{hermans2017defense,
617
+ title={In Defense of the Triplet Loss for Person Re-Identification},
618
+ author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
619
+ year={2017},
620
+ eprint={1703.07737},
621
+ archivePrefix={arXiv},
622
+ primaryClass={cs.CV}
623
+ }
624
+ ```
625
+
626
+ <!--
627
+ ## Glossary
628
+
629
+ *Clearly define terms in order to be accessible across audiences.*
630
+ -->
631
+
632
+ <!--
633
+ ## Model Card Authors
634
+
635
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
636
+ -->
637
+
638
+ <!--
639
+ ## Model Card Contact
640
+
641
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
642
+ -->
config.json ADDED
@@ -0,0 +1,27 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "XLMRobertaModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "bos_token_id": 0,
7
+ "classifier_dropout": null,
8
+ "eos_token_id": 2,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 1024,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 4096,
14
+ "layer_norm_eps": 1e-05,
15
+ "max_position_embeddings": 514,
16
+ "model_type": "xlm-roberta",
17
+ "num_attention_heads": 16,
18
+ "num_hidden_layers": 24,
19
+ "output_past": true,
20
+ "pad_token_id": 1,
21
+ "position_embedding_type": "absolute",
22
+ "torch_dtype": "float32",
23
+ "transformers_version": "4.51.3",
24
+ "type_vocab_size": 1,
25
+ "use_cache": true,
26
+ "vocab_size": 250002
27
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "4.1.0",
4
+ "transformers": "4.51.3",
5
+ "pytorch": "2.1.0+cu118"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": "cosine"
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a3dd281ad225be331be86614a4975d1c79c3c7b720bcd3caad1360a2828c1dfa
3
+ size 2239607176
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "<unk>",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:883b037111086fd4dfebbbc9b7cee11e1517b5e0c0514879478661440f137085
3
+ size 17082987
tokenizer_config.json ADDED
@@ -0,0 +1,56 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<s>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<pad>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<unk>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "250001": {
36
+ "content": "<mask>",
37
+ "lstrip": true,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "additional_special_tokens": [],
45
+ "bos_token": "<s>",
46
+ "clean_up_tokenization_spaces": true,
47
+ "cls_token": "<s>",
48
+ "eos_token": "</s>",
49
+ "extra_special_tokens": {},
50
+ "mask_token": "<mask>",
51
+ "model_max_length": 512,
52
+ "pad_token": "<pad>",
53
+ "sep_token": "</s>",
54
+ "tokenizer_class": "XLMRobertaTokenizer",
55
+ "unk_token": "<unk>"
56
+ }