driante commited on
Commit
e861adb
·
verified ·
1 Parent(s): af3fc2e

Add new SentenceTransformer model

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 1024,
3
+ "pooling_mode_cls_token": true,
4
+ "pooling_mode_mean_tokens": false,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,734 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:82169
8
+ - loss:MultipleNegativesRankingLoss
9
+ base_model: BAAI/bge-m3
10
+ widget:
11
+ - source_sentence: can beef help reduce emissions
12
+ sentences:
13
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
14
+ and Sustainability Goals These studies each shed light on the quantitative effects
15
+ of shifting production or sourcing from a conventional system to an alternative
16
+ system.
17
+
18
+
19
+ Because Poore and Nemecek’s (2018) database only captured studies published between
20
+ 2000 and June 2016, we performed a literature review using similar search terms
21
+ and study inclusion criteria to capture additional studies that were published
22
+ through 2022. As Poore and Nemecek (2018) did, in some instances we performed
23
+ adjustments to fill data gaps or make results more comparable between studies
24
+ (e.g., estimating land use using data included in a study, making assumptions
25
+ to estimate impacts from the animals’ full life cycle). See Appendix A for more
26
+ details on our approach to adding in more recent studies and Appendix B for the
27
+ full list of “paired studies” included in our analysis below, as well as all adjustments
28
+ made. The Glossary provides definitions of the various production systems.
29
+
30
+
31
+ For each quantitative environmental indicator (e.g., GHG emissions, land use)
32
+ in each “paired study,” we calculated the percent changes that occurred when shifting
33
+ from the conventional system to the alternative production system.'
34
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
35
+ and Sustainability Goals Finally, there are more complex nutrient quality indices
36
+ that could be used as denominators (FAO 2021; Katz-Rosene et al. 2023), but, since
37
+ no consensus exists about which one is “best,” we have used the simpler denominator
38
+ of protein. In sum, use of any of these alternative numerators and denominators
39
+ would not change the main findings and recommendations of this report.
40
+
41
+
42
+ 4. For GHG emissions, we removed land-use-change emissions from the estimates
43
+ in Poore and Nemecek (2018), so as not to double-count with the “carbon opportunity
44
+ costs” of agricultural land use.'
45
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
46
+ and Sustainability Goals Shift toward lower-emissions foods. As noted elsewhere
47
+ in this report, because beef is an emissions-intensive food, shifting purchases
48
+ and sales toward lower-emissions foods can help companies reduce scope 3 emissions.
49
+
50
+
51
+ There is growing interest in improving grazing management to increase the amount
52
+ of carbon sequestered in pasturelands, a practice often called “regenerative grazing.”
53
+ Some proponents of regenerative grazing even suggest that by removing carbon from
54
+ the atmosphere, soil carbon sequestration could fully offset GHG emissions from
55
+ beef production, suggesting potentially “carbon neutral” or “carbon negative”
56
+ beef. And while traditional life cycle assessments assumed that soil carbon stocks
57
+ on agricultural lands were in equilibrium and did not include soil carbon stock
58
+ changes in studies on agriculture’s environmental impacts, more recent studies
59
+ have begun to incorporate soil carbon measurements, including several beef studies
60
+ included in our review (Buratti et al. 2017; Eldesouky et al. 2018; Stanley et
61
+ al. 2018).'
62
+ - source_sentence: what is the npv for land restoration in latin america
63
+ sentences:
64
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
65
+ and Sustainability Goals Overall, this places fish and seafood at the lower end
66
+ of the environmental impact spectrum for animal proteins (Gephart et al. 2021)
67
+ but usually still higher than plant-based proteins.
68
+
69
+
70
+ Similarly to terrestrial animal proteins, life cycle assessments of aquaculture
71
+ (fish farming) have found that there are environmental trade-offs with intensification.
72
+ When finfish and crustacean aquaculture systems move along the spectrum from more
73
+ traditional extensive systems to more industrialized intensive systems, land use
74
+ and water use per kilogram of fish declines, but water pollution and energy use
75
+ per kilogram of fish grow (Bohnes et al. 2018; Waite et al. 2014; Hall et al.
76
+ 2011). Effects on GHG emissions can be mixed under intensification due to the
77
+ growth in energy use and land use for feeds balanced by the reduction in land
78
+ use for ponds (Searchinger et al. 2019), and translation of land use into “carbon
79
+ opportunity costs” can help better weigh these trade-offs. Aquaculture is also
80
+ a significant user of wild fish as feed; more than 20 percent of total wild-caught
81
+ fish catch in 2020 went to “nonfood” uses—mostly for fishmeal and fish oil used
82
+ in aquaculture operations (FAO 2022c).'
83
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
84
+ and Sustainability Goals ▪ The company first simulates a pure “less meat” strategy
85
+ to reduce scope 3 emissions and carbon opportunity costs by a combined 25 percent.
86
+ To do so, it finds that sourcing 50 percent less beef, 20 percent less of other
87
+ meats, and 15 percent less dairy—and shifting the purchases toward pulses, soy,
88
+ and vegetables—achieves this 25 percent reduction in climate impacts.
89
+
90
+
91
+ ▪ The company then explores a plausible scenario of shifting all chicken and egg
92
+ purchases toward higherwelfare products. It uses Table 7 and selects points within
93
+ the impact ranges to assume that free-range chicken and eggs could lead to 15
94
+ percent higher GHG emissions and 25 percent higher land use (carbon opportunity
95
+ costs) than conventional chicken. The company estimates that this would increase
96
+ total climate impacts, but only slightly, since chicken and eggs represent a small
97
+ amount of the company’s total climate impact. Under this scenario, total climate
98
+ impacts are reduced versus the base year by “only” 24 percent instead of 25 percent.
99
+ ▪'
100
+ - The Economic Case for Landscape Restoration in Latin America This implies an underestimation
101
+ of benefits given that, in this form, the restoration scenario equation does not
102
+ account for the remaining annual difference in net flow values between the degraded
103
+ hectare that is restored and the same hectare left degraded for the years between
104
+ full restoration and the end of the study’s overall assumed 50-year time horizon.
105
+ The NPVs of all target hectares would have to be calculated for all 50 years,
106
+ particularly in the cases of lightly and moderately degraded lands which have
107
+ recovery periods under restoration (delimited in this equation by t, which are
108
+ only 7 and 15 years, respectively).
109
+ - source_sentence: what is meat sourcing strategy
110
+ sentences:
111
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
112
+ and Sustainability Goals CHAPTER 1 Introduction and context Meat and dairy production
113
+ are responsible for a large proportion of global greenhouse gas (GHG) emissions.
114
+ According to one widely cited estimate by the Food and Agriculture Organization
115
+ of the United Nations (FAO), animal agriculture (including the agricultural production
116
+ process and related land-use change) accounted for 14.5 percent of global GHG
117
+ emissions in 2005, with beef production alone accounting for 6 percent of global
118
+ emissions (Gerber and FAO 2013). Toward “Better” Meat? Aligning meat sourcing
119
+ strategies with corporate climate and sustainability goals | 11
120
+
121
+
122
+ More recent estimates for animal agriculture’s contribution to global emissions
123
+ in 2010–15 are of a similar magnitude, ranging from 11 to 20 percent (e.g., Poore
124
+ and Nemecek 2018; Twine 2021; Xu et al. 2021; FAO 2022a). Animal agriculture also
125
+ accounted for more than 30 percent of global methane emissions in 2017 (CCAC and
126
+ UNEP 2021).'
127
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
128
+ and Sustainability Goals Further work is necessary to gather publicly available
129
+ data on other environmental, social, and economic attributes of “better meat,”
130
+ such as for soil health, on-farm biodiversity, and agricultural livelihoods, to
131
+ inform corporate decision-making. Similarly, better data are needed on alternative
132
+ systems and practices related to fish and seafood production; these “blue foods”
133
+ are important contributors to global food and nutrition security, but data are
134
+ even scarcer for these food production systems than for terrestrial animal agriculture.
135
+
136
+
137
+ In an ideal world, “better meat” production could lead to improvements across
138
+ all sustainability goals; however, our analysis shows that companies with quantitative
139
+ sustainability goals need to consider both co-benefits and trade-offs across all
140
+ goals when designing their meat sourcing strategies. We also show that balancing
141
+ these goals is eminently possible. This analysis also confirms the critical importance
142
+ of shifting diets high in animal-based foods toward plant-based foods and alternative
143
+ proteins to improve both environmental and animal welfare outcomes.'
144
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
145
+ and Sustainability Goals It is true that poultry has a lower climate impact per
146
+ kilogram of protein than beef and lamb, and climate strategies may consider a
147
+ shift in purchasing from beef toward chicken to continue to provide the same amount
148
+ of meat to consumers while reducing GHG emissions. However, an important trade-off
149
+ to consider from an animal welfare perspective is the number of animal lives per
150
+ unit of protein produced. While alternative systems thought of as “better” might
151
+ improve the quality of life of the animals to some degree, animal welfare experts
152
+ also recognize the inherent value of all animals, and companies might choose to
153
+ factor the number of animals slaughtered into their decision-making as a simple
154
+ and easily understood indicator of animal welfare.
155
+
156
+
157
+ Figure 5 shows the trade-off between climate and animal welfare indicators when
158
+ shifting between animal-based foods, showing that the foods with the highest climate
159
+ impact per kg of protein also require the fewest animals to be killed, and vice
160
+ versa. For example, to produce a kg of protein, more than 100 times as many chickens
161
+ need to be slaughtered compared to cows.'
162
+ - source_sentence: cost of restoring landscape
163
+ sentences:
164
+ - 'The Economic Case for Landscape Restoration in Latin America This report assesses
165
+ the economic costs and benefits of landscape restoration in Latin America and
166
+ the Caribbean by monetizing a set of benefits that could flow from 20 million
167
+ hectares of restored lands. The introduction highlights some of the drivers and
168
+ impacts of degradation in the Latin America and Caribbean region. The section
169
+ that follows presents an overview of the method used to monetize the benefits
170
+ of landscape restoration; detailed descriptions of the methodology and modeling
171
+ approach are available in the annexes. Next, we present the results—the estimation
172
+ of net economic benefits from restoration and the different values for biomes
173
+ and degree of restoration. Finally, we suggest areas where future analysis could
174
+ provide more location-specific financial estimates.
175
+
176
+
177
+ Agriculture and forestry play an important role in the economy and social fabric
178
+ of Latin America and the Caribbean'
179
+ - 'Getting Ready Include a more comprehensive analysis of the legal framework for
180
+ tenure and existing conditions on-the-ground
181
+
182
+
183
+ Discuss how tenure conflicts might be addressed as part of the REDD+ strategy
184
+
185
+
186
+ Discusses the ability of forest agencies to plan and implement forest management
187
+ activities Considers the role of non-government stakeholders, including communities,
188
+ in forest management Links identified governance challenges to proposed REDD+
189
+ strategy options and implementation framework
190
+
191
+
192
+ The NPD provides an overview of recent efforts to improve forest management in
193
+ RoC, e.g., through the FLEGT Voluntary Partnership Agreement (VPA), certification
194
+ schemes, and improving the coverage and management of protected areas. According
195
+ to the NPD, the FLEGT process identified numerous forest sector challenges that
196
+ should be addressed as part of a REDD+ program, notably lack of forest administration
197
+ capacity and the need to strengthen involvement of local populations in forest
198
+ management decision-making. According to the NPD, over 4 million hectares of concessions
199
+ have been developed since 2001, but the NPD does not discuss the role of the private
200
+ sector in forest management activities in detail.'
201
+ - The Economic Case for Landscape Restoration in Latin America Nevertheless, because
202
+ E&M activities will always require more than a single year to be fully implemented,
203
+ the full per hectare cost should not be assigned to the first year of restoration
204
+ alone, but rather to a number of initial years along the restoration time horizon.
205
+ In the case of lightly degraded landscapes, the total cost/ha (from Tables 7 and
206
+ 8) has been divided and assigned equally to the first four years (or roughly the
207
+ first half) of the restoration time horizon. In the case of moderately degraded
208
+ lands, the total cost has been subtracted from annual benefit flow values in equal
209
+ annual tranches over the first 8 years (again, roughly the first half of the restoration
210
+ time horizon). Finally, total costs for severely degraded lands are subtracted
211
+ in equal annual amounts over the first 25 years of the restoration time horizon.
212
+ Allocating costs over a 25-year time horizon has the effect of discounting costs
213
+ relative to the benefits.
214
+ - source_sentence: what is the wri meat initiative?
215
+ sentences:
216
+ - 'Pilot analysis of global ecosystems: Grassland ecosystems Although GLASOD was
217
+ by necessity a somewhat subjective assessment it was extremely carefully prepared
218
+ by leading experts in the field. It remains the only global database on the status
219
+ of human-induced soil degradation, and no other data set comes as close to defining
220
+ the extent of desertification at the global scale (UNEP 1997: V).'
221
+ - 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate
222
+ and Sustainability Goals Toward “Better” Meat? Aligning meat sourcing strategies
223
+ with corporate climate and sustainability goals
224
+
225
+
226
+ WOR L D WOR L D R E S O U R C E S R E S O U R C E S I NS T I T U T E I NS T I
227
+ T U T E
228
+
229
+
230
+ RICHARD WAITE is the Acting Director for Agriculture Initiatives at WRI.
231
+
232
+
233
+ is a doctoral student with Oxford University’s Environmental Change Institute
234
+ and a former Research Analyst for WRI’s Food and Climate Programs.
235
+
236
+
237
+ CLARA CHO is the Data Analyst for the Coolfood initiative at WRI. Contact: clara.cho@wri.org.
238
+
239
+
240
+ We are pleased to acknowledge our institutional strategic partners that provide
241
+ core funding to WRI: the Netherlands Ministry of Foreign Affairs, Royal Danish
242
+ Ministry of Foreign Affairs, and Swedish International Development Cooperation
243
+ Agency.
244
+
245
+
246
+ The authors acknowledge the following individuals for their valuable guidance
247
+ and critical reviews:'
248
+ - 'Getting Ready THE IMPORTANCE OF FOREST GOVERNANCE TO THE REDD+ READINESS PROCESS
249
+
250
+
251
+ Strengthening forest governance will be an essential component of the activities
252
+ implemented by countries seeking to achieve significant and lasting emission reductions
253
+ through REDD+. Poor forest governance is often characterized by weak capacity
254
+ to manage natural resources, lack of decision-maker accountability to impacted
255
+ stakeholders, and lack of public access to information about the status and use
256
+ of forest resources. Potential drivers of deforestation and forest degradation—such
257
+ as illegal logging, unplanned forest conversion, and conflicts over access to
258
+ land and resources—are often symptoms of weak forest governance. To develop effective
259
+ national REDD+ strategies, governments need to better understand these challenges
260
+ and develop measures to strengthen forest governance in ways that build the trust
261
+ of domestic and international stakeholders.'
262
+ pipeline_tag: sentence-similarity
263
+ library_name: sentence-transformers
264
+ metrics:
265
+ - cosine_accuracy@1
266
+ - cosine_accuracy@3
267
+ - cosine_accuracy@5
268
+ - cosine_accuracy@10
269
+ - cosine_precision@1
270
+ - cosine_precision@3
271
+ - cosine_precision@5
272
+ - cosine_precision@10
273
+ - cosine_recall@1
274
+ - cosine_recall@3
275
+ - cosine_recall@5
276
+ - cosine_recall@10
277
+ - cosine_ndcg@10
278
+ - cosine_mrr@10
279
+ - cosine_map@100
280
+ model-index:
281
+ - name: SentenceTransformer based on BAAI/bge-m3
282
+ results:
283
+ - task:
284
+ type: information-retrieval
285
+ name: Information Retrieval
286
+ dataset:
287
+ name: ir eval
288
+ type: ir-eval
289
+ metrics:
290
+ - type: cosine_accuracy@1
291
+ value: 0.34030612244897956
292
+ name: Cosine Accuracy@1
293
+ - type: cosine_accuracy@3
294
+ value: 0.5389030612244898
295
+ name: Cosine Accuracy@3
296
+ - type: cosine_accuracy@5
297
+ value: 0.6211734693877551
298
+ name: Cosine Accuracy@5
299
+ - type: cosine_accuracy@10
300
+ value: 0.7122448979591837
301
+ name: Cosine Accuracy@10
302
+ - type: cosine_precision@1
303
+ value: 0.34030612244897956
304
+ name: Cosine Precision@1
305
+ - type: cosine_precision@3
306
+ value: 0.17963435374149658
307
+ name: Cosine Precision@3
308
+ - type: cosine_precision@5
309
+ value: 0.12423469387755101
310
+ name: Cosine Precision@5
311
+ - type: cosine_precision@10
312
+ value: 0.07122448979591836
313
+ name: Cosine Precision@10
314
+ - type: cosine_recall@1
315
+ value: 0.34030612244897956
316
+ name: Cosine Recall@1
317
+ - type: cosine_recall@3
318
+ value: 0.5389030612244898
319
+ name: Cosine Recall@3
320
+ - type: cosine_recall@5
321
+ value: 0.6211734693877551
322
+ name: Cosine Recall@5
323
+ - type: cosine_recall@10
324
+ value: 0.7122448979591837
325
+ name: Cosine Recall@10
326
+ - type: cosine_ndcg@10
327
+ value: 0.5191028810993514
328
+ name: Cosine Ndcg@10
329
+ - type: cosine_mrr@10
330
+ value: 0.458020782717851
331
+ name: Cosine Mrr@10
332
+ - type: cosine_map@100
333
+ value: 0.46727356494811056
334
+ name: Cosine Map@100
335
+ ---
336
+
337
+ # SentenceTransformer based on BAAI/bge-m3
338
+
339
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
340
+
341
+ ## Model Details
342
+
343
+ ### Model Description
344
+ - **Model Type:** Sentence Transformer
345
+ - **Base model:** [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) <!-- at revision 5617a9f61b028005a4858fdac845db406aefb181 -->
346
+ - **Maximum Sequence Length:** 8192 tokens
347
+ - **Output Dimensionality:** 1024 dimensions
348
+ - **Similarity Function:** Cosine Similarity
349
+ <!-- - **Training Dataset:** Unknown -->
350
+ <!-- - **Language:** Unknown -->
351
+ <!-- - **License:** Unknown -->
352
+
353
+ ### Model Sources
354
+
355
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
356
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
357
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
358
+
359
+ ### Full Model Architecture
360
+
361
+ ```
362
+ SentenceTransformer(
363
+ (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: XLMRobertaModel
364
+ (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
365
+ (2): Normalize()
366
+ )
367
+ ```
368
+
369
+ ## Usage
370
+
371
+ ### Direct Usage (Sentence Transformers)
372
+
373
+ First install the Sentence Transformers library:
374
+
375
+ ```bash
376
+ pip install -U sentence-transformers
377
+ ```
378
+
379
+ Then you can load this model and run inference.
380
+ ```python
381
+ from sentence_transformers import SentenceTransformer
382
+
383
+ # Download from the 🤗 Hub
384
+ model = SentenceTransformer("collaborativeearth/bge-m3_wri")
385
+ # Run inference
386
+ sentences = [
387
+ 'what is the wri meat initiative?',
388
+ 'Toward "Better" Meat? Aligning Meat Sourcing Strategies with Corporate Climate and Sustainability Goals Toward “Better” Meat? Aligning meat sourcing strategies with corporate climate and sustainability goals\n\nWOR L D WOR L D R E S O U R C E S R E S O U R C E S I NS T I T U T E I NS T I T U T E\n\nRICHARD WAITE is the Acting Director for Agriculture Initiatives at WRI.\n\nis a doctoral student with Oxford University’s Environmental Change Institute and a former Research Analyst for WRI’s Food and Climate Programs.\n\nCLARA CHO is the Data Analyst for the Coolfood initiative at WRI. Contact: clara.cho@wri.org.\n\nWe are pleased to acknowledge our institutional strategic partners that provide core funding to WRI: the Netherlands Ministry of Foreign Affairs, Royal Danish Ministry of Foreign Affairs, and Swedish International Development Cooperation Agency.\n\nThe authors acknowledge the following individuals for their valuable guidance and critical reviews:',
389
+ 'Pilot analysis of global ecosystems: Grassland ecosystems Although GLASOD was by necessity a somewhat subjective assessment it was extremely carefully prepared by leading experts in the field. It remains the only global database on the status of human-induced soil degradation, and no other data set comes as close to defining the extent of desertification at the global scale (UNEP 1997: V).',
390
+ ]
391
+ embeddings = model.encode(sentences)
392
+ print(embeddings.shape)
393
+ # [3, 1024]
394
+
395
+ # Get the similarity scores for the embeddings
396
+ similarities = model.similarity(embeddings, embeddings)
397
+ print(similarities.shape)
398
+ # [3, 3]
399
+ ```
400
+
401
+ <!--
402
+ ### Direct Usage (Transformers)
403
+
404
+ <details><summary>Click to see the direct usage in Transformers</summary>
405
+
406
+ </details>
407
+ -->
408
+
409
+ <!--
410
+ ### Downstream Usage (Sentence Transformers)
411
+
412
+ You can finetune this model on your own dataset.
413
+
414
+ <details><summary>Click to expand</summary>
415
+
416
+ </details>
417
+ -->
418
+
419
+ <!--
420
+ ### Out-of-Scope Use
421
+
422
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
423
+ -->
424
+
425
+ ## Evaluation
426
+
427
+ ### Metrics
428
+
429
+ #### Information Retrieval
430
+
431
+ * Dataset: `ir-eval`
432
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
433
+
434
+ | Metric | Value |
435
+ |:--------------------|:-----------|
436
+ | cosine_accuracy@1 | 0.3403 |
437
+ | cosine_accuracy@3 | 0.5389 |
438
+ | cosine_accuracy@5 | 0.6212 |
439
+ | cosine_accuracy@10 | 0.7122 |
440
+ | cosine_precision@1 | 0.3403 |
441
+ | cosine_precision@3 | 0.1796 |
442
+ | cosine_precision@5 | 0.1242 |
443
+ | cosine_precision@10 | 0.0712 |
444
+ | cosine_recall@1 | 0.3403 |
445
+ | cosine_recall@3 | 0.5389 |
446
+ | cosine_recall@5 | 0.6212 |
447
+ | cosine_recall@10 | 0.7122 |
448
+ | **cosine_ndcg@10** | **0.5191** |
449
+ | cosine_mrr@10 | 0.458 |
450
+ | cosine_map@100 | 0.4673 |
451
+
452
+ <!--
453
+ ## Bias, Risks and Limitations
454
+
455
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
456
+ -->
457
+
458
+ <!--
459
+ ### Recommendations
460
+
461
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
462
+ -->
463
+
464
+ ## Training Details
465
+
466
+ ### Training Dataset
467
+
468
+ #### Unnamed Dataset
469
+
470
+ * Size: 82,169 training samples
471
+ * Columns: <code>question</code> and <code>answer</code>
472
+ * Approximate statistics based on the first 1000 samples:
473
+ | | question | answer |
474
+ |:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
475
+ | type | string | string |
476
+ | details | <ul><li>min: 4 tokens</li><li>mean: 10.62 tokens</li><li>max: 31 tokens</li></ul> | <ul><li>min: 53 tokens</li><li>mean: 232.17 tokens</li><li>max: 337 tokens</li></ul> |
477
+ * Samples:
478
+ | question | answer |
479
+ |:---------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
480
+ | <code>what is the economic case of restoration</code> | <code>The Economic Case for Landscape Restoration in Latin America The Economic Case for Landscape Restoration in Latin America<br><br>THE ECONOMIC CASE FOR LANDSCAPE RESTORATION IN LATIN AMERICA<br><br>WALTER VERGARA, LUCIANA GALLARDO LOMELI, ANA R. RIOS, PAUL ISBELL, STEVEN PRAGER, RONNIE DE CAMINO<br><br>Land use and land-use change are central to the economic and social fabric of Latin America and the Caribbean, and essential to the region’s prospects for sustainable development. Countries are realizing that now, more than ever, is the time for action. Eleven countries, three Brazilian states and several regional programs have already committed to restoring more than 27 million hectares of degraded land in Latin America—but can these ambitions become a reality while supporting good living standards and economic development?</code> |
481
+ | <code>economic case of landscape restoration in latin america</code> | <code>The Economic Case for Landscape Restoration in Latin America The Economic Case for Landscape Restoration in Latin America<br><br>THE ECONOMIC CASE FOR LANDSCAPE RESTORATION IN LATIN AMERICA<br><br>WALTER VERGARA, LUCIANA GALLARDO LOMELI, ANA R. RIOS, PAUL ISBELL, STEVEN PRAGER, RONNIE DE CAMINO<br><br>Land use and land-use change are central to the economic and social fabric of Latin America and the Caribbean, and essential to the region’s prospects for sustainable development. Countries are realizing that now, more than ever, is the time for action. Eleven countries, three Brazilian states and several regional programs have already committed to restoring more than 27 million hectares of degraded land in Latin America—but can these ambitions become a reality while supporting good living standards and economic development?</code> |
482
+ | <code>what is lata-american landscape</code> | <code>The Economic Case for Landscape Restoration in Latin America Agriculture and forestry exports from Latin America represent about 13 percent of the global trade of food, feed, and fiber and account for a majority of employment outside large urban areas—numbers only expected to grow as Latin America is called upon to meet an increasing global demand for food. Yet, since the turn of the century, about 37 million hectares of natural forests, savannas and wetlands have been transformed to expand agriculture. Cumulative, unsustainable land-use practices have led to the degradation of about 300 million hectares, resulting in a reduction in yields and quality of production, and in losses in biomass content, soil quality, surface water hydrology, and biodiversity. Deforestation, land-use change, and unsustainable agricultural activities are also currently the largest drivers of climate change in the region, accounting for 56 percent of all greenhouse gas emissions. Today, while some progress ha...</code> |
483
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
484
+ ```json
485
+ {
486
+ "scale": 20.0,
487
+ "similarity_fct": "cos_sim"
488
+ }
489
+ ```
490
+
491
+ ### Training Hyperparameters
492
+ #### Non-Default Hyperparameters
493
+
494
+ - `eval_strategy`: steps
495
+ - `per_device_train_batch_size`: 32
496
+ - `learning_rate`: 1e-06
497
+ - `num_train_epochs`: 2
498
+ - `warmup_ratio`: 0.1
499
+ - `fp16`: True
500
+ - `gradient_checkpointing`: True
501
+ - `batch_sampler`: no_duplicates
502
+
503
+ #### All Hyperparameters
504
+ <details><summary>Click to expand</summary>
505
+
506
+ - `overwrite_output_dir`: False
507
+ - `do_predict`: False
508
+ - `eval_strategy`: steps
509
+ - `prediction_loss_only`: True
510
+ - `per_device_train_batch_size`: 32
511
+ - `per_device_eval_batch_size`: 8
512
+ - `per_gpu_train_batch_size`: None
513
+ - `per_gpu_eval_batch_size`: None
514
+ - `gradient_accumulation_steps`: 1
515
+ - `eval_accumulation_steps`: None
516
+ - `torch_empty_cache_steps`: None
517
+ - `learning_rate`: 1e-06
518
+ - `weight_decay`: 0.0
519
+ - `adam_beta1`: 0.9
520
+ - `adam_beta2`: 0.999
521
+ - `adam_epsilon`: 1e-08
522
+ - `max_grad_norm`: 1.0
523
+ - `num_train_epochs`: 2
524
+ - `max_steps`: -1
525
+ - `lr_scheduler_type`: linear
526
+ - `lr_scheduler_kwargs`: {}
527
+ - `warmup_ratio`: 0.1
528
+ - `warmup_steps`: 0
529
+ - `log_level`: passive
530
+ - `log_level_replica`: warning
531
+ - `log_on_each_node`: True
532
+ - `logging_nan_inf_filter`: True
533
+ - `save_safetensors`: True
534
+ - `save_on_each_node`: False
535
+ - `save_only_model`: False
536
+ - `restore_callback_states_from_checkpoint`: False
537
+ - `no_cuda`: False
538
+ - `use_cpu`: False
539
+ - `use_mps_device`: False
540
+ - `seed`: 42
541
+ - `data_seed`: None
542
+ - `jit_mode_eval`: False
543
+ - `use_ipex`: False
544
+ - `bf16`: False
545
+ - `fp16`: True
546
+ - `fp16_opt_level`: O1
547
+ - `half_precision_backend`: auto
548
+ - `bf16_full_eval`: False
549
+ - `fp16_full_eval`: False
550
+ - `tf32`: None
551
+ - `local_rank`: 0
552
+ - `ddp_backend`: None
553
+ - `tpu_num_cores`: None
554
+ - `tpu_metrics_debug`: False
555
+ - `debug`: []
556
+ - `dataloader_drop_last`: False
557
+ - `dataloader_num_workers`: 0
558
+ - `dataloader_prefetch_factor`: None
559
+ - `past_index`: -1
560
+ - `disable_tqdm`: False
561
+ - `remove_unused_columns`: True
562
+ - `label_names`: None
563
+ - `load_best_model_at_end`: False
564
+ - `ignore_data_skip`: False
565
+ - `fsdp`: []
566
+ - `fsdp_min_num_params`: 0
567
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
568
+ - `tp_size`: 0
569
+ - `fsdp_transformer_layer_cls_to_wrap`: None
570
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
571
+ - `deepspeed`: None
572
+ - `label_smoothing_factor`: 0.0
573
+ - `optim`: adamw_torch
574
+ - `optim_args`: None
575
+ - `adafactor`: False
576
+ - `group_by_length`: False
577
+ - `length_column_name`: length
578
+ - `ddp_find_unused_parameters`: None
579
+ - `ddp_bucket_cap_mb`: None
580
+ - `ddp_broadcast_buffers`: False
581
+ - `dataloader_pin_memory`: True
582
+ - `dataloader_persistent_workers`: False
583
+ - `skip_memory_metrics`: True
584
+ - `use_legacy_prediction_loop`: False
585
+ - `push_to_hub`: False
586
+ - `resume_from_checkpoint`: None
587
+ - `hub_model_id`: None
588
+ - `hub_strategy`: every_save
589
+ - `hub_private_repo`: None
590
+ - `hub_always_push`: False
591
+ - `gradient_checkpointing`: True
592
+ - `gradient_checkpointing_kwargs`: None
593
+ - `include_inputs_for_metrics`: False
594
+ - `include_for_metrics`: []
595
+ - `eval_do_concat_batches`: True
596
+ - `fp16_backend`: auto
597
+ - `push_to_hub_model_id`: None
598
+ - `push_to_hub_organization`: None
599
+ - `mp_parameters`:
600
+ - `auto_find_batch_size`: False
601
+ - `full_determinism`: False
602
+ - `torchdynamo`: None
603
+ - `ray_scope`: last
604
+ - `ddp_timeout`: 1800
605
+ - `torch_compile`: False
606
+ - `torch_compile_backend`: None
607
+ - `torch_compile_mode`: None
608
+ - `include_tokens_per_second`: False
609
+ - `include_num_input_tokens_seen`: False
610
+ - `neftune_noise_alpha`: None
611
+ - `optim_target_modules`: None
612
+ - `batch_eval_metrics`: False
613
+ - `eval_on_start`: False
614
+ - `use_liger_kernel`: False
615
+ - `eval_use_gather_object`: False
616
+ - `average_tokens_across_devices`: False
617
+ - `prompts`: None
618
+ - `batch_sampler`: no_duplicates
619
+ - `multi_dataset_batch_sampler`: proportional
620
+
621
+ </details>
622
+
623
+ ### Training Logs
624
+ | Epoch | Step | Training Loss | ir-eval_cosine_ndcg@10 |
625
+ |:------:|:----:|:-------------:|:----------------------:|
626
+ | -1 | -1 | - | 0.4718 |
627
+ | 0.0389 | 100 | 0.7439 | - |
628
+ | 0.0779 | 200 | 0.6208 | - |
629
+ | 0.1168 | 300 | 0.4568 | - |
630
+ | 0.1558 | 400 | 0.3713 | - |
631
+ | 0.1947 | 500 | 0.3263 | 0.5004 |
632
+ | 0.2336 | 600 | 0.2722 | - |
633
+ | 0.2726 | 700 | 0.2521 | - |
634
+ | 0.3115 | 800 | 0.2541 | - |
635
+ | 0.3505 | 900 | 0.2348 | - |
636
+ | 0.3894 | 1000 | 0.2321 | 0.5090 |
637
+ | 0.4283 | 1100 | 0.2313 | - |
638
+ | 0.4673 | 1200 | 0.2195 | - |
639
+ | 0.5062 | 1300 | 0.2286 | - |
640
+ | 0.5452 | 1400 | 0.2188 | - |
641
+ | 0.5841 | 1500 | 0.2166 | 0.5115 |
642
+ | 0.6231 | 1600 | 0.2194 | - |
643
+ | 0.6620 | 1700 | 0.2006 | - |
644
+ | 0.7009 | 1800 | 0.1954 | - |
645
+ | 0.7399 | 1900 | 0.2157 | - |
646
+ | 0.7788 | 2000 | 0.2059 | 0.5154 |
647
+ | 0.8178 | 2100 | 0.203 | - |
648
+ | 0.8567 | 2200 | 0.1949 | - |
649
+ | 0.8956 | 2300 | 0.1943 | - |
650
+ | 0.9346 | 2400 | 0.206 | - |
651
+ | 0.9735 | 2500 | 0.2015 | 0.5175 |
652
+ | 1.0125 | 2600 | 0.1801 | - |
653
+ | 1.0514 | 2700 | 0.1867 | - |
654
+ | 1.0903 | 2800 | 0.1914 | - |
655
+ | 1.1293 | 2900 | 0.1827 | - |
656
+ | 1.1682 | 3000 | 0.1899 | 0.5165 |
657
+ | 1.2072 | 3100 | 0.1707 | - |
658
+ | 1.2461 | 3200 | 0.1872 | - |
659
+ | 1.2850 | 3300 | 0.1943 | - |
660
+ | 1.3240 | 3400 | 0.1854 | - |
661
+ | 1.3629 | 3500 | 0.1747 | 0.5182 |
662
+ | 1.4019 | 3600 | 0.1764 | - |
663
+ | 1.4408 | 3700 | 0.1866 | - |
664
+ | 1.4798 | 3800 | 0.1855 | - |
665
+ | 1.5187 | 3900 | 0.1782 | - |
666
+ | 1.5576 | 4000 | 0.1744 | 0.5181 |
667
+ | 1.5966 | 4100 | 0.1793 | - |
668
+ | 1.6355 | 4200 | 0.187 | - |
669
+ | 1.6745 | 4300 | 0.1907 | - |
670
+ | 1.7134 | 4400 | 0.1781 | - |
671
+ | 1.7523 | 4500 | 0.1825 | 0.5185 |
672
+ | 1.7913 | 4600 | 0.1981 | - |
673
+ | 1.8302 | 4700 | 0.1751 | - |
674
+ | 1.8692 | 4800 | 0.1824 | - |
675
+ | 1.9081 | 4900 | 0.1866 | - |
676
+ | 1.9470 | 5000 | 0.188 | 0.5191 |
677
+ | 1.9860 | 5100 | 0.1838 | - |
678
+
679
+
680
+ ### Framework Versions
681
+ - Python: 3.11.12
682
+ - Sentence Transformers: 4.1.0
683
+ - Transformers: 4.51.3
684
+ - PyTorch: 2.6.0+cu124
685
+ - Accelerate: 1.6.0
686
+ - Datasets: 2.14.4
687
+ - Tokenizers: 0.21.1
688
+
689
+ ## Citation
690
+
691
+ ### BibTeX
692
+
693
+ #### Sentence Transformers
694
+ ```bibtex
695
+ @inproceedings{reimers-2019-sentence-bert,
696
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
697
+ author = "Reimers, Nils and Gurevych, Iryna",
698
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
699
+ month = "11",
700
+ year = "2019",
701
+ publisher = "Association for Computational Linguistics",
702
+ url = "https://arxiv.org/abs/1908.10084",
703
+ }
704
+ ```
705
+
706
+ #### MultipleNegativesRankingLoss
707
+ ```bibtex
708
+ @misc{henderson2017efficient,
709
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
710
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
711
+ year={2017},
712
+ eprint={1705.00652},
713
+ archivePrefix={arXiv},
714
+ primaryClass={cs.CL}
715
+ }
716
+ ```
717
+
718
+ <!--
719
+ ## Glossary
720
+
721
+ *Clearly define terms in order to be accessible across audiences.*
722
+ -->
723
+
724
+ <!--
725
+ ## Model Card Authors
726
+
727
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
728
+ -->
729
+
730
+ <!--
731
+ ## Model Card Contact
732
+
733
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
734
+ -->
config.json ADDED
@@ -0,0 +1,27 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "XLMRobertaModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "bos_token_id": 0,
7
+ "classifier_dropout": null,
8
+ "eos_token_id": 2,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 1024,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 4096,
14
+ "layer_norm_eps": 1e-05,
15
+ "max_position_embeddings": 8194,
16
+ "model_type": "xlm-roberta",
17
+ "num_attention_heads": 16,
18
+ "num_hidden_layers": 24,
19
+ "output_past": true,
20
+ "pad_token_id": 1,
21
+ "position_embedding_type": "absolute",
22
+ "torch_dtype": "float32",
23
+ "transformers_version": "4.51.3",
24
+ "type_vocab_size": 1,
25
+ "use_cache": true,
26
+ "vocab_size": 250002
27
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "4.1.0",
4
+ "transformers": "4.51.3",
5
+ "pytorch": "2.6.0+cu124"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": "cosine"
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a37e65d7dbd0677dbc3752d7b8727c91b525519648e042cc02c3cddc58a3a04e
3
+ size 2271064456
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 8192,
3
+ "do_lower_case": false
4
+ }
sentencepiece.bpe.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cfc8146abe2a0488e9e2a0c56de7952f7c11ab059eca145a0a727afce0db2865
3
+ size 5069051
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "<unk>",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e4f7e21bec3fb0044ca0bb2d50eb5d4d8c596273c422baef84466d2c73748b9c
3
+ size 17083053
tokenizer_config.json ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<s>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<pad>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<unk>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "250001": {
36
+ "content": "<mask>",
37
+ "lstrip": true,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "<s>",
45
+ "clean_up_tokenization_spaces": true,
46
+ "cls_token": "<s>",
47
+ "eos_token": "</s>",
48
+ "extra_special_tokens": {},
49
+ "mask_token": "<mask>",
50
+ "max_length": 8192,
51
+ "model_max_length": 8192,
52
+ "pad_to_multiple_of": null,
53
+ "pad_token": "<pad>",
54
+ "pad_token_type_id": 0,
55
+ "padding_side": "right",
56
+ "sep_token": "</s>",
57
+ "sp_model_kwargs": {},
58
+ "stride": 0,
59
+ "tokenizer_class": "XLMRobertaTokenizer",
60
+ "truncation_side": "right",
61
+ "truncation_strategy": "longest_first",
62
+ "unk_token": "<unk>"
63
+ }