afalaudn commited on
Commit
97d910b
·
verified ·
1 Parent(s): 7f3283e

Add new SentenceTransformer model

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
2_Dense/config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "in_features": 768,
3
+ "out_features": 3072,
4
+ "bias": false,
5
+ "activation_function": "torch.nn.modules.linear.Identity"
6
+ }
2_Dense/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7ef5fea9d0f2a2059b0914004aa94a87dfa48772c6922d7d33dcefc01dae2d00
3
+ size 9437272
3_Dense/config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "in_features": 3072,
3
+ "out_features": 768,
4
+ "bias": false,
5
+ "activation_function": "torch.nn.modules.linear.Identity"
6
+ }
3_Dense/model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eff5c695c0ed04480a9ea2e378e6b5a583420aa3419ffeb159f615d35bca3065
3
+ size 9437272
README.md ADDED
@@ -0,0 +1,639 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - dense
7
+ - generated_from_trainer
8
+ - dataset_size:1340
9
+ - loss:TripletLoss
10
+ base_model: unsloth/embeddinggemma-300m
11
+ widget:
12
+ - source_sentence: How many 'Laboratory' sub-type admissions from the last year have
13
+ not yet generated an invoice, and what is the average duration (in days) from
14
+ their admission date to today?
15
+ sentences:
16
+ - 'Primary reference table listing the most granular level of administrative geography,
17
+ typically representing villages or ''kelurahan''. It provides standardized names
18
+ and codes for these local areas, linking each sub-district to its parent `DistrictId`
19
+ to complete the geographic hierarchy. **Use this table to** perform highly localized
20
+ demographic analysis, standardize patient addresses to the village level, or define
21
+ precise service areas. **Note: This is a static lookup list of geographic areas
22
+ and does not contain any patient-specific addresses, postal codes, or household
23
+ information.**'
24
+ - 'Primary reference table defining the lifecycle stages of a patient admission
25
+ event. It categorizes visits into states such as ''Active'' (currently in hospital),
26
+ ''Discharged'' (left hospital), ''Invoiced'' (bill generated), or ''Cancelled''.
27
+ Use this table to interpret AdmissionStatusId in transaction tables to filter
28
+ visits by their current operational state (e.g., calculating current census vs.
29
+ historical discharges). Note: This is a static lookup table for status definitions,
30
+ NOT a transaction log of patient movements.'
31
+ - 'Operational transaction table (Financial Log) recording the header-level details
32
+ of patient invoices and billing events. This table captures the financial breakdown
33
+ of a visit, distinguishing between Patient responsibility (Out-of-pocket) and
34
+ Payer responsibility (Insurance/Corporate Coverage), including Gross Amounts,
35
+ Discounts, Taxes, and Net Payable values. Use this table to analyze hospital revenue
36
+ streams, track Accounts Receivable (AR), monitor billing cancellations, or calculate
37
+ the financial yield per admission. Note: This is the Invoice HEADER table containing
38
+ total values; it does not typically list the specific individual line items (drugs,
39
+ labs, services) charged within the bill. For any financial analysis related to
40
+ hospital revenue, Payments, Accounts Receivable (AR), billing breakdowns, or insurance
41
+ claims, this invoice header table is the definitive starting point.'
42
+ - source_sentence: Calculate the proportion of invoices issued to patients with 'Inactive'
43
+ medical records out of the total invoices issued in the last fiscal year.
44
+ sentences:
45
+ - 'Standard reference table for patient biological sex and gender classification.
46
+ Contains standard values ''Male'' (Pria) and ''Female'' (Wanita). Use this table
47
+ to filter patient demographics by gender, support clinical reference range checks,
48
+ or generate gender-based population statistics. Note: Used for biological sex
49
+ classification.'
50
+ - 'Primary reference table storing the comprehensive master profiles of all patients
51
+ registered in the hospital system. It contains critical demographic data including
52
+ Name, Medical Record Number (MrNo), Birth Date, Gender (SexId), Address, Contact
53
+ Information (Phone, Email), and Identification details (National ID, Passport).
54
+ It also tracks patient status (Active/Merged/Deceased) and links to external Payer/Insurance
55
+ information. Use this table to identify unique patients, retrieve contact details
56
+ for notifications, segment patient populations by demographics, or link clinical
57
+ transactions to specific individuals. Note: This table represents the ''Golden
58
+ Record'' of a patient''s identity; historical changes might be tracked in audit
59
+ logs, but this table holds the current state.'
60
+ - 'Primary reference table storing the comprehensive master profiles of all patients
61
+ registered in the hospital system. It contains critical demographic data including
62
+ Name, Medical Record Number (MrNo), Birth Date, Gender (SexId), Address, Contact
63
+ Information (Phone, Email), and Identification details (National ID, Passport).
64
+ It also tracks patient status (Active/Merged/Deceased) and links to external Payer/Insurance
65
+ information. Use this table to identify unique patients, retrieve contact details
66
+ for notifications, segment patient populations by demographics, or link clinical
67
+ transactions to specific individuals. Note: This table represents the ''Golden
68
+ Record'' of a patient''s identity; historical changes might be tracked in audit
69
+ logs, but this table holds the current state.'
70
+ - source_sentence: What is the total number of invoices and the total value of these
71
+ invoices issued in Q1 2024 for female patients whose current status is 'Active'?
72
+ sentences:
73
+ - 'Operational snapshot table that holds the **single, current financial state**
74
+ of a patient at a specific hospital unit. Unlike a transaction log, this table
75
+ does not track history; it contains only the final calculated totals: `BalanceAmount`
76
+ (Total AR/Debt outstanding) and `DepositBalanceAmount` (Total Advance Payment
77
+ available). **Use this table for** quick validation of whether a patient can be
78
+ discharged, checking total debt before billing, or viewing available deposit funds.
79
+ **Concept:** Think of this as the ''ATM Screen Balance'' showing only the final
80
+ amount available right now.'
81
+ - 'Primary reference table defining the validity and lifecycle status of a patient''s
82
+ master record. Contains status codes such as ''Active'', ''Inactive'', and ''Deceased''
83
+ to classify patient accounts. **Use this table to** look up status meanings or
84
+ filter patient queries based on their account state (e.g., excluding deceased
85
+ patients from active lists). **Note: This is a lookup dictionary for status codes,
86
+ NOT a transaction log of status changes for specific patients.**'
87
+ - 'Operational transaction table recording individual line items within patient
88
+ invoices (Accounts Receivable). It captures granular billing details including
89
+ specific items sold (drugs, services), quantities, unit prices, discounts, tax
90
+ calculations, and the financial split between Patient and Payer (Insurance/Guarantor).
91
+ It also tracks revenue allocation (Hospital vs. Doctor portion). **Use this table
92
+ to** generate detailed patient bills, audit revenue streams per item, calculate
93
+ doctor performance fees, or analyze discount utilization. **Note: This table contains
94
+ financial billing data per item, NOT the clinical medical results or the master
95
+ list of available services.**'
96
+ - source_sentence: How many unique male patients were referred by a 'Doctor Referral'
97
+ and had a 'General' patient type for any admission in the last 6 months?
98
+ sentences:
99
+ - 'Operational transaction table (Financial Log) recording the header-level details
100
+ of patient invoices and billing events. This table captures the financial breakdown
101
+ of a visit, distinguishing between Patient responsibility (Out-of-pocket) and
102
+ Payer responsibility (Insurance/Corporate Coverage), including Gross Amounts,
103
+ Discounts, Taxes, and Net Payable values. Use this table to analyze hospital revenue
104
+ streams, track Accounts Receivable (AR), monitor billing cancellations, or calculate
105
+ the financial yield per admission. Note: This is the Invoice HEADER table containing
106
+ total values; it does not typically list the specific individual line items (drugs,
107
+ labs, services) charged within the bill. For any financial analysis related to
108
+ hospital revenue, Payments, Accounts Receivable (AR), billing breakdowns, or insurance
109
+ claims, this invoice header table is the definitive starting point.'
110
+ - 'Standard reference table for patient biological sex and gender classification.
111
+ Contains standard values ''Male'' (Pria) and ''Female'' (Wanita). Use this table
112
+ to filter patient demographics by gender, support clinical reference range checks,
113
+ or generate gender-based population statistics. Note: Used for biological sex
114
+ classification.'
115
+ - 'Operational transaction table recording unstructured free-text medical notes
116
+ and preliminary clinical remarks associated with a patient admission. It captures
117
+ initial diagnosis impressions, symptoms, or observation notes (e.g., ''Asthma'',
118
+ ''Observation Febris'') entered during the admission process. Use this table to
119
+ retrieve qualitative clinical context for a visit or search for specific medical
120
+ conditions mentioned in preliminary notes. Note: This table contains raw free-text
121
+ descriptions, NOT structured ICD-10 diagnosis codes used for billing.'
122
+ - source_sentence: Retrieve the names of all patients who have a 'MedicalRecordStatusId'
123
+ indicating 'Active' in their PatientOrganization record and have received an invoice
124
+ from any organization in the current month.
125
+ sentences:
126
+ - 'Primary reference table storing the comprehensive master profiles of all patients
127
+ registered in the hospital system. It contains critical demographic data including
128
+ Name, Medical Record Number (MrNo), Birth Date, Gender (SexId), Address, Contact
129
+ Information (Phone, Email), and Identification details (National ID, Passport).
130
+ It also tracks patient status (Active/Merged/Deceased) and links to external Payer/Insurance
131
+ information. Use this table to identify unique patients, retrieve contact details
132
+ for notifications, segment patient populations by demographics, or link clinical
133
+ transactions to specific individuals. Note: This table represents the ''Golden
134
+ Record'' of a patient''s identity; historical changes might be tracked in audit
135
+ logs, but this table holds the current state.'
136
+ - 'Primary reference table defining the standard set of religions and spiritual
137
+ beliefs for patient demographic data. It contains codes and names for major religions
138
+ such as Islam, Christian, Catholic, Hindu, and Buddhist. **Use this table to**
139
+ interpret the `ReligionId` from patient master data to standardize demographic
140
+ reporting or analyze patient spiritual care needs. **Note: This table lists the
141
+ CATEGORIES of religion only, not the religious affiliation of individual patients.**'
142
+ - 'Operational transaction table (Financial Log) recording the header-level details
143
+ of patient invoices and billing events. This table captures the financial breakdown
144
+ of a visit, distinguishing between Patient responsibility (Out-of-pocket) and
145
+ Payer responsibility (Insurance/Corporate Coverage), including Gross Amounts,
146
+ Discounts, Taxes, and Net Payable values. Use this table to analyze hospital revenue
147
+ streams, track Accounts Receivable (AR), monitor billing cancellations, or calculate
148
+ the financial yield per admission. Note: This is the Invoice HEADER table containing
149
+ total values; it does not typically list the specific individual line items (drugs,
150
+ labs, services) charged within the bill. For any financial analysis related to
151
+ hospital revenue, Payments, Accounts Receivable (AR), billing breakdowns, or insurance
152
+ claims, this invoice header table is the definitive starting point.'
153
+ pipeline_tag: sentence-similarity
154
+ library_name: sentence-transformers
155
+ metrics:
156
+ - cosine_accuracy@1
157
+ - cosine_accuracy@3
158
+ - cosine_accuracy@5
159
+ - cosine_accuracy@10
160
+ - cosine_precision@1
161
+ - cosine_precision@3
162
+ - cosine_precision@5
163
+ - cosine_precision@10
164
+ - cosine_recall@1
165
+ - cosine_recall@3
166
+ - cosine_recall@5
167
+ - cosine_recall@10
168
+ - cosine_ndcg@10
169
+ - cosine_mrr@10
170
+ - cosine_map@100
171
+ model-index:
172
+ - name: SentenceTransformer based on unsloth/embeddinggemma-300m
173
+ results:
174
+ - task:
175
+ type: information-retrieval
176
+ name: Information Retrieval
177
+ dataset:
178
+ name: Unknown
179
+ type: unknown
180
+ metrics:
181
+ - type: cosine_accuracy@1
182
+ value: 0.03283582089552239
183
+ name: Cosine Accuracy@1
184
+ - type: cosine_accuracy@3
185
+ value: 0.0626865671641791
186
+ name: Cosine Accuracy@3
187
+ - type: cosine_accuracy@5
188
+ value: 0.08358208955223881
189
+ name: Cosine Accuracy@5
190
+ - type: cosine_accuracy@10
191
+ value: 0.13134328358208955
192
+ name: Cosine Accuracy@10
193
+ - type: cosine_precision@1
194
+ value: 0.03283582089552239
195
+ name: Cosine Precision@1
196
+ - type: cosine_precision@3
197
+ value: 0.0208955223880597
198
+ name: Cosine Precision@3
199
+ - type: cosine_precision@5
200
+ value: 0.01671641791044776
201
+ name: Cosine Precision@5
202
+ - type: cosine_precision@10
203
+ value: 0.013134328358208956
204
+ name: Cosine Precision@10
205
+ - type: cosine_recall@1
206
+ value: 0.03283582089552239
207
+ name: Cosine Recall@1
208
+ - type: cosine_recall@3
209
+ value: 0.0626865671641791
210
+ name: Cosine Recall@3
211
+ - type: cosine_recall@5
212
+ value: 0.08358208955223881
213
+ name: Cosine Recall@5
214
+ - type: cosine_recall@10
215
+ value: 0.13134328358208955
216
+ name: Cosine Recall@10
217
+ - type: cosine_ndcg@10
218
+ value: 0.07347950410091954
219
+ name: Cosine Ndcg@10
220
+ - type: cosine_mrr@10
221
+ value: 0.05603056147832267
222
+ name: Cosine Mrr@10
223
+ - type: cosine_map@100
224
+ value: 0.06301005721428388
225
+ name: Cosine Map@100
226
+ ---
227
+
228
+ # SentenceTransformer based on unsloth/embeddinggemma-300m
229
+
230
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [unsloth/embeddinggemma-300m](https://huggingface.co/unsloth/embeddinggemma-300m). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
231
+
232
+ ## Model Details
233
+
234
+ ### Model Description
235
+ - **Model Type:** Sentence Transformer
236
+ - **Base model:** [unsloth/embeddinggemma-300m](https://huggingface.co/unsloth/embeddinggemma-300m) <!-- at revision bfa3c846ac738e62aa61806ef9112d34acb1dc5a -->
237
+ - **Maximum Sequence Length:** 768 tokens
238
+ - **Output Dimensionality:** 768 dimensions
239
+ - **Similarity Function:** Cosine Similarity
240
+ <!-- - **Training Dataset:** Unknown -->
241
+ <!-- - **Language:** Unknown -->
242
+ <!-- - **License:** Unknown -->
243
+
244
+ ### Model Sources
245
+
246
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
247
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
248
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
249
+
250
+ ### Full Model Architecture
251
+
252
+ ```
253
+ SentenceTransformer(
254
+ (0): Transformer({'max_seq_length': 768, 'do_lower_case': False, 'architecture': 'Gemma3TextModel'})
255
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
256
+ (2): Dense({'in_features': 768, 'out_features': 3072, 'bias': False, 'activation_function': 'torch.nn.modules.linear.Identity'})
257
+ (3): Dense({'in_features': 3072, 'out_features': 768, 'bias': False, 'activation_function': 'torch.nn.modules.linear.Identity'})
258
+ (4): Normalize()
259
+ )
260
+ ```
261
+
262
+ ## Usage
263
+
264
+ ### Direct Usage (Sentence Transformers)
265
+
266
+ First install the Sentence Transformers library:
267
+
268
+ ```bash
269
+ pip install -U sentence-transformers
270
+ ```
271
+
272
+ Then you can load this model and run inference.
273
+ ```python
274
+ from sentence_transformers import SentenceTransformer
275
+
276
+ # Download from the 🤗 Hub
277
+ model = SentenceTransformer("afalaudn/gemma-embedding-ft3")
278
+ # Run inference
279
+ sentences = [
280
+ "Retrieve the names of all patients who have a 'MedicalRecordStatusId' indicating 'Active' in their PatientOrganization record and have received an invoice from any organization in the current month.",
281
+ "Primary reference table storing the comprehensive master profiles of all patients registered in the hospital system. It contains critical demographic data including Name, Medical Record Number (MrNo), Birth Date, Gender (SexId), Address, Contact Information (Phone, Email), and Identification details (National ID, Passport). It also tracks patient status (Active/Merged/Deceased) and links to external Payer/Insurance information. Use this table to identify unique patients, retrieve contact details for notifications, segment patient populations by demographics, or link clinical transactions to specific individuals. Note: This table represents the 'Golden Record' of a patient's identity; historical changes might be tracked in audit logs, but this table holds the current state.",
282
+ 'Primary reference table defining the standard set of religions and spiritual beliefs for patient demographic data. It contains codes and names for major religions such as Islam, Christian, Catholic, Hindu, and Buddhist. **Use this table to** interpret the `ReligionId` from patient master data to standardize demographic reporting or analyze patient spiritual care needs. **Note: This table lists the CATEGORIES of religion only, not the religious affiliation of individual patients.**',
283
+ ]
284
+ embeddings = model.encode(sentences)
285
+ print(embeddings.shape)
286
+ # [3, 768]
287
+
288
+ # Get the similarity scores for the embeddings
289
+ similarities = model.similarity(embeddings, embeddings)
290
+ print(similarities)
291
+ # tensor([[ 1.0000, 0.6662, -0.1269],
292
+ # [ 0.6662, 1.0000, 0.1456],
293
+ # [-0.1269, 0.1456, 1.0000]])
294
+ ```
295
+
296
+ <!--
297
+ ### Direct Usage (Transformers)
298
+
299
+ <details><summary>Click to see the direct usage in Transformers</summary>
300
+
301
+ </details>
302
+ -->
303
+
304
+ <!--
305
+ ### Downstream Usage (Sentence Transformers)
306
+
307
+ You can finetune this model on your own dataset.
308
+
309
+ <details><summary>Click to expand</summary>
310
+
311
+ </details>
312
+ -->
313
+
314
+ <!--
315
+ ### Out-of-Scope Use
316
+
317
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
318
+ -->
319
+
320
+ ## Evaluation
321
+
322
+ ### Metrics
323
+
324
+ #### Information Retrieval
325
+
326
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
327
+
328
+ | Metric | Value |
329
+ |:--------------------|:-----------|
330
+ | cosine_accuracy@1 | 0.0328 |
331
+ | cosine_accuracy@3 | 0.0627 |
332
+ | cosine_accuracy@5 | 0.0836 |
333
+ | cosine_accuracy@10 | 0.1313 |
334
+ | cosine_precision@1 | 0.0328 |
335
+ | cosine_precision@3 | 0.0209 |
336
+ | cosine_precision@5 | 0.0167 |
337
+ | cosine_precision@10 | 0.0131 |
338
+ | cosine_recall@1 | 0.0328 |
339
+ | cosine_recall@3 | 0.0627 |
340
+ | cosine_recall@5 | 0.0836 |
341
+ | cosine_recall@10 | 0.1313 |
342
+ | **cosine_ndcg@10** | **0.0735** |
343
+ | cosine_mrr@10 | 0.056 |
344
+ | cosine_map@100 | 0.063 |
345
+
346
+ <!--
347
+ ## Bias, Risks and Limitations
348
+
349
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
350
+ -->
351
+
352
+ <!--
353
+ ### Recommendations
354
+
355
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
356
+ -->
357
+
358
+ ## Training Details
359
+
360
+ ### Training Dataset
361
+
362
+ #### Unnamed Dataset
363
+
364
+ * Size: 1,340 training samples
365
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
366
+ * Approximate statistics based on the first 1000 samples:
367
+ | | anchor | positive | negative |
368
+ |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
369
+ | type | string | string | string |
370
+ | details | <ul><li>min: 10 tokens</li><li>mean: 35.94 tokens</li><li>max: 82 tokens</li></ul> | <ul><li>min: 63 tokens</li><li>mean: 129.79 tokens</li><li>max: 171 tokens</li></ul> | <ul><li>min: 63 tokens</li><li>mean: 112.02 tokens</li><li>max: 175 tokens</li></ul> |
371
+ * Samples:
372
+ | anchor | positive | negative |
373
+ |:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
374
+ | <code>What is the total number of patient admissions that resulted in a 'Recovered' discharge condition for each organization in 2023?</code> | <code>Operational transaction table recording every patient registration and visit event at the hospital. This table consolidates patient demographics, visit types (Inpatient, Outpatient, Emergency), primary and referral doctors, payer/insurance eligibility, and critical timelines (Admission and Discharge dates). Use this table to calculate patient census, Average Length of Stay (ALOS), track patient flow, or analyze admission volume by doctor or department. Note: This table focuses on administrative registration and billing initiation; it does not contain detailed clinical notes, specific lab results, or medication prescriptions. When analyzing patient administrative inflow and outflow data, this table is the primary and essential source for all patient visit metrics.</code> | <code>This reference table defines the high-level administrative outcomes of a patient's discharge process. It includes standard codes, English names, and Indonesian translations (LocalName) such as 'Discharged' (Pulang) and 'Cancelled' (Batal). Use this table to categorize the final resolution of a hospital visit, determining if a patient successfully left care or if the discharge process was voided. Note: This table tracks the administrative status of the discharge event itself, not the clinical medical condition (e.g., improved, cured) or the specific reason for discharge.</code> |
375
+ | <code>Retrieve the AdmissionNo and InvoiceNo for all admissions where the patient was not previously registered (Admission.IsPatientRegistered = 0) and the invoice was issued by an organization with an IsActive status of 'true'.</code> | <code>Operational transaction table (Financial Log) recording the header-level details of patient invoices and billing events. This table captures the financial breakdown of a visit, distinguishing between Patient responsibility (Out-of-pocket) and Payer responsibility (Insurance/Corporate Coverage), including Gross Amounts, Discounts, Taxes, and Net Payable values. Use this table to analyze hospital revenue streams, track Accounts Receivable (AR), monitor billing cancellations, or calculate the financial yield per admission. Note: This is the Invoice HEADER table containing total values; it does not typically list the specific individual line items (drugs, labs, services) charged within the bill. For any financial analysis related to hospital revenue, Payments, Accounts Receivable (AR), billing breakdowns, or insurance claims, this invoice header table is the definitive starting point.</code> | <code>Operational transaction table recording formal requests to retrieve physical medical record folders from the Medical Records Department (MRD). It tracks the details of the request, including the specific patient file (`PatientOrganizationId`), the requesting doctor (`DoctorUserId`), the intended destination (`FolderDestinationTypeId`), and the urgency or type of request (`FolderRequestTypeId`). **Use this table to** measure MRD service levels, track the volume of physical file retrievals, or audit the reasons for accessing physical records. **Note: This tracks the *request* for a physical object, NOT the digital access to the EMR or the actual content of the medical record.**</code> |
376
+ | <code>What is the average age of patients with 'Active' status who had an 'Inpatient' admission in 2024, compared to those with 'Inactive' status?</code> | <code>Operational transaction table recording every patient registration and visit event at the hospital. This table consolidates patient demographics, visit types (Inpatient, Outpatient, Emergency), primary and referral doctors, payer/insurance eligibility, and critical timelines (Admission and Discharge dates). Use this table to calculate patient census, Average Length of Stay (ALOS), track patient flow, or analyze admission volume by doctor or department. Note: This table focuses on administrative registration and billing initiation; it does not contain detailed clinical notes, specific lab results, or medication prescriptions. When analyzing patient administrative inflow and outflow data, this table is the primary and essential source for all patient visit metrics.</code> | <code>Operational transaction table recording the official event of a patient leaving the hospital (Discharge). It captures the precise discharge timestamp, the patient's condition upon exit (e.g., Recovered, Improved), and the type of discharge (e.g., Medical Consent, Transfer) linked to their Admission. **Use this table to** calculate Length of Stay (LOS), analyze clinical outcomes, or track bed turnover rates. **Note: This table signifies the physical or administrative end of a visit; it does NOT contain the final invoice amount, though it triggers the billing closure process.**</code> |
377
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
378
+ ```json
379
+ {
380
+ "distance_metric": "TripletDistanceMetric.COSINE",
381
+ "triplet_margin": 1
382
+ }
383
+ ```
384
+
385
+ ### Evaluation Dataset
386
+
387
+ #### Unnamed Dataset
388
+
389
+ * Size: 335 evaluation samples
390
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
391
+ * Approximate statistics based on the first 335 samples:
392
+ | | anchor | positive | negative |
393
+ |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
394
+ | type | string | string | string |
395
+ | details | <ul><li>min: 10 tokens</li><li>mean: 35.25 tokens</li><li>max: 77 tokens</li></ul> | <ul><li>min: 63 tokens</li><li>mean: 128.23 tokens</li><li>max: 171 tokens</li></ul> | <ul><li>min: 63 tokens</li><li>mean: 111.21 tokens</li><li>max: 171 tokens</li></ul> |
396
+ * Samples:
397
+ | anchor | positive | negative |
398
+ |:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
399
+ | <code>Retrieve the admission number, invoice number, and admission type name for all admissions that occurred in January 2023 and had an invoice issued within 7 days of the admission date.</code> | <code>Primary reference table defining the high-level classification of patient visits and hospital service lines. Contains standard categories including Inpatient (Hospitalization), Outpatient (Clinical visits), Emergency (ER), and Health Checkups (MCU). Use this table to group patient volume by service type, filter admission logs, or analyze revenue streams by visit category. Note: This is a static lookup list defining the 'Types' of visits; it does not contain actual patient visit transaction records.</code> | <code>Operational transaction table that records the detailed line items of an inventory stock take event. It captures the specific `ItemId` being counted, the quantity recorded in the system (`SystemStockQuantity`), the actual `PhysicalStockQuantity` counted, and the resulting `VarianceStockQuantity`. For batch-tracked items, it also includes `BatchNo` and `ExpiryDate`. **Use this table to** analyze inventory discrepancies, identify expired or expiring stock during a count, or audit the accuracy of stock records for a specific item. **Note: This table contains the detailed results of a specific stock count *event*; it does NOT provide the real-time inventory balance of an item.**</code> |
400
+ | <code>What is the total number of invoices and the total value of these invoices issued in Q1 2024 for female patients whose current status is 'Active'?</code> | <code>Primary reference table defining the validity and lifecycle status of a patient's master record. Contains status codes such as 'Active', 'Inactive', and 'Deceased' to classify patient accounts. **Use this table to** look up status meanings or filter patient queries based on their account state (e.g., excluding deceased patients from active lists). **Note: This is a lookup dictionary for status codes, NOT a transaction log of status changes for specific patients.**</code> | <code>Operational snapshot table that holds the **single, current financial state** of a patient at a specific hospital unit. Unlike a transaction log, this table does not track history; it contains only the final calculated totals: `BalanceAmount` (Total AR/Debt outstanding) and `DepositBalanceAmount` (Total Advance Payment available). **Use this table for** quick validation of whether a patient can be discharged, checking total debt before billing, or viewing available deposit funds. **Concept:** Think of this as the 'ATM Screen Balance' showing only the final amount available right now.</code> |
401
+ | <code>Tampilkan data Master Data Hospital Rooms (Room)</code> | <code>Primary reference table defining individual patient rooms within a hospital ward. It contains specific room codes and names, linking each room to its parent Ward (`WardId`) and hospital branch (`OrganizationId`), representing the most granular level of inpatient location master data. **Use this table to** map `RoomId` from bed management or admission transactions to a specific room number, analyze room availability, or manage hospital accommodation resources. **Note: This table lists the physical room definitions only, not the individual beds within a room or the real-time patient occupancy status.**</code> | <code>Operational transaction table recording the historical log of bed movements and transfers within the hospital organization. This table captures the exact timestamp of transfer events and identifies the specific Bed asset involved. Use this table to track the history of bed locations, audit bed usage timelines, or monitor the frequency of bed transfers. Note: This table focuses on the physical Bed entity's movement timeline; based on the provided columns, it does not explicitly contain Patient or Admission IDs.</code> |
402
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
403
+ ```json
404
+ {
405
+ "distance_metric": "TripletDistanceMetric.COSINE",
406
+ "triplet_margin": 1
407
+ }
408
+ ```
409
+
410
+ ### Training Hyperparameters
411
+ #### Non-Default Hyperparameters
412
+
413
+ - `eval_strategy`: steps
414
+ - `per_device_train_batch_size`: 16
415
+ - `per_device_eval_batch_size`: 16
416
+ - `gradient_accumulation_steps`: 4
417
+ - `learning_rate`: 2e-05
418
+ - `num_train_epochs`: 5
419
+ - `warmup_ratio`: 0.1
420
+ - `fp16`: True
421
+ - `load_best_model_at_end`: True
422
+ - `prompts`: {'anchor': ' ', 'positive': '', 'negative': ''}
423
+ - `batch_sampler`: no_duplicates
424
+
425
+ #### All Hyperparameters
426
+ <details><summary>Click to expand</summary>
427
+
428
+ - `overwrite_output_dir`: False
429
+ - `do_predict`: False
430
+ - `eval_strategy`: steps
431
+ - `prediction_loss_only`: True
432
+ - `per_device_train_batch_size`: 16
433
+ - `per_device_eval_batch_size`: 16
434
+ - `per_gpu_train_batch_size`: None
435
+ - `per_gpu_eval_batch_size`: None
436
+ - `gradient_accumulation_steps`: 4
437
+ - `eval_accumulation_steps`: None
438
+ - `torch_empty_cache_steps`: None
439
+ - `learning_rate`: 2e-05
440
+ - `weight_decay`: 0.0
441
+ - `adam_beta1`: 0.9
442
+ - `adam_beta2`: 0.999
443
+ - `adam_epsilon`: 1e-08
444
+ - `max_grad_norm`: 1.0
445
+ - `num_train_epochs`: 5
446
+ - `max_steps`: -1
447
+ - `lr_scheduler_type`: linear
448
+ - `lr_scheduler_kwargs`: None
449
+ - `warmup_ratio`: 0.1
450
+ - `warmup_steps`: 0
451
+ - `log_level`: passive
452
+ - `log_level_replica`: warning
453
+ - `log_on_each_node`: True
454
+ - `logging_nan_inf_filter`: True
455
+ - `save_safetensors`: True
456
+ - `save_on_each_node`: False
457
+ - `save_only_model`: False
458
+ - `restore_callback_states_from_checkpoint`: False
459
+ - `no_cuda`: False
460
+ - `use_cpu`: False
461
+ - `use_mps_device`: False
462
+ - `seed`: 42
463
+ - `data_seed`: None
464
+ - `jit_mode_eval`: False
465
+ - `bf16`: False
466
+ - `fp16`: True
467
+ - `fp16_opt_level`: O1
468
+ - `half_precision_backend`: auto
469
+ - `bf16_full_eval`: False
470
+ - `fp16_full_eval`: False
471
+ - `tf32`: None
472
+ - `local_rank`: 0
473
+ - `ddp_backend`: None
474
+ - `tpu_num_cores`: None
475
+ - `tpu_metrics_debug`: False
476
+ - `debug`: []
477
+ - `dataloader_drop_last`: False
478
+ - `dataloader_num_workers`: 0
479
+ - `dataloader_prefetch_factor`: None
480
+ - `past_index`: -1
481
+ - `disable_tqdm`: False
482
+ - `remove_unused_columns`: True
483
+ - `label_names`: None
484
+ - `load_best_model_at_end`: True
485
+ - `ignore_data_skip`: False
486
+ - `fsdp`: []
487
+ - `fsdp_min_num_params`: 0
488
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
489
+ - `fsdp_transformer_layer_cls_to_wrap`: None
490
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
491
+ - `parallelism_config`: None
492
+ - `deepspeed`: None
493
+ - `label_smoothing_factor`: 0.0
494
+ - `optim`: adamw_torch_fused
495
+ - `optim_args`: None
496
+ - `adafactor`: False
497
+ - `group_by_length`: False
498
+ - `length_column_name`: length
499
+ - `project`: huggingface
500
+ - `trackio_space_id`: trackio
501
+ - `ddp_find_unused_parameters`: None
502
+ - `ddp_bucket_cap_mb`: None
503
+ - `ddp_broadcast_buffers`: False
504
+ - `dataloader_pin_memory`: True
505
+ - `dataloader_persistent_workers`: False
506
+ - `skip_memory_metrics`: True
507
+ - `use_legacy_prediction_loop`: False
508
+ - `push_to_hub`: False
509
+ - `resume_from_checkpoint`: None
510
+ - `hub_model_id`: None
511
+ - `hub_strategy`: every_save
512
+ - `hub_private_repo`: None
513
+ - `hub_always_push`: False
514
+ - `hub_revision`: None
515
+ - `gradient_checkpointing`: False
516
+ - `gradient_checkpointing_kwargs`: None
517
+ - `include_inputs_for_metrics`: False
518
+ - `include_for_metrics`: []
519
+ - `eval_do_concat_batches`: True
520
+ - `fp16_backend`: auto
521
+ - `push_to_hub_model_id`: None
522
+ - `push_to_hub_organization`: None
523
+ - `mp_parameters`:
524
+ - `auto_find_batch_size`: False
525
+ - `full_determinism`: False
526
+ - `torchdynamo`: None
527
+ - `ray_scope`: last
528
+ - `ddp_timeout`: 1800
529
+ - `torch_compile`: False
530
+ - `torch_compile_backend`: None
531
+ - `torch_compile_mode`: None
532
+ - `include_tokens_per_second`: False
533
+ - `include_num_input_tokens_seen`: no
534
+ - `neftune_noise_alpha`: None
535
+ - `optim_target_modules`: None
536
+ - `batch_eval_metrics`: False
537
+ - `eval_on_start`: False
538
+ - `use_liger_kernel`: False
539
+ - `liger_kernel_config`: None
540
+ - `eval_use_gather_object`: False
541
+ - `average_tokens_across_devices`: True
542
+ - `prompts`: {'anchor': ' ', 'positive': '', 'negative': ''}
543
+ - `batch_sampler`: no_duplicates
544
+ - `multi_dataset_batch_sampler`: proportional
545
+ - `router_mapping`: {}
546
+ - `learning_rate_mapping`: {}
547
+
548
+ </details>
549
+
550
+ ### Training Logs
551
+ | Epoch | Step | Training Loss | Validation Loss | cosine_ndcg@10 |
552
+ |:------:|:----:|:-------------:|:---------------:|:--------------:|
553
+ | -1 | -1 | - | - | 0.0698 |
554
+ | 0.2381 | 5 | 0.7215 | 0.3602 | 0.0706 |
555
+ | 0.4762 | 10 | 0.3287 | 0.2451 | 0.0765 |
556
+ | 0.7143 | 15 | 0.2145 | 0.1896 | 0.0652 |
557
+ | 0.9524 | 20 | 0.2181 | 0.2286 | 0.0535 |
558
+ | 1.1905 | 25 | 0.2774 | 0.1610 | 0.0694 |
559
+ | 1.4286 | 30 | 0.1948 | 0.1818 | 0.0733 |
560
+ | -1 | -1 | - | - | 0.0733 |
561
+ | 0.2381 | 5 | 0.1696 | 0.1475 | 0.0705 |
562
+ | 0.4762 | 10 | 0.1657 | 0.2067 | 0.0661 |
563
+ | 0.7143 | 15 | 0.1346 | 0.1991 | 0.0635 |
564
+ | 0.9524 | 20 | 0.1197 | 0.1466 | 0.0473 |
565
+ | 1.1905 | 25 | 0.2657 | 0.1380 | 0.0544 |
566
+ | 1.4286 | 30 | 0.1706 | 0.1906 | 0.0719 |
567
+ | 1.6667 | 35 | 0.1686 | 0.1746 | 0.0741 |
568
+ | 1.9048 | 40 | 0.1217 | 0.1577 | 0.0669 |
569
+ | 2.1429 | 45 | 0.1191 | 0.1355 | 0.0606 |
570
+ | 2.3810 | 50 | 0.108 | 0.1362 | 0.0615 |
571
+ | 2.6190 | 55 | 0.1065 | 0.1595 | 0.0647 |
572
+ | 2.8571 | 60 | 0.1314 | 0.1126 | 0.0667 |
573
+ | 3.0952 | 65 | 0.072 | 0.0934 | 0.0589 |
574
+ | 3.3333 | 70 | 0.0868 | 0.0977 | 0.0655 |
575
+ | 3.5714 | 75 | 0.0719 | 0.1310 | 0.0496 |
576
+ | 3.8095 | 80 | 0.1184 | 0.1388 | 0.0681 |
577
+ | 4.0476 | 85 | 0.0997 | 0.1132 | 0.0656 |
578
+ | 4.2857 | 90 | 0.0659 | 0.1029 | 0.0724 |
579
+ | 4.5238 | 95 | 0.0554 | 0.1018 | 0.0707 |
580
+ | 4.7619 | 100 | 0.0729 | 0.0989 | 0.0659 |
581
+ | 5.0 | 105 | 0.0422 | 0.0971 | 0.0735 |
582
+ | -1 | -1 | - | - | 0.0735 |
583
+
584
+
585
+ ### Framework Versions
586
+ - Python: 3.12.12
587
+ - Sentence Transformers: 5.2.2
588
+ - Transformers: 4.57.6
589
+ - PyTorch: 2.10.0+cu128
590
+ - Accelerate: 1.12.0
591
+ - Datasets: 4.3.0
592
+ - Tokenizers: 0.22.2
593
+
594
+ ## Citation
595
+
596
+ ### BibTeX
597
+
598
+ #### Sentence Transformers
599
+ ```bibtex
600
+ @inproceedings{reimers-2019-sentence-bert,
601
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
602
+ author = "Reimers, Nils and Gurevych, Iryna",
603
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
604
+ month = "11",
605
+ year = "2019",
606
+ publisher = "Association for Computational Linguistics",
607
+ url = "https://arxiv.org/abs/1908.10084",
608
+ }
609
+ ```
610
+
611
+ #### TripletLoss
612
+ ```bibtex
613
+ @misc{hermans2017defense,
614
+ title={In Defense of the Triplet Loss for Person Re-Identification},
615
+ author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
616
+ year={2017},
617
+ eprint={1703.07737},
618
+ archivePrefix={arXiv},
619
+ primaryClass={cs.CV}
620
+ }
621
+ ```
622
+
623
+ <!--
624
+ ## Glossary
625
+
626
+ *Clearly define terms in order to be accessible across audiences.*
627
+ -->
628
+
629
+ <!--
630
+ ## Model Card Authors
631
+
632
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
633
+ -->
634
+
635
+ <!--
636
+ ## Model Card Contact
637
+
638
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
639
+ -->
config.json ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_sliding_window_pattern": 6,
3
+ "architectures": [
4
+ "Gemma3TextModel"
5
+ ],
6
+ "attention_bias": false,
7
+ "attention_dropout": 0.0,
8
+ "attn_logit_softcapping": null,
9
+ "bos_token_id": 2,
10
+ "dtype": "float32",
11
+ "eos_token_id": 1,
12
+ "final_logit_softcapping": null,
13
+ "head_dim": 256,
14
+ "hidden_activation": "gelu_pytorch_tanh",
15
+ "hidden_size": 768,
16
+ "initializer_range": 0.02,
17
+ "intermediate_size": 1152,
18
+ "layer_types": [
19
+ "sliding_attention",
20
+ "sliding_attention",
21
+ "sliding_attention",
22
+ "sliding_attention",
23
+ "sliding_attention",
24
+ "full_attention",
25
+ "sliding_attention",
26
+ "sliding_attention",
27
+ "sliding_attention",
28
+ "sliding_attention",
29
+ "sliding_attention",
30
+ "full_attention",
31
+ "sliding_attention",
32
+ "sliding_attention",
33
+ "sliding_attention",
34
+ "sliding_attention",
35
+ "sliding_attention",
36
+ "full_attention",
37
+ "sliding_attention",
38
+ "sliding_attention",
39
+ "sliding_attention",
40
+ "sliding_attention",
41
+ "sliding_attention",
42
+ "full_attention"
43
+ ],
44
+ "max_position_embeddings": 2048,
45
+ "model_type": "gemma3_text",
46
+ "num_attention_heads": 3,
47
+ "num_hidden_layers": 24,
48
+ "num_key_value_heads": 1,
49
+ "pad_token_id": 0,
50
+ "query_pre_attn_scalar": 256,
51
+ "rms_norm_eps": 1e-06,
52
+ "rope_parameters": {
53
+ "full_attention": {
54
+ "rope_theta": 1000000.0,
55
+ "rope_type": "default"
56
+ },
57
+ "sliding_attention": {
58
+ "rope_theta": 10000.0,
59
+ "rope_type": "default"
60
+ }
61
+ },
62
+ "sliding_window": 129,
63
+ "tie_word_embeddings": true,
64
+ "tokenizer_class": "GemmaTokenizerFast",
65
+ "transformers_version": "5.0.0",
66
+ "unsloth_version": "2026.1.4",
67
+ "use_bidirectional_attention": true,
68
+ "use_cache": true,
69
+ "vocab_size": 262144
70
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "model_type": "SentenceTransformer",
3
+ "__version__": {
4
+ "sentence_transformers": "5.2.2",
5
+ "transformers": "5.0.0",
6
+ "pytorch": "2.9.0+cpu"
7
+ },
8
+ "prompts": {
9
+ "query": "",
10
+ "document": ""
11
+ },
12
+ "default_prompt_name": null,
13
+ "similarity_fn_name": "cosine"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6033f19eb1e6999d8daf87ac6e6845dc43f1b8cfacd786b122908ff00c86d8ea
3
+ size 1211486072
modules.json ADDED
@@ -0,0 +1,32 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Dense",
18
+ "type": "sentence_transformers.models.Dense"
19
+ },
20
+ {
21
+ "idx": 3,
22
+ "name": "3",
23
+ "path": "3_Dense",
24
+ "type": "sentence_transformers.models.Dense"
25
+ },
26
+ {
27
+ "idx": 4,
28
+ "name": "4",
29
+ "path": "4_Normalize",
30
+ "type": "sentence_transformers.models.Normalize"
31
+ }
32
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 768,
3
+ "do_lower_case": false
4
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e7496f19b8099821eb489dbc4f880dd342842dd931fd4e8a111b70fd00d89853
3
+ size 33384883
tokenizer_config.json ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "backend": "tokenizers",
3
+ "boi_token": "<start_of_image>",
4
+ "bos_token": "<bos>",
5
+ "clean_up_tokenization_spaces": false,
6
+ "eoi_token": "<end_of_image>",
7
+ "eos_token": "<eos>",
8
+ "image_token": "<image_soft_token>",
9
+ "is_local": true,
10
+ "mask_token": "<mask>",
11
+ "model_max_length": 768,
12
+ "model_specific_special_tokens": {
13
+ "boi_token": "<start_of_image>",
14
+ "eoi_token": "<end_of_image>",
15
+ "image_token": "<image_soft_token>"
16
+ },
17
+ "pad_token": "<pad>",
18
+ "padding_side": "left",
19
+ "sp_model_kwargs": null,
20
+ "spaces_between_special_tokens": false,
21
+ "tokenizer_class": "GemmaTokenizer",
22
+ "unk_token": "<unk>",
23
+ "use_default_system_prompt": false
24
+ }