macpaw-research/mnemos_entity_extractor_v1_small
Browse files- README.md +62 -60
- config.json +79 -79
- model.safetensors +1 -1
- runs/Nov05_18-00-40_21820ec588a1/events.out.tfevents.1762365675.21820ec588a1.374.0 +3 -0
- runs/Nov05_18-00-40_21820ec588a1/events.out.tfevents.1762366113.21820ec588a1.374.1 +3 -0
- runs/Nov05_18-23-52_21820ec588a1/events.out.tfevents.1762367074.21820ec588a1.374.2 +3 -0
- runs/Nov05_18-23-52_21820ec588a1/events.out.tfevents.1762367470.21820ec588a1.374.3 +3 -0
- runs/Nov06_16-09-04_9857a63c5837/events.out.tfevents.1762445386.9857a63c5837.5460.0 +3 -0
- runs/Nov06_16-09-04_9857a63c5837/events.out.tfevents.1762445965.9857a63c5837.5460.1 +3 -0
- runs/Oct21_19-47-14_c090c1c7e874/events.out.tfevents.1761076072.c090c1c7e874.4195.0 +3 -0
- runs/Oct21_19-47-14_c090c1c7e874/events.out.tfevents.1761076439.c090c1c7e874.4195.1 +3 -0
- runs/Oct21_20-14-14_c090c1c7e874/events.out.tfevents.1761077692.c090c1c7e874.4195.2 +3 -0
- runs/Oct21_20-14-14_c090c1c7e874/events.out.tfevents.1761078080.c090c1c7e874.4195.3 +3 -0
- runs/Oct21_20-22-17_c090c1c7e874/events.out.tfevents.1761078175.c090c1c7e874.4195.4 +3 -0
- runs/Oct21_20-22-17_c090c1c7e874/events.out.tfevents.1761078577.c090c1c7e874.4195.5 +3 -0
- runs/Oct21_20-36-54_c090c1c7e874/events.out.tfevents.1761079052.c090c1c7e874.17733.0 +3 -0
- runs/Oct21_20-36-54_c090c1c7e874/events.out.tfevents.1761079414.c090c1c7e874.17733.1 +3 -0
- tokenizer_config.json +1 -1
- training_args.bin +1 -1
README.md
CHANGED
|
@@ -6,16 +6,18 @@ tags:
|
|
| 6 |
- named-entity-recognition
|
| 7 |
- generated_from_span_marker_trainer
|
| 8 |
widget:
|
| 9 |
-
- text:
|
| 10 |
-
|
| 11 |
-
- text:
|
| 12 |
-
|
| 13 |
-
|
| 14 |
-
|
| 15 |
-
|
| 16 |
-
|
| 17 |
-
|
| 18 |
-
|
|
|
|
|
|
|
| 19 |
pipeline_tag: token-classification
|
| 20 |
library_name: span-marker
|
| 21 |
metrics:
|
|
@@ -34,13 +36,13 @@ model-index:
|
|
| 34 |
split: eval
|
| 35 |
metrics:
|
| 36 |
- type: f1
|
| 37 |
-
value: 0.
|
| 38 |
name: F1
|
| 39 |
- type: precision
|
| 40 |
-
value: 0.
|
| 41 |
name: Precision
|
| 42 |
- type: recall
|
| 43 |
-
value: 0.
|
| 44 |
name: Recall
|
| 45 |
---
|
| 46 |
|
|
@@ -54,7 +56,7 @@ This is a [SpanMarker](https://github.com/tomaarsen/SpanMarkerNER) model that ca
|
|
| 54 |
- **Model Type:** SpanMarker
|
| 55 |
<!-- - **Encoder:** [Unknown](https://huggingface.co/unknown) -->
|
| 56 |
- **Maximum Sequence Length:** 512 tokens
|
| 57 |
-
- **Maximum Entity Length:**
|
| 58 |
<!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
|
| 59 |
<!-- - **Language:** Unknown -->
|
| 60 |
<!-- - **License:** Unknown -->
|
|
@@ -65,51 +67,51 @@ This is a [SpanMarker](https://github.com/tomaarsen/SpanMarkerNER) model that ca
|
|
| 65 |
- **Thesis:** [SpanMarker For Named Entity Recognition](https://raw.githubusercontent.com/tomaarsen/SpanMarkerNER/main/thesis.pdf)
|
| 66 |
|
| 67 |
### Model Labels
|
| 68 |
-
| Label | Examples
|
| 69 |
-
|:---------------|:----------------------------------------------------------------|
|
| 70 |
-
| action | "
|
| 71 |
-
| app_data_type | "
|
| 72 |
-
| app_name | "
|
| 73 |
-
|
|
| 74 |
-
|
|
| 75 |
-
|
|
| 76 |
-
|
|
| 77 |
-
|
|
| 78 |
-
|
|
| 79 |
-
|
|
| 80 |
-
|
|
| 81 |
-
|
|
| 82 |
-
|
|
| 83 |
-
|
|
| 84 |
-
|
|
| 85 |
-
|
|
| 86 |
-
| system_command | "
|
| 87 |
-
| time | "
|
| 88 |
|
| 89 |
## Evaluation
|
| 90 |
|
| 91 |
### Metrics
|
| 92 |
| Label | Precision | Recall | F1 |
|
| 93 |
|:---------------|:----------|:-------|:-------|
|
| 94 |
-
| **all** | 0.
|
| 95 |
-
| action | 0.
|
| 96 |
-
| app_data_type | 0.
|
| 97 |
-
| app_name | 0.
|
| 98 |
-
|
|
| 99 |
-
|
|
| 100 |
-
|
|
| 101 |
-
|
|
| 102 |
-
|
|
| 103 |
-
|
|
| 104 |
-
|
|
| 105 |
-
|
|
| 106 |
-
|
|
| 107 |
-
|
|
| 108 |
-
|
|
| 109 |
-
|
|
| 110 |
-
|
|
| 111 |
-
| system_command | 0.
|
| 112 |
-
| time | 0.
|
| 113 |
|
| 114 |
## Uses
|
| 115 |
|
|
@@ -121,7 +123,7 @@ from span_marker import SpanMarkerModel
|
|
| 121 |
# Download from the 🤗 Hub
|
| 122 |
model = SpanMarkerModel.from_pretrained("span_marker_model_id")
|
| 123 |
# Run inference
|
| 124 |
-
entities = model.predict("
|
| 125 |
```
|
| 126 |
|
| 127 |
### Downstream Use
|
|
@@ -172,8 +174,8 @@ trainer.save_model("span_marker_model_id-finetuned")
|
|
| 172 |
### Training Set Metrics
|
| 173 |
| Training set | Min | Median | Max |
|
| 174 |
|:----------------------|:----|:--------|:----|
|
| 175 |
-
| Sentence length |
|
| 176 |
-
| Entities per sentence |
|
| 177 |
|
| 178 |
### Training Hyperparameters
|
| 179 |
- learning_rate: 5e-05
|
|
@@ -183,17 +185,17 @@ trainer.save_model("span_marker_model_id-finetuned")
|
|
| 183 |
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 184 |
- lr_scheduler_type: linear
|
| 185 |
- lr_scheduler_warmup_ratio: 0.1
|
| 186 |
-
- num_epochs:
|
| 187 |
- mixed_precision_training: Native AMP
|
| 188 |
|
| 189 |
### Training Results
|
| 190 |
| Epoch | Step | Validation Loss | Validation Precision | Validation Recall | Validation F1 | Validation Accuracy |
|
| 191 |
|:------:|:----:|:---------------:|:--------------------:|:-----------------:|:-------------:|:-------------------:|
|
| 192 |
-
| 1.
|
| 193 |
-
| 3.
|
| 194 |
|
| 195 |
### Framework Versions
|
| 196 |
-
- Python: 3.12.
|
| 197 |
- SpanMarker: 1.7.0
|
| 198 |
- Transformers: 4.51.3
|
| 199 |
- PyTorch: 2.8.0+cu126
|
|
|
|
| 6 |
- named-entity-recognition
|
| 7 |
- generated_from_span_marker_trainer
|
| 8 |
widget:
|
| 9 |
+
- text: On 07 Nov, send my brother the summary from section 2 of the document and
|
| 10 |
+
enable airplane mode on my phone
|
| 11 |
+
- text: Could you please share the' Budget Reports' folder with me and update the
|
| 12 |
+
notification settings in Slack before the Quarterly Review Meeting? Also, send
|
| 13 |
+
the details to my email at emily . chen @ workmail . com
|
| 14 |
+
- text: Find all images from March 3rd that are less than 1MB, and read out the caption
|
| 15 |
+
under figure 5 . Set the device to silent mode
|
| 16 |
+
- text: Please send the document named annual_report_2023 . xlsx from the Finance
|
| 17 |
+
folder, specifically the summary on page 5, to my manager at manager @ acme .
|
| 18 |
+
com
|
| 19 |
+
- text: Text my mother at + 44 7911 123456 the summary from paragraph 4, and then
|
| 20 |
+
enable bluetooth
|
| 21 |
pipeline_tag: token-classification
|
| 22 |
library_name: span-marker
|
| 23 |
metrics:
|
|
|
|
| 36 |
split: eval
|
| 37 |
metrics:
|
| 38 |
- type: f1
|
| 39 |
+
value: 0.8683998712169995
|
| 40 |
name: F1
|
| 41 |
- type: precision
|
| 42 |
+
value: 0.8558622877994606
|
| 43 |
name: Precision
|
| 44 |
- type: recall
|
| 45 |
+
value: 0.8813102434242771
|
| 46 |
name: Recall
|
| 47 |
---
|
| 48 |
|
|
|
|
| 56 |
- **Model Type:** SpanMarker
|
| 57 |
<!-- - **Encoder:** [Unknown](https://huggingface.co/unknown) -->
|
| 58 |
- **Maximum Sequence Length:** 512 tokens
|
| 59 |
+
- **Maximum Entity Length:** 12 words
|
| 60 |
<!-- - **Training Dataset:** [Unknown](https://huggingface.co/datasets/unknown) -->
|
| 61 |
<!-- - **Language:** Unknown -->
|
| 62 |
<!-- - **License:** Unknown -->
|
|
|
|
| 67 |
- **Thesis:** [SpanMarker For Named Entity Recognition](https://raw.githubusercontent.com/tomaarsen/SpanMarkerNER/main/thesis.pdf)
|
| 68 |
|
| 69 |
### Model Labels
|
| 70 |
+
| Label | Examples |
|
| 71 |
+
|:---------------|:--------------------------------------------------------------------------------|
|
| 72 |
+
| action | "Remind", "scheduled", "review" |
|
| 73 |
+
| app_data_type | "items", "images", "videos" |
|
| 74 |
+
| app_name | "Camera", "phone", "Slack" |
|
| 75 |
+
| contact_info | "sarah . lee @ company . org", "123 Maple Street , Springfield", "home address" |
|
| 76 |
+
| date | "20 . 10 . 1999", "before", "January 18 - June 15" |
|
| 77 |
+
| event_title | "team sync", "Marketing Strategy Meeting", "Budget Planning" |
|
| 78 |
+
| file_name | "notes", "budget_overview . xlsx", "project_plan . docx" |
|
| 79 |
+
| file_size | "under 500 kb", "smaller than 50 kb", "exceeding 100 mb" |
|
| 80 |
+
| file_type | "documents", "document", "image" |
|
| 81 |
+
| folder_name | "Projects", "Work", "Photos" |
|
| 82 |
+
| in_file_data | "appendix section", "page 10", "section 5" |
|
| 83 |
+
| limits | "top 8", "all", "every" |
|
| 84 |
+
| location | "Room 204", "server room", "library" |
|
| 85 |
+
| person_name | "Jonathan Kim", "Mr . Osei", "Lucas Müller" |
|
| 86 |
+
| relationship | "manager", "brother", "cousin" |
|
| 87 |
+
| setting | "brightness", "airplane mode", "notifications" |
|
| 88 |
+
| system_command | "disable", "move", "switch on" |
|
| 89 |
+
| time | "9 : 00 AM", "10 : 45", "10 : 00 AM" |
|
| 90 |
|
| 91 |
## Evaluation
|
| 92 |
|
| 93 |
### Metrics
|
| 94 |
| Label | Precision | Recall | F1 |
|
| 95 |
|:---------------|:----------|:-------|:-------|
|
| 96 |
+
| **all** | 0.8559 | 0.8813 | 0.8684 |
|
| 97 |
+
| action | 0.8173 | 0.9245 | 0.8676 |
|
| 98 |
+
| app_data_type | 0.7960 | 0.6828 | 0.7351 |
|
| 99 |
+
| app_name | 0.9432 | 0.9432 | 0.9432 |
|
| 100 |
+
| contact_info | 0.8722 | 0.9091 | 0.8903 |
|
| 101 |
+
| date | 0.9160 | 0.8993 | 0.9076 |
|
| 102 |
+
| event_title | 0.8659 | 0.9107 | 0.8877 |
|
| 103 |
+
| file_name | 0.9371 | 0.9280 | 0.9326 |
|
| 104 |
+
| file_size | 0.7810 | 0.7810 | 0.7810 |
|
| 105 |
+
| file_type | 0.7731 | 0.8786 | 0.8225 |
|
| 106 |
+
| folder_name | 0.9618 | 0.8968 | 0.9282 |
|
| 107 |
+
| in_file_data | 0.7486 | 0.7867 | 0.7672 |
|
| 108 |
+
| limits | 0.9048 | 0.6786 | 0.7755 |
|
| 109 |
+
| location | 0.8917 | 0.8571 | 0.8741 |
|
| 110 |
+
| person_name | 0.9885 | 0.9885 | 0.9885 |
|
| 111 |
+
| relationship | 0.9505 | 0.9541 | 0.9523 |
|
| 112 |
+
| setting | 0.8974 | 0.9255 | 0.9112 |
|
| 113 |
+
| system_command | 0.7889 | 0.7441 | 0.7659 |
|
| 114 |
+
| time | 0.9076 | 0.8587 | 0.8825 |
|
| 115 |
|
| 116 |
## Uses
|
| 117 |
|
|
|
|
| 123 |
# Download from the 🤗 Hub
|
| 124 |
model = SpanMarkerModel.from_pretrained("span_marker_model_id")
|
| 125 |
# Run inference
|
| 126 |
+
entities = model.predict("Text my mother at + 44 7911 123456 the summary from paragraph 4, and then enable bluetooth")
|
| 127 |
```
|
| 128 |
|
| 129 |
### Downstream Use
|
|
|
|
| 174 |
### Training Set Metrics
|
| 175 |
| Training set | Min | Median | Max |
|
| 176 |
|:----------------------|:----|:--------|:----|
|
| 177 |
+
| Sentence length | 3 | 19.0206 | 53 |
|
| 178 |
+
| Entities per sentence | 1 | 5.7015 | 13 |
|
| 179 |
|
| 180 |
### Training Hyperparameters
|
| 181 |
- learning_rate: 5e-05
|
|
|
|
| 185 |
- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
|
| 186 |
- lr_scheduler_type: linear
|
| 187 |
- lr_scheduler_warmup_ratio: 0.1
|
| 188 |
+
- num_epochs: 5
|
| 189 |
- mixed_precision_training: Native AMP
|
| 190 |
|
| 191 |
### Training Results
|
| 192 |
| Epoch | Step | Validation Loss | Validation Precision | Validation Recall | Validation F1 | Validation Accuracy |
|
| 193 |
|:------:|:----:|:---------------:|:--------------------:|:-----------------:|:-------------:|:-------------------:|
|
| 194 |
+
| 1.8553 | 1000 | 0.0344 | 0.8301 | 0.8650 | 0.8472 | 0.9204 |
|
| 195 |
+
| 3.7106 | 2000 | 0.0271 | 0.8524 | 0.8804 | 0.8662 | 0.9316 |
|
| 196 |
|
| 197 |
### Framework Versions
|
| 198 |
+
- Python: 3.12.12
|
| 199 |
- SpanMarker: 1.7.0
|
| 200 |
- Transformers: 4.51.3
|
| 201 |
- PyTorch: 2.8.0+cu126
|
config.json
CHANGED
|
@@ -35,37 +35,37 @@
|
|
| 35 |
"1": "B-action",
|
| 36 |
"2": "B-app_data_type",
|
| 37 |
"3": "B-app_name",
|
| 38 |
-
"4": "B-
|
| 39 |
-
"5": "B-
|
| 40 |
-
"6": "B-
|
| 41 |
-
"7": "B-
|
| 42 |
-
"8": "B-
|
| 43 |
-
"9": "B-
|
| 44 |
-
"10": "B-
|
| 45 |
-
"11": "B-
|
| 46 |
-
"12": "B-
|
| 47 |
-
"13": "B-
|
| 48 |
-
"14": "B-
|
| 49 |
-
"15": "B-
|
| 50 |
-
"16": "B-
|
| 51 |
"17": "B-system_command",
|
| 52 |
"18": "B-time",
|
| 53 |
"19": "I-action",
|
| 54 |
"20": "I-app_data_type",
|
| 55 |
"21": "I-app_name",
|
| 56 |
-
"22": "I-
|
| 57 |
-
"23": "I-
|
| 58 |
-
"24": "I-
|
| 59 |
-
"25": "I-
|
| 60 |
-
"26": "I-
|
| 61 |
-
"27": "I-
|
| 62 |
-
"28": "I-
|
| 63 |
-
"29": "I-
|
| 64 |
-
"30": "I-
|
| 65 |
-
"31": "I-
|
| 66 |
-
"32": "I-
|
| 67 |
-
"33": "I-
|
| 68 |
-
"34": "I-
|
| 69 |
"35": "I-system_command",
|
| 70 |
"36": "I-time"
|
| 71 |
},
|
|
@@ -77,37 +77,37 @@
|
|
| 77 |
"B-action": 1,
|
| 78 |
"B-app_data_type": 2,
|
| 79 |
"B-app_name": 3,
|
| 80 |
-
"B-
|
| 81 |
-
"B-
|
| 82 |
-
"B-
|
| 83 |
-
"B-
|
| 84 |
-
"B-
|
| 85 |
-
"B-
|
| 86 |
-
"B-
|
| 87 |
-
"B-
|
| 88 |
-
"B-
|
| 89 |
-
"B-
|
| 90 |
-
"B-
|
| 91 |
-
"B-
|
| 92 |
-
"B-
|
| 93 |
"B-system_command": 17,
|
| 94 |
"B-time": 18,
|
| 95 |
"I-action": 19,
|
| 96 |
"I-app_data_type": 20,
|
| 97 |
"I-app_name": 21,
|
| 98 |
-
"I-
|
| 99 |
-
"I-
|
| 100 |
-
"I-
|
| 101 |
-
"I-
|
| 102 |
-
"I-
|
| 103 |
-
"I-
|
| 104 |
-
"I-
|
| 105 |
-
"I-
|
| 106 |
-
"I-
|
| 107 |
-
"I-
|
| 108 |
-
"I-
|
| 109 |
-
"I-
|
| 110 |
-
"I-
|
| 111 |
"I-system_command": 35,
|
| 112 |
"I-time": 36,
|
| 113 |
"O": 0
|
|
@@ -155,25 +155,25 @@
|
|
| 155 |
"use_cache": true,
|
| 156 |
"vocab_size": 29000
|
| 157 |
},
|
| 158 |
-
"entity_max_length":
|
| 159 |
"id2label": {
|
| 160 |
"0": "O",
|
| 161 |
"1": "action",
|
| 162 |
"2": "app_data_type",
|
| 163 |
"3": "app_name",
|
| 164 |
-
"4": "
|
| 165 |
-
"5": "
|
| 166 |
-
"6": "
|
| 167 |
-
"7": "
|
| 168 |
-
"8": "
|
| 169 |
-
"9": "
|
| 170 |
-
"10": "
|
| 171 |
-
"11": "
|
| 172 |
-
"12": "
|
| 173 |
-
"13": "
|
| 174 |
-
"14": "
|
| 175 |
-
"15": "
|
| 176 |
-
"16": "
|
| 177 |
"17": "system_command",
|
| 178 |
"18": "time"
|
| 179 |
},
|
|
@@ -221,19 +221,19 @@
|
|
| 221 |
"action": 1,
|
| 222 |
"app_data_type": 2,
|
| 223 |
"app_name": 3,
|
| 224 |
-
"
|
| 225 |
-
"
|
| 226 |
-
"
|
| 227 |
-
"
|
| 228 |
-
"
|
| 229 |
-
"
|
| 230 |
-
"
|
| 231 |
-
"
|
| 232 |
-
"
|
| 233 |
-
"
|
| 234 |
-
"
|
| 235 |
-
"
|
| 236 |
-
"
|
| 237 |
"system_command": 17,
|
| 238 |
"time": 18
|
| 239 |
},
|
|
|
|
| 35 |
"1": "B-action",
|
| 36 |
"2": "B-app_data_type",
|
| 37 |
"3": "B-app_name",
|
| 38 |
+
"4": "B-contact_info",
|
| 39 |
+
"5": "B-date",
|
| 40 |
+
"6": "B-event_title",
|
| 41 |
+
"7": "B-file_name",
|
| 42 |
+
"8": "B-file_size",
|
| 43 |
+
"9": "B-file_type",
|
| 44 |
+
"10": "B-folder_name",
|
| 45 |
+
"11": "B-in_file_data",
|
| 46 |
+
"12": "B-limits",
|
| 47 |
+
"13": "B-location",
|
| 48 |
+
"14": "B-person_name",
|
| 49 |
+
"15": "B-relationship",
|
| 50 |
+
"16": "B-setting",
|
| 51 |
"17": "B-system_command",
|
| 52 |
"18": "B-time",
|
| 53 |
"19": "I-action",
|
| 54 |
"20": "I-app_data_type",
|
| 55 |
"21": "I-app_name",
|
| 56 |
+
"22": "I-contact_info",
|
| 57 |
+
"23": "I-date",
|
| 58 |
+
"24": "I-event_title",
|
| 59 |
+
"25": "I-file_name",
|
| 60 |
+
"26": "I-file_size",
|
| 61 |
+
"27": "I-file_type",
|
| 62 |
+
"28": "I-folder_name",
|
| 63 |
+
"29": "I-in_file_data",
|
| 64 |
+
"30": "I-limits",
|
| 65 |
+
"31": "I-location",
|
| 66 |
+
"32": "I-person_name",
|
| 67 |
+
"33": "I-relationship",
|
| 68 |
+
"34": "I-setting",
|
| 69 |
"35": "I-system_command",
|
| 70 |
"36": "I-time"
|
| 71 |
},
|
|
|
|
| 77 |
"B-action": 1,
|
| 78 |
"B-app_data_type": 2,
|
| 79 |
"B-app_name": 3,
|
| 80 |
+
"B-contact_info": 4,
|
| 81 |
+
"B-date": 5,
|
| 82 |
+
"B-event_title": 6,
|
| 83 |
+
"B-file_name": 7,
|
| 84 |
+
"B-file_size": 8,
|
| 85 |
+
"B-file_type": 9,
|
| 86 |
+
"B-folder_name": 10,
|
| 87 |
+
"B-in_file_data": 11,
|
| 88 |
+
"B-limits": 12,
|
| 89 |
+
"B-location": 13,
|
| 90 |
+
"B-person_name": 14,
|
| 91 |
+
"B-relationship": 15,
|
| 92 |
+
"B-setting": 16,
|
| 93 |
"B-system_command": 17,
|
| 94 |
"B-time": 18,
|
| 95 |
"I-action": 19,
|
| 96 |
"I-app_data_type": 20,
|
| 97 |
"I-app_name": 21,
|
| 98 |
+
"I-contact_info": 22,
|
| 99 |
+
"I-date": 23,
|
| 100 |
+
"I-event_title": 24,
|
| 101 |
+
"I-file_name": 25,
|
| 102 |
+
"I-file_size": 26,
|
| 103 |
+
"I-file_type": 27,
|
| 104 |
+
"I-folder_name": 28,
|
| 105 |
+
"I-in_file_data": 29,
|
| 106 |
+
"I-limits": 30,
|
| 107 |
+
"I-location": 31,
|
| 108 |
+
"I-person_name": 32,
|
| 109 |
+
"I-relationship": 33,
|
| 110 |
+
"I-setting": 34,
|
| 111 |
"I-system_command": 35,
|
| 112 |
"I-time": 36,
|
| 113 |
"O": 0
|
|
|
|
| 155 |
"use_cache": true,
|
| 156 |
"vocab_size": 29000
|
| 157 |
},
|
| 158 |
+
"entity_max_length": 12,
|
| 159 |
"id2label": {
|
| 160 |
"0": "O",
|
| 161 |
"1": "action",
|
| 162 |
"2": "app_data_type",
|
| 163 |
"3": "app_name",
|
| 164 |
+
"4": "contact_info",
|
| 165 |
+
"5": "date",
|
| 166 |
+
"6": "event_title",
|
| 167 |
+
"7": "file_name",
|
| 168 |
+
"8": "file_size",
|
| 169 |
+
"9": "file_type",
|
| 170 |
+
"10": "folder_name",
|
| 171 |
+
"11": "in_file_data",
|
| 172 |
+
"12": "limits",
|
| 173 |
+
"13": "location",
|
| 174 |
+
"14": "person_name",
|
| 175 |
+
"15": "relationship",
|
| 176 |
+
"16": "setting",
|
| 177 |
"17": "system_command",
|
| 178 |
"18": "time"
|
| 179 |
},
|
|
|
|
| 221 |
"action": 1,
|
| 222 |
"app_data_type": 2,
|
| 223 |
"app_name": 3,
|
| 224 |
+
"contact_info": 4,
|
| 225 |
+
"date": 5,
|
| 226 |
+
"event_title": 6,
|
| 227 |
+
"file_name": 7,
|
| 228 |
+
"file_size": 8,
|
| 229 |
+
"file_type": 9,
|
| 230 |
+
"folder_name": 10,
|
| 231 |
+
"in_file_data": 11,
|
| 232 |
+
"limits": 12,
|
| 233 |
+
"location": 13,
|
| 234 |
+
"person_name": 14,
|
| 235 |
+
"relationship": 15,
|
| 236 |
+
"setting": 16,
|
| 237 |
"system_command": 17,
|
| 238 |
"time": 18
|
| 239 |
},
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 433394300
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:08d86be905c6b32aff2b8be742fb1eb4d425ce9600ceebd72bac292947b7837b
|
| 3 |
size 433394300
|
runs/Nov05_18-00-40_21820ec588a1/events.out.tfevents.1762365675.21820ec588a1.374.0
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:869e0c679fa317b8d95d549f9ca04e3a0078246e81e1f180706345d966e90d32
|
| 3 |
+
size 22025
|
runs/Nov05_18-00-40_21820ec588a1/events.out.tfevents.1762366113.21820ec588a1.374.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:79d985b1d20645c78c1b3ac32bb873e8b253a8cae15104e73ff28f012217c152
|
| 3 |
+
size 592
|
runs/Nov05_18-23-52_21820ec588a1/events.out.tfevents.1762367074.21820ec588a1.374.2
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e92753fc3d7986db88d7b2de0cc5c9f1beb0d897bbd7bae67455d7a35557379c
|
| 3 |
+
size 22869
|
runs/Nov05_18-23-52_21820ec588a1/events.out.tfevents.1762367470.21820ec588a1.374.3
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:46c4fc0c98d60e4c425c22656f76adc78508351dc2293dc2e33532478164181f
|
| 3 |
+
size 592
|
runs/Nov06_16-09-04_9857a63c5837/events.out.tfevents.1762445386.9857a63c5837.5460.0
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:981258f9f48640a8b936f1d61e08335ada0d5a33fe53e39a3b4fd5f2af6fc910
|
| 3 |
+
size 22870
|
runs/Nov06_16-09-04_9857a63c5837/events.out.tfevents.1762445965.9857a63c5837.5460.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:915cc5a6326f6ac7c2e3ee34d9410026318446679a4dcd15c45a57024474616b
|
| 3 |
+
size 592
|
runs/Oct21_19-47-14_c090c1c7e874/events.out.tfevents.1761076072.c090c1c7e874.4195.0
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:61784a85c19debc5404ce4f6806cd97a2fe5e4e3d11932cc533b77de55593e18
|
| 3 |
+
size 20049
|
runs/Oct21_19-47-14_c090c1c7e874/events.out.tfevents.1761076439.c090c1c7e874.4195.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:476ea552d60f9d0d9cb3b746ba79af3b76833517124f7ae8d8d45ff143eb1461
|
| 3 |
+
size 592
|
runs/Oct21_20-14-14_c090c1c7e874/events.out.tfevents.1761077692.c090c1c7e874.4195.2
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3ce3783a453c78f27e51fe7278a38afa0e3b5a9f39696a80832c05e195e4e319
|
| 3 |
+
size 18607
|
runs/Oct21_20-14-14_c090c1c7e874/events.out.tfevents.1761078080.c090c1c7e874.4195.3
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:37b27517ccbd1baca23016914d07e1aa966b896c5aa8a87ac5cf9fc06bea4253
|
| 3 |
+
size 592
|
runs/Oct21_20-22-17_c090c1c7e874/events.out.tfevents.1761078175.c090c1c7e874.4195.4
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:24b1aedd53e1d5480bf951007a7792f7d1774910b076a33e6b5b7c6f2ba1641e
|
| 3 |
+
size 21656
|
runs/Oct21_20-22-17_c090c1c7e874/events.out.tfevents.1761078577.c090c1c7e874.4195.5
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ac972dc18dc55ea42a4437b5845442db1130047e3a0c732372dddee1b5e2bc52
|
| 3 |
+
size 592
|
runs/Oct21_20-36-54_c090c1c7e874/events.out.tfevents.1761079052.c090c1c7e874.17733.0
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1f35252800a2032c0dc1f066cd6bf50d38c349a1add6d50005e1b4c405ae88ed
|
| 3 |
+
size 21656
|
runs/Oct21_20-36-54_c090c1c7e874/events.out.tfevents.1761079414.c090c1c7e874.17733.1
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b8f7176b419be24f01fba37e3e41060842b97a056197f6cba9a56954e69e8c17
|
| 3 |
+
size 592
|
tokenizer_config.json
CHANGED
|
@@ -61,7 +61,7 @@
|
|
| 61 |
"clean_up_tokenization_spaces": false,
|
| 62 |
"cls_token": "[CLS]",
|
| 63 |
"do_lower_case": false,
|
| 64 |
-
"entity_max_length":
|
| 65 |
"extra_special_tokens": {},
|
| 66 |
"marker_max_length": 128,
|
| 67 |
"mask_token": "[MASK]",
|
|
|
|
| 61 |
"clean_up_tokenization_spaces": false,
|
| 62 |
"cls_token": "[CLS]",
|
| 63 |
"do_lower_case": false,
|
| 64 |
+
"entity_max_length": 12,
|
| 65 |
"extra_special_tokens": {},
|
| 66 |
"marker_max_length": 128,
|
| 67 |
"mask_token": "[MASK]",
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5841
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0291a4ea7c29e238e8da7490aa760d92be7f216c970f8d8a1700e0eb6ff9054c
|
| 3 |
size 5841
|