gpadres commited on
Commit
ebf34ea
·
1 Parent(s): d9003d8

GPJ: updating model card

Browse files
Files changed (1) hide show
  1. README.md +7 -8
README.md CHANGED
@@ -2,16 +2,15 @@
2
  tags:
3
  - model_hub_mixin
4
  - pytorch_model_hub_mixin
5
- base_model: allenai/longformer-base-4096
6
- datasets:
7
- - mattmdjaga/text-anonymization-benchmark-train
8
  license: apache-2.0
9
- model_id: pii-classifier-tab-datset-1
 
 
10
  ---
11
 
12
- # Model Card for {{ base_model | default("Model ID", true) }}
13
 
 
14
 
15
- This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
16
- - Library: [More Information Needed]
17
- - Docs: [More Information Needed]
 
2
  tags:
3
  - model_hub_mixin
4
  - pytorch_model_hub_mixin
5
+ datasets: mattmdjaga/text-anonymization-benchmark-train
 
 
6
  license: apache-2.0
7
+ base_model: allenai/longformer-base-4096
8
+ base_model_relation: finetune
9
+ model_id: pii-classifier-tab-dataset
10
  ---
11
 
12
+ # Model Card for pii-classifier-tab-dataset
13
 
14
+ Model is a Longformer with a classification head, finetuned on **Text Anonymization Benchmark (TAB)** dataset for indicating if a token is part of a **Personal Identifiable Information (PII)** and should be masked out or not. Model output is the logits of the input sequence, where the classes are 1 (MASK) or 0 (NO-MASK), e.g. no IOB format used.
15
 
16
+ Model is used as an example in [LeakPro repo](https://github.com/aidotse/LeakPro). For further detail, see example [notebook](https://github.com/aidotse/LeakPro/blob/gpj_syn_text_pii_scanner/examples/synthetic_data/syn_text_pii_scanner_example.ipynb).