alphagyuu/Korean-PII-Masking-BertForTokenClassification

Token Classification


alphagyuu committed on Mar 13, 2025

Commit 41c1a76 · verified · 1 Parent(s): 058aa83

Update README.md

Files changed (1)

README.md +20 -0

README.md CHANGED

@@ -1,3 +1,23 @@
 ---
 license: apache-2.0
 language:

+# Korean-PII-Masking-BERT
+**GitHub Repository**: [alphagyuu/Korean-PII-Masking-BERT](https://github.com/alphagyuu/Korean-PII-Masking-BERT)
+Korean-PII-Masking-BERT is a token classification model that fine-tunes KcBERT with a **TokenClassifier** head on a processed version of the "Korean SNS" dataset from **AI-Hub**.
+## 🖥️ Python Implementation
+- **Tokenizer**:
+  ```python
+  from transformers import BertTokenizer
+  tokenizer = BertTokenizer.from_pretrained('beomi/kcbert-base', do_lower_case=False)
+  ```
+- **Model**:
+  ```python
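+  from transformers import TFBertForTokenClassification
+  model = TFBertForTokenClassification.from_pretrained('alphagyuu/Korean-PII-Masking-BertForTokenClassification', num_labels=len(tag2idx))  # tag2idx: your label-to-index dict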
+  ```
+- **Libraries**:
+  - `transformers`, `tensorflow`, `numpy`, `pandas`, `scikit-learn` (imported as `sklearn`)
 ---
 license: apache-2.0
 language:
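
The README added in this commit shows how to load the tokenizer and the model, but not how per-token predictions become masked text. The following is a minimal sketch of that post-processing step only, under stated assumptions: the two-label `tag2idx` mapping, the `mask_tokens` helper, and the `[MASK]` placeholder are all hypothetical and are not taken from the repository. It runs without downloading the model, because it works on tokens and label ids that are already given.

```python
from typing import Dict, List

# Hypothetical label set: the README references `tag2idx` but never defines it.
tag2idx: Dict[str, int] = {"O": 0, "PII": 1}
idx2tag: Dict[int, str] = {i: t for t, i in tag2idx.items()}

SPECIAL_TOKENS = {"[CLS]", "[SEP]", "[PAD]"}


def mask_tokens(tokens: List[str], label_ids: List[int], mask: str = "[MASK]") -> str:
    """Merge WordPiece pieces into words and replace any word that
    contains a token labeled as PII with `mask`."""
    words: List[str] = []
    flagged: List[bool] = []
    for token, label_id in zip(tokens, label_ids):
        if token in SPECIAL_TOKENS:
            continue  # special tokens carry no text
        is_pii = idx2tag[label_id] != "O"
        if token.startswith("##") and words:
            # Continuation piece: attach it to the previous word.
            words[-1] += token[2:]
            flagged[-1] = flagged[-1] or is_pii
        else:
            words.append(token)
            flagged.append(is_pii)
    return " ".join(mask if f else w for w, f in zip(words, flagged))


# Hand-written tokens and labels, standing in for real model output.
tokens = ["[CLS]", "제", "이름은", "홍", "##길동", "입니다", "[SEP]"]
label_ids = [0, 0, 0, 1, 1, 0, 0]
print(mask_tokens(tokens, label_ids))  # prints: 제 이름은 [MASK] 입니다
```

In practice, `label_ids` would presumably come from an argmax over the model's output logits for each token, and the label mapping would have to match the one used during fine-tuning; neither is specified in this commit.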