Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -248,6 +248,76 @@ llama-server \
|
|
| 248 |
| **0.3-0.5** | Balanced | Good balance of accuracy and flexibility |
|
| 249 |
| **0.6-1.0** | Creative | More flexible but may miss some PII |
|
| 250 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 251 |
## π Limitations & Biases
|
| 252 |
|
| 253 |
### Current Limitations
|
|
|
|
| 248 |
| **0.3-0.5** | Balanced | Good balance of accuracy and flexibility |
|
| 249 |
| **0.6-1.0** | Creative | More flexible but may miss some PII |
|
| 250 |
|
| 251 |
+
## π‘ Examples
|
| 252 |
+
|
| 253 |
+
Here are real examples of the model in action, tested across different sectors and text types:
|
| 254 |
+
|
| 255 |
+
### π₯ Medical Records
|
| 256 |
+
**Input:**
|
| 257 |
+
```
|
| 258 |
+
Patient John Smith, born on March 15, 1985, visited Dr. Emily Johnson at St. Mary Hospital on January 10, 2024. His phone number is (555) 123-4567 and he lives at 123 Oak Street, Springfield, IL 62701.
|
| 259 |
+
```
|
| 260 |
+
|
| 261 |
+
**Output:**
|
| 262 |
+
```
|
| 263 |
+
Patient [FIRSTNAME_1] [MIDDLENAME_1], born on [DOB_1], visited Dr. [MIDDLENAME_2] [LASTNAME_1] at [CITY_1] Hospital on [DATE_1]. His phone number is [PHONENUMBER_1] and he lives at [BUILDINGNUMBER_1] [STREET_1], [STATE_1], [STATE_2] [STATE_3].
|
| 264 |
+
```
|
| 265 |
+
|
| 266 |
+
### βοΈ Legal Documents
|
| 267 |
+
**Input:**
|
| 268 |
+
```
|
| 269 |
+
Attorney Robert Davis from Davis & Associates LLP filed a lawsuit on behalf of client Sarah Johnson. The case involves Ms. Johnson's accident on December 15, 2023, at 456 Main Street, Boston, MA. Contact information: rdavis@lawfirm.com, (617) 555-0123.
|
| 270 |
+
```
|
| 271 |
+
|
| 272 |
+
**Output:**
|
| 273 |
+
```
|
| 274 |
+
Attorney [FIRSTNAME_1] [LASTNAME_1] from [COMPANYNAME_1] filed a lawsuit on behalf of client [FIRSTNAME_2] [LASTNAME_2]. The case involves Ms. [LASTNAME_3]'s accident on [DATE_1], at [BUILDINGNUMBER_1] [STREET_1], [STATE_1], [STATE_2]. Contact information: [EMAIL_1], [PHONENUMBER_1].
|
| 275 |
+
```
|
| 276 |
+
|
| 277 |
+
### π₯ HR Records
|
| 278 |
+
**Input:**
|
| 279 |
+
```
|
| 280 |
+
Employee record for Michael Chen (ID: EMP-2023-0456). Born: July 22, 1990. Position: Senior Software Engineer. Salary: $125,000 annually. Address: 789 Pine Avenue, Seattle, WA 98101. Email: mchen@techcorp.com. Emergency contact: Jennifer Chen, sister, phone (206) 555-9876.
|
| 281 |
+
```
|
| 282 |
+
|
| 283 |
+
**Output:**
|
| 284 |
+
```
|
| 285 |
+
Employee record for [FIRSTNAME_1] [LASTNAME_1] (ID: EMP-[DOB_1]). Born: [DOB_2]. Position: [JOBTITLE_1]. Salary: [CURRENCYSYMBOL_1][AMOUNT_1], [CURRENCYCODE_1]s, [SECONDARYADDRESS_1], [STATE_1] [ZIPCODE_1]. Email: [EMAIL_1]. Emergency contact: [FIRSTNAME_2] [LASTNAME_2], sister, phone ([PHONENUMBER_1]).
|
| 286 |
+
```
|
| 287 |
+
|
| 288 |
+
### π° Financial Records
|
| 289 |
+
**Input:**
|
| 290 |
+
```
|
| 291 |
+
Bank statement for account holder Lisa Rodriguez, Account #9876543210. Transaction on March 5, 2024: Deposit of $2,500 from employer TechSolutions Inc. Address: 321 Elm Drive, Austin, TX 78701. Phone: (512) 555-2468. Email: lisa.rodriguez@email.com.
|
| 292 |
+
```
|
| 293 |
+
|
| 294 |
+
**Output:**
|
| 295 |
+
```
|
| 296 |
+
Bank statement for account holder [FIRSTNAME_1] [MIDDLENAME_1], Account #[ACCOUNTNUMBER_1]. Transaction on [DATE_1]: Deposit of [CURRENCYSYMBOL_1]2,500 from employer [COMPANYNAME_1]. Address: [BUILDINGNUMBER_1] [STREET_1], [CITY_1], [STATE_1] [STATE_2]. Phone: [PHONENUMBER_1]. Email: [EMAIL_1].
|
| 297 |
+
```
|
| 298 |
+
|
| 299 |
+
### π¬ Social Media / Personal
|
| 300 |
+
**Input:**
|
| 301 |
+
```
|
| 302 |
+
Hey everyone! My friend David Wilson just got engaged to his girlfriend Maria Garcia. They met at Stanford University in 2018 and have been dating for 5 years. David works as a data scientist at Google in Mountain View, CA. Maria is a doctor at Stanford Hospital. Their wedding is planned for June 15, 2025, at Napa Valley Vineyard. Send congratulations to david.wilson@gmail.com or call (650) 555-0199!
|
| 303 |
+
```
|
| 304 |
+
|
| 305 |
+
**Output:**
|
| 306 |
+
```
|
| 307 |
+
Hey everyone! My friend [FIRSTNAME_1] [LASTNAME_1] just got engaged to his girlfriend [FIRSTNAME_2] [LASTNAME_2]. They met at Stanford University in 2018 and have been dating for 5 years. [FIRSTNAME_3] works as a data scientist at Google in [CITY_1], [STATE_1]. [FIRSTNAME_4] is a doctor at Stanford Hospital. Their wedding is planned for [DATE_1], at [STREET_1]. Send congratulations to [EMAIL_1] or call [PHONENUMBER_1]!
|
| 308 |
+
```
|
| 309 |
+
|
| 310 |
+
### π Meeting Notes
|
| 311 |
+
**Input:**
|
| 312 |
+
```
|
| 313 |
+
Meeting notes: Dr. Amanda White (awhite@hospital.org, (415) 555-1122) discussed patient care with nurse James Brown. Patient: Mark Johnson, DOB 11/20/1975, diagnosed with diabetes on 03/10/2023. Address: 789 Oak Ave, San Francisco, CA 94102.
|
| 314 |
+
```
|
| 315 |
+
|
| 316 |
+
**Output:**
|
| 317 |
+
```
|
| 318 |
+
Meeting notes: Dr. [FIRSTNAME_1] [MIDDLENAME_1] [LASTNAME_1], [PHONENUMBER_1] discussed patient care with nurse [FIRSTNAME_2] [LASTNAME_2]. Patient: [FIRSTNAME_3] [MIDDLENAME_3], DOB [DOB_1], diagnosed with diabetes on [DATE_1]. Address: [BUILDINGNUMBER_1] [STREET_1], [CITY_1], [STATE_1] [STATE_2].
|
| 319 |
+
```
|
| 320 |
+
|
| 321 |
## π Limitations & Biases
|
| 322 |
|
| 323 |
### Current Limitations
|