Spaces:
Build error
A newer version of the Gradio SDK is available: 6.14.0
title: Data Faker
emoji: 🌖
colorFrom: green
colorTo: pink
sdk: gradio
sdk_version: 4.20.1
app_file: app.py
pinned: false
license: mit
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
Data Faker
Data Faker is a tool designed to protect privacy and maintain confidentiality by anonymizing sensitive information in text data before passing it to language models like GPT-4. It provides a user-friendly interface for anonymizing and deanonymizing text using the Presidio Reversible Anonymizer.
Features
- Anonymize sensitive information such as names, email addresses, phone numbers, and more
- Deanonymize previously anonymized text to recover the original information
- Maintain a separate anonymizer instance for each user session
- Simple and intuitive web-based interface using Gradio
Installation
- Clone the repository:
git clone https://github.com/mayzyo/data-faker.git
- Navigate to the project directory:
cd data-faker
- Install the required dependencies:
pip install -r requirements.txt
Usage
- Run the application:
python app.py
Open your web browser and navigate to the provided URL (e.g.,
http://localhost:7860).Enter the text you want to anonymize in the "Text to Convert" input box.
Click the "Generate Fake" button to anonymize the text. The anonymized text will appear in the "Output Text" box.
To deanonymize the text, click the "Revert Fake" button. The original text will be restored in the "Output Text" box.
To reset the anonymizer instance and clear the input and output boxes, click the "Reset" button.
Configuration
The AnonymiserManager class in anonymiser_manager.py uses the Presidio Reversible Anonymizer to handle the anonymization and deanonymization process. You can customize the analyzed_fields parameter to specify the types of sensitive information you want to anonymize.
Contributing
Contributions are welcome! If you find any issues or have suggestions for improvements, please open an issue or submit a pull request.
License
This project is licensed under the MIT License.