	---
	library_name: peft
	base_model: Qwen/Qwen2.5-7B-Instruct
	pipeline_tag: text-classification
	---

	# Privacy Comparator

	A learned model for pairwise comparison of privacy strength between messages.

	---

	## Model Details

	### Model Description

	Privacy Comparator is a learned model that compares two messages and determines which provides stronger protection of personal or sensitive information.

	Given two inputs:

	```
	A: message
	B: message
	```

	the model outputs:

	```
	A     -> message A is more privacy-preserving
	B     -> message B is more privacy-preserving
	SAME  -> the messages offer the same level of privacy protection
	```

	The model performs relative privacy comparison and can be applied to arbitrary message pairs, regardless of how they were generated.

	It does not:

	- detect PII
	- assign absolute privacy scores
	- generate redactions

	Instead, it learns a preference relation over messages in terms of privacy strength.

	---

	### Base Model

	Fine-tuned from: Qwen/Qwen2.5-7B-Instruct

	Implemented as a LoRA adapter.
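
A minimal sketch of attaching the adapter to the base model with `peft`. The adapter ID below is a placeholder, since this card does not state the repository path, and loading the base as a causal language model is an assumption based on the model emitting JSON text rather than class logits.

```python
BASE_MODEL_ID = "Qwen/Qwen2.5-7B-Instruct"
# Placeholder (hypothetical): replace with the actual Hub ID or local path of this adapter.
ADAPTER_ID = "path/to/privacy-comparator-adapter"


def load_comparator(adapter_id: str = ADAPTER_ID, base_model_id: str = BASE_MODEL_ID):
    """Load the base model and attach the LoRA adapter (assumed causal-LM setup)."""
    # Imported lazily so this file can be imported without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_model_id)
    base = AutoModelForCausalLM.from_pretrained(base_model_id, torch_dtype="auto")
    model = PeftModel.from_pretrained(base, adapter_id)
    model.eval()  # inference only
    return tokenizer, model
```

Calling `load_comparator()` downloads the 7B base weights, so it needs a machine with enough memory for the base model plus the adapter.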

	---

	### License

	This adapter inherits the license constraints of the base model.

	---

	## Uses

	### Intended Use

	- Privacy-preserving text comparison
	- Ranking anonymization strategies
	- Evaluating relative disclosure risk

	For example, multiple transformation strategies can be applied to the same input, each producing a variant:

	```
	m_i = τ(x; a_i)
	```

	where:

	- `x` is the original message
	- `a_i` is a transformation strategy (e.g., redact, abstract, or retain sensitive spans)
	- `τ` applies the chosen strategy to produce a privacy-preserving version

	Example:

	Original message:

	```
	Lucy lives at 139 Tremont St in Boston.
	```

	Different strategies may produce:

	```
	m₁: [NAME1] lives at [ADDRESS1] in [CITY1].
	m₂: A person lives at a residential address in a major city in the U.S.
	m₃: A person lives at [ADDRESS1] in Boston.
	```

	The comparator can rank such variants based on which better protects sensitive information.
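
One simple way to turn pairwise verdicts into a ranking is a round-robin over all variant pairs. The sketch below is illustrative only and is not the search procedure from the paper; `rank_variants` and the `compare` callable are hypothetical names, with `compare` standing in for a call to the model that returns `"A"`, `"B"`, or `"SAME"`.

```python
from itertools import combinations
from typing import Callable, Dict, List

# A comparator takes (message_a, message_b) and returns "A", "B", or "SAME".
Comparator = Callable[[str, str], str]


def rank_variants(variants: Dict[str, str], compare: Comparator) -> List[str]:
    """Rank variant names from most to least privacy-preserving.

    Runs a round-robin over all pairs: a win scores 1 point and a "SAME"
    verdict scores 0.5 for each side. Equal totals keep their input order.
    """
    scores = {name: 0.0 for name in variants}
    for name_a, name_b in combinations(variants, 2):
        verdict = compare(variants[name_a], variants[name_b])
        if verdict == "A":
            scores[name_a] += 1.0
        elif verdict == "B":
            scores[name_b] += 1.0
        elif verdict == "SAME":
            scores[name_a] += 0.5
            scores[name_b] += 0.5
        else:
            raise ValueError(f"unexpected verdict: {verdict!r}")
    # sorted() is stable, so variants with equal scores preserve insertion order.
    return sorted(variants, key=lambda name: -scores[name])
```

The number of comparisons grows quadratically with the number of variants, which is acceptable for a handful of candidates such as m₁ to m₃ above.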

	For more details on the transformation framework, please refer to the associated paper.

	---

	### Out-of-Scope Use

	This model is not intended for:

	- PII detection
	- Safety moderation
	- Utility evaluation
	- Generating anonymized text

	It performs relative comparison only.

	---

	## Training Details

	- LoRA rank: 8
	- Learning rate: 1e-4
	- Epochs: 2
	- Context length: 2048
	- Global batch size: 2048

	Training was performed using Fireworks AI.

	## Training Data

	This model is trained via supervised fine-tuning (SFT) with LoRA on pairwise privacy-preference comparisons.

	Training labels are generated using a teacher model (OpenAI o3) on [ShareGPT90K](https://huggingface.co/datasets/liyucheng/ShareGPT90K)-derived privacy-variant pairs.
	As described in the paper, o3 was selected for its alignment with human ground truth on high-consensus cases.

	In addition, we release a human-labeled evaluation set of 150 A/B pairs.
	Each pair is annotated by at least 5 qualified participants (52 unique participants in total) and comes with a `consensus` label and a `consensus_ratio`.

	For details on data construction, model selection, and annotation procedures, please refer to the paper.

	---

	## Released Dataset (Human Ground Truth)

	We release a human-labeled [dataset](https://github.com/PEACH-Research-Lab/Operationalize-Data-Minimization/blob/main/human_labeled_datasets/DATASET_CARD.md) of 150 pairwise privacy-preference comparisons.

	Each JSONL entry contains:
	- `survey_id`, `conversation_id`, `pair_index`
	- `answers`: anonymized participant votes (`participant_1`, `participant_2`, ...)
	- `consensus`, `consensus_ratio`
	- `message_A`, `message_B`
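
The fields above can be read with the standard library alone. This sketch assumes that `answers` maps participant keys to a label such as `"A"`, `"B"`, or `"SAME"` and that `consensus_ratio` is a number between 0 and 1; neither detail is confirmed here, so check the dataset card before relying on them. The function names and the `0.8` threshold are hypothetical.

```python
import json
from collections import Counter
from typing import Dict, Iterable, List, Tuple


def load_entries(lines: Iterable[str]) -> List[dict]:
    """Parse JSONL text: one JSON object per non-empty line."""
    return [json.loads(line) for line in lines if line.strip()]


def tally_votes(answers: Dict[str, str]) -> Tuple[str, float]:
    """Return the most common vote and the fraction of participants who cast it.

    Assumption: `answers` maps participant keys to a label string.
    """
    counts = Counter(answers.values())
    label, votes = counts.most_common(1)[0]
    return label, votes / len(answers)


def high_consensus(entries: List[dict], min_ratio: float = 0.8) -> List[dict]:
    """Keep entries whose released `consensus_ratio` is at least `min_ratio`."""
    return [entry for entry in entries if entry["consensus_ratio"] >= min_ratio]
```

Passing an open file handle to `load_entries` works because iterating a text file yields one line at a time.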

	### Participant Privacy
	All participant identifiers are anonymized. No Prolific IDs or direct participant identifiers are released.

	---

	## Model Outputs

	The model produces structured JSON decisions:

	```json
	{
	  "reason": "...",
	  "response": "A" | "B" | "SAME"
	}
	```
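
Generated text may carry extra characters around the JSON object, so a defensive parser can help downstream code. The following is a sketch under that assumption; `parse_decision` is a hypothetical helper and not part of the released code.

```python
import json
import re

VALID_RESPONSES = {"A", "B", "SAME"}


def parse_decision(raw: str) -> dict:
    """Parse the outermost {...} span in the model text and validate `response`."""
    # Greedy match from the first "{" to the last "}" in the text.
    match = re.search(r"\{.*\}", raw, flags=re.DOTALL)
    if match is None:
        raise ValueError("no JSON object found in model output")
    decision = json.loads(match.group(0))
    response = decision.get("response")
    if response not in VALID_RESPONSES:
        raise ValueError(f"unexpected response: {response!r}")
    return {"reason": decision.get("reason", ""), "response": response}
```

Malformed JSON raises `json.JSONDecodeError`, which is a subclass of `ValueError`, so callers can catch a single exception type.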

	---

	## Resources

	- Paper: [OpenReview](https://iclr.cc/virtual/2026/poster/10007115)
	- Code: [Operationalize Data Minimization](https://github.com/PEACH-Research-Lab/Operationalize-Data-Minimization)

	For full details of the transformation framework and action search procedure, please refer to the paper.