ai-eldorado
/

Brazilian_CLT_DPO

4-bit precision

Model card Files Files and versions

Brazilian_CLT_DPO / README.md

wandemberg-eld's picture

Update README.md

212f7c0 verified 6 months ago

|

history blame contribute delete

2.52 kB

	---
	license: llama3
	datasets:
	- ai-eldorado/Brazilian_CLT_preferences
	language:
	- en
	- pt
	base_model:
	- meta-llama/Meta-Llama-3-8B-Instruct
	tags:
	- legal
	---

	Model Description
	This model is a fine-tuned version of LLaMA-3 8B Instruct (4-bit quantized), optimized using Direct Preference Optimization (DPO) for answering legal questions related to Brazil’s Consolidation of Labor Laws (CLT). The fine-tuning process leveraged a curated dataset of 736 human-preference triplets, annotated by HR specialists and legal experts, to align the model with domain-specific expectations for accuracy and compliance.

	Intended Use
	The model is designed for legal question answering in the context of Brazilian labor law, supporting HR departments, compliance teams, and legal professionals. It aims to provide factually accurate and semantically aligned responses to CLT-related queries.

	Training Details
	- Base Model: LLaMA-3 8B (4-bit quantized)
	- Fine-tuning Method: Direct Preference Optimization (DPO)
	- Dataset: 736 validated human-preference entries on CLT-related questions
	- Hyperparameters:
	- Batch size: 2
	- Gradient accumulation: 3
	- Epochs: 1
	- Learning rate: 5e-6
	- Optimizer: AdamW 8-bit

	Performance Summary
	Compared to the base model, this DPO-tuned model achieved:
	- +11% improvement in factual accuracy
	- Higher semantic similarity scores
	- Slight trade-off in fluency and argumentative structure


	#### Ethical Considerations
	- Legal Disclaimer: This model does not replace professional legal advice. Users should consult qualified professionals for critical decisions.
	- Risk of Misinterpretation: Responses may omit nuances or context-specific interpretations of labor law.
	- Data Privacy: The model was trained on synthetic and curated datasets, not on personal or confidential data.


	#### Bias and Fairness
	- The dataset was curated by HR and legal experts to minimize bias, but:
	- Regional Bias: Focused exclusively on Brazilian CLT; not applicable to other jurisdictions.
	- Interpretation Bias: Human annotators’ preferences may reflect subjective interpretations of legal norms.


	#### Limitations
	- Domain-specific; performance may degrade outside CLT-related queries.
	- BLEU and ROUGE scores remain low due to metric limitations in legal contexts.
	- Limited training data may affect generalization to complex or ambiguous cases.



	#### Citation
	soon