Brazilian_CLT_DPO / README.md
wandemberg-eld's picture
Update README.md
212f7c0 verified
---
license: llama3
datasets:
- ai-eldorado/Brazilian_CLT_preferences
language:
- en
- pt
base_model:
- meta-llama/Meta-Llama-3-8B-Instruct
tags:
- legal
---
**Model Description**
This model is a fine-tuned version of **LLaMA-3 8B Instruct (4-bit quantized)**, optimized using **Direct Preference Optimization (DPO)** for answering legal questions related to Brazil’s Consolidation of Labor Laws (CLT). The fine-tuning process leveraged a curated dataset of **736 human-preference triplets**, annotated by HR specialists and legal experts, to align the model with domain-specific expectations for accuracy and compliance.
**Intended Use**
The model is designed for **legal question answering** in the context of Brazilian labor law, supporting HR departments, compliance teams, and legal professionals. It aims to provide **factually accurate and semantically aligned responses** to CLT-related queries.
**Training Details**
- **Base Model:** LLaMA-3 8B (4-bit quantized)
- **Fine-tuning Method:** Direct Preference Optimization (DPO)
- **Dataset:** 736 validated human-preference entries on CLT-related questions
- **Hyperparameters:**
- Batch size: 2
- Gradient accumulation: 3
- Epochs: 1
- Learning rate: 5e-6
- Optimizer: AdamW 8-bit
**Performance Summary**
Compared to the base model, this DPO-tuned model achieved:
- **+11% improvement in factual accuracy**
- Higher semantic similarity scores
- Slight trade-off in fluency and argumentative structure
#### **Ethical Considerations**
- **Legal Disclaimer:** This model does **not** replace professional legal advice. Users should consult qualified professionals for critical decisions.
- **Risk of Misinterpretation:** Responses may omit nuances or context-specific interpretations of labor law.
- **Data Privacy:** The model was trained on synthetic and curated datasets, not on personal or confidential data.
#### **Bias and Fairness**
- The dataset was curated by HR and legal experts to minimize bias, but:
- **Regional Bias:** Focused exclusively on Brazilian CLT; not applicable to other jurisdictions.
- **Interpretation Bias:** Human annotators’ preferences may reflect subjective interpretations of legal norms.
#### **Limitations**
- Domain-specific; performance may degrade outside CLT-related queries.
- BLEU and ROUGE scores remain low due to metric limitations in legal contexts.
- Limited training data may affect generalization to complex or ambiguous cases.
#### **Citation**
soon