ai-eldorado
/

Brazilian_CLT_lora

Model card Files Files and versions

Brazilian_CLT_lora / README.md

wandemberg-eld's picture

Update README.md

248d966 verified 6 months ago

|

history blame contribute delete

1.85 kB

	---
	license: llama3
	datasets:
	- ai-eldorado/Brazilian_CLT_preferences
	language:
	- en
	- pt
	base_model:
	- meta-llama/Llama-3.1-8B-Instruct
	tags:
	- legal
	---

	Model Description
	This repository provides a LoRA adapter trained on the same human-preference dataset used for the DPO model. It enables efficient fine-tuning of LLaMA-3 8B Instruct (4-bit) for Brazilian labor law applications without requiring full model retraining.

	Intended Use
	The LoRA adapter is intended for developers and researchers who want to adapt LLaMA-3 models for CLT-related tasks while minimizing computational costs. It is particularly useful for resource-constrained environments or for integrating into multi-agent legal assistant systems.

	Training Details
	- Base Model: LLaMA-3 8B (4-bit quantized)
	- Method: LoRA fine-tuning with DPO-aligned dataset
	- Dataset: 736 human-preference entries on CLT-related questions
	- Hyperparameters: Same as the DPO model

	Performance Summary
	When merged with the base model, the LoRA adapter reproduces the improvements observed in the full DPO model:
	- Increased factual accuracy
	- Better semantic alignment with CLT regulations


	#### Ethical Considerations
	- Legal Disclaimer: Not a substitute for professional legal advice.
	- Risk of Misuse: Incorrect merging or use outside intended domain may lead to inaccurate outputs.
	- Data Privacy: No personal data was used in training.


	#### Bias and Fairness
	- Same considerations as the DPO model:
	- Regional and interpretation biases may exist.
	- Limited dataset size could affect fairness in edge cases.


	#### Limitations
	- Requires merging with the base LLaMA-3 model for inference.
	- Domain-specific; not suitable for general-purpose legal reasoning.


	#### Citation
	soon