ai-eldorado/Brazilian_CLT_preferences
Viewer • Updated • 736 • 12
Model Description
This repository provides a LoRA adapter trained on the same human-preference dataset used for the DPO model. It enables efficient fine-tuning of LLaMA-3 8B Instruct (4-bit) for Brazilian labor law applications without requiring full model retraining.
Intended Use
The LoRA adapter is intended for developers and researchers who want to adapt LLaMA-3 models for CLT-related tasks while minimizing computational costs. It is particularly useful for resource-constrained environments or for integrating into multi-agent legal assistant systems.
Training Details
Performance Summary
When merged with the base model, the LoRA adapter reproduces the improvements observed in the full DPO model:
@article{moraescomparing,
title={Comparing RAG, DPO and Agentic Approaches in Systems Performance on Q\&A about Brazilian Labor Legislation},
author={Moraes, Gabriel K and Luiz, Pedro Augusto and Dias, Gabriel and de Farias, Vitor GCB and Fabiana, CQ de O and Fabris, Vitor L and Vicente, Matheus HR and do Nascimento, Leonardo R and Oliveira, Charles S and dos Santos, Leonardo T and others}
}
Base model
meta-llama/Llama-3.1-8B