wandemberg-eld committed
Commit 212f7c0 · verified · 1 Parent(s): 3f218c7

Update README.md

  1. README.md +13 -15
README.md CHANGED
@@ -1,18 +1,16 @@
- ---
- license: llama3
- datasets:
- - ai-eldorado/Brazilian_CLT_preferences
- language:
- - en
- - pt
- base_model:
- - meta-llama/Meta-Llama-3-8B-Instruct
- tags:
- - legal
- ---
-
-
- ### ✅ **Model Card for `ai-eldorado/Brazilian_CLT_DPO`**
+ ---
+ license: llama3
+ datasets:
+ - ai-eldorado/Brazilian_CLT_preferences
+ language:
+ - en
+ - pt
+ base_model:
+ - meta-llama/Meta-Llama-3-8B-Instruct
+ tags:
+ - legal
+ ---
+
  **Model Description**
  This model is a fine-tuned version of **LLaMA-3 8B Instruct (4-bit quantized)**, optimized using **Direct Preference Optimization (DPO)** for answering legal questions related to Brazil’s Consolidation of Labor Laws (CLT). The fine-tuning process leveraged a curated dataset of **736 human-preference triplets**, annotated by HR specialists and legal experts, to align the model with domain-specific expectations for accuracy and compliance.
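
The model description mentions DPO over preference triplets (prompt, preferred answer, rejected answer). As a rough illustration of what that objective computes for a single triplet, here is a minimal sketch assuming the standard DPO formulation. The function name `dpo_loss`, the `beta` value, and the idea of passing pre-summed log-probabilities are illustrative assumptions; none of this is taken from the repository's training code.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (prompt, chosen, rejected) triplet.

    Each argument is the summed token log-probability of a response under
    either the policy being trained or the frozen reference model.
    beta=0.1 is an assumed value, not the one used for this model.
    """
    # Implicit rewards: how far the policy has moved from the reference.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward
    # -log(sigmoid(margin)) written as a numerically stable softplus(-margin).
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
```

When the policy and the reference agree, the margin is zero and the loss equals log 2; the loss falls as the policy assigns relatively more probability to the preferred answer than the reference does.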