nbeerbower committed on
Commit 5670e40 · verified · 1 Parent(s): 4bdc116

Add model card with training configuration

Files changed (1)
  1. README.md +56 -0
README.md ADDED
@@ -0,0 +1,56 @@
+ ---
+ library_name: transformers
+ pipeline_tag: text-generation
+ tags:
+ - merlina
+ - grimoire
+ - text-generation
+ - sft
+ datasets:
+ - hemlang/Hemlock2-DPO
+ - hemlang/hemlock-formulary-SFT
+ - hemlang/hemlock-codex-SFT
+ base_model:
+ - nbeerbower/A2-Coder
+ ---
+
+ # A2-Hemlock-Coder
+
+ ## Training Configuration
+
+ | Parameter | Value |
+ |-----------|-------|
+ | Training Mode | SFT |
+ | Base Model | `nbeerbower/A2-Coder` |
+ | Learning Rate | 0.0001 |
+ | Epochs | 2 |
+ | Batch Size | 1 |
+ | Gradient Accumulation | 16 |
+ | Effective Batch Size | 16 |
+ | Max Sequence Length | 2048 |
+ | Optimizer | paged_adamw_8bit |
+ | LR Scheduler | cosine |
+ | Warmup Ratio | 0.05 |
+ | Weight Decay | 0.01 |
+ | Max Grad Norm | 0.5 |
+ | Seed | 42 |
+ | LoRA Rank (r) | 128 |
+ | LoRA Alpha | 128 |
+ | LoRA Dropout | 0.05 |
+ | Target Modules | k_proj, o_proj, q_proj, v_proj, down_proj, gate_proj, up_proj |
+ | Quantization | 4-bit (NF4) |
+ | GPU | NVIDIA RTX A6000 |
+
+ ## Datasets
+
+ Trained on 3 concatenated datasets:
+
+ 1. [`hemlang/Hemlock2-DPO`](https://huggingface.co/datasets/hemlang/Hemlock2-DPO) (split: `train`)
+ 2. [`hemlang/hemlock-formulary-SFT`](https://huggingface.co/datasets/hemlang/hemlock-formulary-SFT) (split: `train`)
+ 3. [`hemlang/hemlock-codex-SFT`](https://huggingface.co/datasets/hemlang/hemlock-codex-SFT) (split: `train`)
+
+ ---
+
+ ![Trained with Merlina](https://raw.githubusercontent.com/Schneewolf-Labs/Merlina/refs/heads/main/frontend/madewithmerlina_smol.png)
+
+ [Merlina on GitHub](https://github.com/Schneewolf-Labs/Merlina)
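
Note (not part of the commit): several rows in the Training Configuration table of the added README are derived from others. The sketch below shows how the Effective Batch Size row follows from Batch Size and Gradient Accumulation, and what the LoRA Rank and Alpha values imply for the adapter scaling factor. It is a minimal illustration, not Merlina's implementation. The class and method names are hypothetical, a single GPU is assumed, standard LoRA scaling (`alpha / r`) is assumed, and the `ceil` rounding for step counts is a common convention that may differ from what the trainer actually did.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingConfig:
    """Hypothetical container; values copied from the Training Configuration table."""

    epochs: int = 2
    batch_size: int = 1
    gradient_accumulation: int = 16
    warmup_ratio: float = 0.05
    lora_rank: int = 128
    lora_alpha: int = 128

    def effective_batch_size(self, num_devices: int = 1) -> int:
        # Assumption: one GPU, since the table lists a single device.
        return self.batch_size * self.gradient_accumulation * num_devices

    def lora_scaling(self) -> float:
        # Assumption: standard LoRA scaling (alpha / r), not rank-stabilized LoRA.
        return self.lora_alpha / self.lora_rank

    def optimizer_steps(self, num_examples: int) -> int:
        # One optimizer step per effective batch; a partial final batch is
        # counted as a full step (assumed rounding convention).
        steps_per_epoch = math.ceil(num_examples / self.effective_batch_size())
        return steps_per_epoch * self.epochs

    def warmup_steps(self, num_examples: int) -> int:
        # Warmup length as a fraction of all optimizer steps, rounded up (assumed).
        return math.ceil(self.optimizer_steps(num_examples) * self.warmup_ratio)


config = TrainingConfig()
print("effective batch size:", config.effective_batch_size())
print("LoRA scaling factor:", config.lora_scaling())
```

The step and warmup helpers take the number of training examples as an argument because the README does not state the size of the concatenated dataset.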