kmd2525 commited on
Commit
7168f63
·
verified ·
1 Parent(s): 96b620a

Upload v8_stage3_xml

Browse files
README.md ADDED
@@ -0,0 +1,73 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: kmd2525/v8_stage2_yaml-merged
4
+ tags:
5
+ - structured-output
6
+ - xml
7
+ - sft
8
+ - sequential-format-learning
9
+ language:
10
+ - en
11
+ - ja
12
+ ---
13
+
14
+ # v8_stage3_xml-merged
15
+
16
+ ## Model Description
17
+
18
+ This model is **Stage 3** of the Sequential Format Learning (v8 strategy) for structured data output.
19
+
20
+ ### Training Strategy
21
+
22
+ Based on Person U's approach that achieved 0.84 on the leaderboard:
23
+ - Train one format at a time
24
+ - Merge LoRA to base model after each stage
25
+ - Use merged model as the base for the next stage
26
+
27
+ ### Stage 3 Focus: XML
28
+
29
+ - **Format**: XML (500 samples)
30
+ - **Goal**: 95%+ parse success rate for XML with proper & escaping
31
+ - **Base Model**: `kmd2525/v8_stage2_yaml-merged` (Stage 2 merged model)
32
+
33
+ ### Previous Stages
34
+
35
+ - Stage 1: JSON/CSV (800 samples) → JSON 100%, CSV 100%
36
+ - Stage 2: YAML (500 samples) → YAML 100%
37
+
38
+ ### Training Parameters
39
+
40
+ - MAX_SEQ_LEN: 1024
41
+ - EPOCHS: 2
42
+ - Learning Rate: 3e-05
43
+ - LoRA R: 64, Alpha: 128
44
+
45
+ ### Sequential Format Learning Pipeline
46
+
47
+ ```
48
+ Stage 1: JSON/CSV (800) ✅
49
+
50
+ Stage 2: YAML (500) ✅
51
+
52
+ Stage 3: XML (500) ← This model
53
+
54
+ Stage 4: Mixed/TOML (1000)
55
+
56
+ Final Model → LB 0.8+
57
+ ```
58
+
59
+ ### Usage
60
+
61
+ ```python
62
+ from transformers import AutoModelForCausalLM, AutoTokenizer
63
+
64
+ model = AutoModelForCausalLM.from_pretrained("kmd2525/v8_stage3_xml-merged")
65
+ tokenizer = AutoTokenizer.from_pretrained("kmd2525/v8_stage3_xml-merged")
66
+ ```
67
+
68
+ ### Next Stage
69
+
70
+ Use this model as the base for Stage 4 (Mixed/TOML final tuning):
71
+ ```python
72
+ os.environ["SFT_BASE_MODEL"] = "kmd2525/v8_stage3_xml-merged"
73
+ ```
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:97bc706b40b27830ff9520172605eaa710d50fb255d9821064e7739c8441e349
3
  size 4967215360
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:118f5927f6a4a15fefe0478da24d70a4f710b4256183dbfd88785e8eefa67f04
3
  size 4967215360
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cac8aa941c63a630a64380705933ed9718c0192b5b78a16ec8e389a7d2748848
3
  size 3077766632
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:458d2062377a075191662ce63a86259599afa4d26fbb88e7784cd981765a72a7
3
  size 3077766632