Andy-ML-And-AI commited on
Commit
6ccd13d
Β·
verified Β·
1 Parent(s): d0f04f2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -5
README.md CHANGED
@@ -24,7 +24,6 @@ pipeline_tag: text-generation
24
  # CTD-Qwen3-8B (Code Till Death)
25
 
26
  CTD-Qwen3-8B is a LoRA fine-tune of the Qwen3-8B base model.
27
-
28
  ---
29
 
30
  ## πŸ›  Experimental Setup
@@ -37,7 +36,6 @@ CTD-Qwen3-8B is a LoRA fine-tune of the Qwen3-8B base model.
37
  - MLP: gate, up, down
38
  - Training time: ~1 hour 17 minutes
39
  - Total steps: 50
40
-
41
  ---
42
 
43
  ## 🧠 Dataset & Objective
@@ -48,7 +46,6 @@ Training on a specific 30k subset of the
48
  - Uses chat template with assistant response in the *thinking* field
49
  - Objective: encourage *thinking over direct response*
50
  - Sequence length limited to 4096 tokens (for code complexity + VRAM constraints)
51
-
52
  ---
53
 
54
  ## πŸ“‰ Training Logs
@@ -62,7 +59,6 @@ With only 50 steps, the loss shows expected variance given model + dataset compl
62
  | 50 | 0.6785 |
63
 
64
  - Global batch size: 8 (1 device Γ— 8 gradient steps)
65
-
66
  ---
67
 
68
  ## πŸ“Š Evaluation (Ongoing)
@@ -73,7 +69,6 @@ Currently running benchmarks using the **lm-eval** library:
73
  - GSM8K (Math)
74
 
75
  Comparisons are being made against the base model.
76
-
77
  ---
78
 
79
  ## πŸ” Reproduction
 
24
  # CTD-Qwen3-8B (Code Till Death)
25
 
26
  CTD-Qwen3-8B is a LoRA fine-tune of the Qwen3-8B base model.
 
27
  ---
28
 
29
  ## πŸ›  Experimental Setup
 
36
  - MLP: gate, up, down
37
  - Training time: ~1 hour 17 minutes
38
  - Total steps: 50
 
39
  ---
40
 
41
  ## 🧠 Dataset & Objective
 
46
  - Uses chat template with assistant response in the *thinking* field
47
  - Objective: encourage *thinking over direct response*
48
  - Sequence length limited to 4096 tokens (for code complexity + VRAM constraints)
 
49
  ---
50
 
51
  ## πŸ“‰ Training Logs
 
59
  | 50 | 0.6785 |
60
 
61
  - Global batch size: 8 (1 device Γ— 8 gradient steps)
 
62
  ---
63
 
64
  ## πŸ“Š Evaluation (Ongoing)
 
69
  - GSM8K (Math)
70
 
71
  Comparisons are being made against the base model.
 
72
  ---
73
 
74
  ## πŸ” Reproduction