Andy-ML-And-AI commited on
Commit
bc48c43
Β·
verified Β·
1 Parent(s): 6ccd13d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -3
README.md CHANGED
@@ -9,6 +9,7 @@ tags:
9
  - qlora
10
  - reasoning
11
  - code
 
12
  license: apache-2.0
13
  language:
14
  - en
@@ -21,9 +22,10 @@ library_name: adapter
21
  pipeline_tag: text-generation
22
  ---
23
 
24
- # CTD-Qwen3-8B (Code Till Death)
 
 
25
 
26
- CTD-Qwen3-8B is a LoRA fine-tune of the Qwen3-8B base model.
27
  ---
28
 
29
  ## πŸ›  Experimental Setup
@@ -36,6 +38,7 @@ CTD-Qwen3-8B is a LoRA fine-tune of the Qwen3-8B base model.
36
  - MLP: gate, up, down
37
  - Training time: ~1 hour 17 minutes
38
  - Total steps: 50
 
39
  ---
40
 
41
  ## 🧠 Dataset & Objective
@@ -46,6 +49,7 @@ Training on a specific 30k subset of the
46
  - Uses chat template with assistant response in the *thinking* field
47
  - Objective: encourage *thinking over direct response*
48
  - Sequence length limited to 4096 tokens (for code complexity + VRAM constraints)
 
49
  ---
50
 
51
  ## πŸ“‰ Training Logs
@@ -59,6 +63,7 @@ With only 50 steps, the loss shows expected variance given model + dataset compl
59
  | 50 | 0.6785 |
60
 
61
  - Global batch size: 8 (1 device Γ— 8 gradient steps)
 
62
  ---
63
 
64
  ## πŸ“Š Evaluation (Ongoing)
@@ -69,6 +74,7 @@ Currently running benchmarks using the **lm-eval** library:
69
  - GSM8K (Math)
70
 
71
  Comparisons are being made against the base model.
 
72
  ---
73
 
74
  ## πŸ” Reproduction
@@ -77,7 +83,7 @@ Comparisons are being made against the base model.
77
  from unsloth import FastLanguageModel
78
 
79
  model, tokenizer = FastLanguageModel.from_pretrained(
80
- model_name = "Andy-ML-And-AI/CTD-Qwen3-8B",
81
  max_seq_length = 4096,
82
  load_in_4bit = True,
83
  )
 
9
  - qlora
10
  - reasoning
11
  - code
12
+ - hyperthinkcode
13
  license: apache-2.0
14
  language:
15
  - en
 
22
  pipeline_tag: text-generation
23
  ---
24
 
25
+ # HyperThinkCode-Qwen3-8B-v1
26
+
27
+ HyperThinkCode-Qwen3-8B-v1 is a LoRA fine-tune of the Qwen3-8B base model.
28
 
 
29
  ---
30
 
31
  ## πŸ›  Experimental Setup
 
38
  - MLP: gate, up, down
39
  - Training time: ~1 hour 17 minutes
40
  - Total steps: 50
41
+
42
  ---
43
 
44
  ## 🧠 Dataset & Objective
 
49
  - Uses chat template with assistant response in the *thinking* field
50
  - Objective: encourage *thinking over direct response*
51
  - Sequence length limited to 4096 tokens (for code complexity + VRAM constraints)
52
+
53
  ---
54
 
55
  ## πŸ“‰ Training Logs
 
63
  | 50 | 0.6785 |
64
 
65
  - Global batch size: 8 (1 device Γ— 8 gradient steps)
66
+
67
  ---
68
 
69
  ## πŸ“Š Evaluation (Ongoing)
 
74
  - GSM8K (Math)
75
 
76
  Comparisons are being made against the base model.
77
+
78
  ---
79
 
80
  ## πŸ” Reproduction
 
83
  from unsloth import FastLanguageModel
84
 
85
  model, tokenizer = FastLanguageModel.from_pretrained(
86
+ model_name = "Andy-ML-And-AI/HyperThinkCode-Qwen3-8B-v1",
87
  max_seq_length = 4096,
88
  load_in_4bit = True,
89
  )