InterstellarCG commited on
Commit
aa90ed5
·
verified ·
1 Parent(s): d12ce4a

Add model card

Browse files
Files changed (1) hide show
  1. README.md +59 -0
README.md ADDED
@@ -0,0 +1,59 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ tags:
6
+ - code-generation
7
+ - codefeedback
8
+ - hrm-text
9
+ base_model: sapientai/HRM-Text-1B
10
+ ---
11
+
12
+ # HRM-Text-1B-Code-Feedback
13
+
14
+ Fine-tuned version of [HRM-Text-1B](https://huggingface.co/sapientai/HRM-Text-1B) on the CodeFeedback dataset for code generation.
15
+
16
+ ## Model Details
17
+
18
+ - **Base Model:** sapientai/HRM-Text-1B (1B parameters, hierarchical reasoning model)
19
+ - **Training Data:** CodeFeedback dataset (~131k samples, filtered to <= 4096 tokens)
20
+ - **Training:** 2 epochs, ~8 hours on L40S GPU
21
+ - **Architecture:** Hierarchical Reasoning Model with H_cycles=2, L_cycles=3
22
+
23
+ ## Training Data Distribution
24
+
25
+ | Language | Samples |
26
+ |----------|---------|
27
+ | Python | ~80k |
28
+ | JavaScript | ~7.6k |
29
+ | React | ~550 |
30
+
31
+ ## Performance
32
+
33
+ | Task | Base | Fine-tuned |
34
+ |------|------|------------|
35
+ | C++ factorial | Broken (repeating includes) | Correct |
36
+ | JS reverse | Wrong syntax | Correct syntax |
37
+ | Java max | Wrong type | Better structure |
38
+
39
+ ## Usage
40
+
41
+
42
+
43
+ ## Training Details
44
+
45
+ - **Framework:** PyTorch with FlashAttention 3
46
+ - **Loss:** Cross-entropy
47
+ - **Hardware:** AWS L40S GPU
48
+ - **Training Time:** ~8 hours
49
+
50
+ ## Limitations
51
+
52
+ - Maximum sequence length: 4096 tokens
53
+ - Requires FlashAttention 3 for inference (Ada Lovelace or newer GPUs)
54
+ - Limited React/TypeScript performance due to small training data
55
+ - Best performance on Python code generation
56
+
57
+ ## License
58
+
59
+ MIT License