| --- |
| license: mit |
| language: |
| - en |
| tags: |
| - code-generation |
| - codefeedback |
| - hrm-text |
| base_model: sapientai/HRM-Text-1B |
| --- |
| |
| # HRM-Text-1B-Code-Feedback |
|
|
| Fine-tuned version of [HRM-Text-1B](https://huggingface.co/sapientai/HRM-Text-1B) on the CodeFeedback dataset for code generation. |
|
|
| ## Model Details |
|
|
| - **Base Model:** sapientai/HRM-Text-1B (1B parameters, hierarchical reasoning model) |
| - **Training Data:** CodeFeedback dataset (~131k samples, filtered to <= 4096 tokens) |
| - **Training:** 2 epochs, ~8 hours on L40S GPU |
| - **Architecture:** Hierarchical Reasoning Model with H_cycles=2, L_cycles=3 |
|
|
| ## Training Data Distribution |
|
|
| | Language | Samples | |
| |----------|---------| |
| | Python | ~80k | |
| | JavaScript | ~7.6k | |
| | React | ~550 | |
|
|
| ## Performance |
|
|
| | Task | Base | Fine-tuned | |
| |------|------|------------| |
| | C++ factorial | Broken (repeating includes) | Correct | |
| | JS reverse | Wrong syntax | Correct syntax | |
| | Java max | Wrong type | Better structure | |
|
|
| ## Usage |
|
|
|
|
|
|
| ## Training Details |
|
|
| - **Framework:** PyTorch with FlashAttention 3 |
| - **Loss:** Cross-entropy |
| - **Hardware:** AWS L40S GPU |
| - **Training Time:** ~8 hours |
|
|
| ## Limitations |
|
|
| - Maximum sequence length: 4096 tokens |
| - Requires FlashAttention 3 for inference (Ada Lovelace or newer GPUs) |
| - Limited React/TypeScript performance due to small training data |
| - Best performance on Python code generation |
|
|
| ## License |
|
|
| MIT License |
|
|