InterstellarCG
/

HRM-Text-1B-Code-Feedback

code-generation

Model card Files Files and versions

HRM-Text-1B-Code-Feedback / README.md

InterstellarCG's picture

Add model card

aa90ed5 verified 1 day ago

|

history blame contribute delete

1.39 kB

	---
	license: mit
	language:
	- en
	tags:
	- code-generation
	- codefeedback
	- hrm-text
	base_model: sapientai/HRM-Text-1B
	---

	# HRM-Text-1B-Code-Feedback

	Fine-tuned version of [HRM-Text-1B](https://huggingface.co/sapientai/HRM-Text-1B) on the CodeFeedback dataset for code generation.

	## Model Details

	- Base Model: sapientai/HRM-Text-1B (1B parameters, hierarchical reasoning model)
	- Training Data: CodeFeedback dataset (~131k samples, filtered to <= 4096 tokens)
	- Training: 2 epochs, ~8 hours on L40S GPU
	- Architecture: Hierarchical Reasoning Model with H_cycles=2, L_cycles=3

	## Training Data Distribution

	\| Language \| Samples \|
	\|----------\|---------\|
	\| Python \| ~80k \|
	\| JavaScript \| ~7.6k \|
	\| React \| ~550 \|

	## Performance

	\| Task \| Base \| Fine-tuned \|
	\|------\|------\|------------\|
	\| C++ factorial \| Broken (repeating includes) \| Correct \|
	\| JS reverse \| Wrong syntax \| Correct syntax \|
	\| Java max \| Wrong type \| Better structure \|

	## Usage



	## Training Details

	- Framework: PyTorch with FlashAttention 3
	- Loss: Cross-entropy
	- Hardware: AWS L40S GPU
	- Training Time: ~8 hours

	## Limitations

	- Maximum sequence length: 4096 tokens
	- Requires FlashAttention 3 for inference (Ada Lovelace or newer GPUs)
	- Limited React/TypeScript performance due to small training data
	- Best performance on Python code generation

	## License

	MIT License