---
base_model: sapientinc/HRM-Text-1B
library_name: transformers
pipeline_tag: text-generation
tags:
- transformers
- code
- python
- humaneval
- mbpp
---

# HRM-Text-1B-sft-code

Merged code post-training release from `sapientinc/HRM-Text-1B` plus:

[`josephmayo/HRM-Text-1B-sft-code-LoRA`](https://huggingface.co/josephmayo/HRM-Text-1B-sft-code-LoRA)

`sapientinc/HRM-Text-1B` is a pretrained-only HRM text model. This merged release packages the code post-trained LoRA into the base weights for direct use.

## Training Summary

- Base model: `sapientinc/HRM-Text-1B`
- Method: supervised LoRA post-training, then merged into base weights
- Training rows: `384`
- Max steps: `120`
- LoRA rank: `64`
- Learning rate: `8e-6`
- Final train loss: `0.3275703112284342`

## Validation

Local code validation:

- Base model score: `5/100`
- Merged model score: `24/100`
- Absolute improvement: `+19/100`
- Relative improvement: `4.8x` over base
- HumanEval slice: `14/50`
- MBPP slice: `10/50`

The score above is the local validation result used for this release.

## Use

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "josephmayo/HRM-Text-1B-sft-code"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
model.eval()
```

## Notes

- This is the merged release of the LoRA.
- Adapter repo: `josephmayo/HRM-Text-1B-sft-code-LoRA`