---
library_name: transformers
tags: [hrm, moe, hierarchical-reasoning, custom-architecture]
---

# Hierarchical Reasoning Model (HRM)

A custom mixture-of-experts (MoE) language model with three-level hierarchical reasoning and DeepSeek-V3-style memory strategies.

**Architecture:** 3 levels · 16 experts (4 active) · MLA attention · Hierarchical memory  
**Parameters:** ~350M total, ~45M active per token  
**Trained on:** OpenHermes-2.5
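
To make "16 experts (4 active)" concrete, here is a minimal sketch of generic top-k expert routing for a single token. It is an illustration only: the function name `top_k_route`, the random router logits, and the softmax-over-selected-experts weighting are assumptions, not the gating code this model actually uses.

```python
import math
import random

NUM_EXPERTS = 16  # from the architecture summary above
TOP_K = 4         # experts active per token, from the architecture summary above


def top_k_route(router_logits, k=TOP_K):
    """Return (expert_indices, weights) for one token.

    Hypothetical helper: picks the k highest-scoring experts and
    normalizes their softmax weights so they sum to 1. This is generic
    top-k gating, not necessarily what this model implements.
    """
    # Indices of the k largest logits, highest first.
    chosen = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)[:k]
    # Softmax restricted to the chosen experts (subtract the max for stability).
    peak = max(router_logits[i] for i in chosen)
    exps = [math.exp(router_logits[i] - peak) for i in chosen]
    total = sum(exps)
    return chosen, [e / total for e in exps]


# Stand-in router scores for one token (assumed; a real router is a learned layer).
rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
experts, weights = top_k_route(logits)
print(experts, [round(w, 3) for w in weights])
```

Only the selected experts run for a given token, which is why the active parameter count (~45M, roughly 13% of ~350M) is much smaller than the total.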

## Usage

```python
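from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "Scorched2/shader-v2"

# trust_remote_code=True lets transformers load the custom architecture code from the repo.
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(repo_id)

# Wrap the request in the instruction/response prompt format.
inputs = tokenizer("### Instruction:\nExplain AI.\n\n### Response:\n", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=200)

# Decode the generated ids back to text, dropping special tokens.
print(tokenizer.decode(out[0], skip_special_tokens=True))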
```