shader-v2 / README.md
Scorched2's picture
Upload folder using huggingface_hub
3a62af9 verified
---
library_name: transformers
tags: [hrm, moe, hierarchical-reasoning, custom-architecture]
---
# Hierarchical Reasoning Model (HRM)
Custom MoE language model with 3-level hierarchical reasoning and DeepSeek-V3 memory strategies.
**Architecture:** 3 levels 路 16 experts (4 active) 路 MLA attention 路 Hierarchical memory
**Parameters:** ~350M total, ~45M active per token
**Trained on:** OpenHermes-2.5
## Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("Scorched2/shader-v2", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("Scorched2/shader-v2")
inputs = tokenizer("### Instruction:\nExplain AI.\n\n### Response:\n", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(out[0]))
```