SofiTesfay2010
/

HRM-LLM

Model card Files Files and versions

HRM-LLM / README.md

SofiTesfay2010's picture

Update README.md

e938ebc verified 7 months ago

|

history blame contribute delete

2.32 kB

	---
	base_model:
	- sapientinc/HRM-checkpoint-ARC-2
	- sapientinc/HRM-checkpoint-sudoku-extreme
	- sapientinc/HRM-checkpoint-maze-30x30-hard
	- google/flan-t5-small
	---

	HRM-LLM: A truly decentralized, human-like reasoning model built by the community

	HRM-LLM is a community-driven large language model powered by the Hierarchical Reasoning Model (HRM) architecture. It aims to be truly decentralized: anyone can train, contribute, and scale it forward from anywhere. HRM-LLM is designed to think and work like a human—iterating, refining, and allocating compute adaptively—so it learns efficiently and generalizes across tasks.

	Why HRM-LLM?
	- Human-like reasoning core: HRM brings hierarchical representations and adaptive computation to mimic iterative human thinking and planning.
	- Adaptive Computation Time (ACT): The model dynamically decides how much “thought” to spend per token—more for hard tokens, less for easy ones.
	- Decentralized and scalable: Anyone can hop in, train a few steps, and push a unified checkpoint to the Hub. Every contribution compounds.
	- Simple, hackable stack: PyTorch + Transformers + Datasets. Easy to extend, easy to improve.
	- Community-aligned progress: Transparent training, open checkpoints, and community governance.

	What this model aims to do
	- Break down complex problems into stages, reason across them, and refine answers over multiple internal steps.
	- Learn efficient patterns via ACT, saving compute where possible and spending it where it matters most.
	- Become a robust, general-purpose assistant shaped by its global community of contributors.

	How you can help
	- Train a few steps in Colab (or locally) and push your contribution.
	- Experiment with hyperparameters, tokenizers, datasets, or new HRM blocks.
	- Share insights and logs to improve the next iteration.

	License
	- This project is licensed under Apache-2.0. You’re free to use, modify, and distribute—with attribution and notice.

	Jump in and train
	- Colab (1-click): https://colab.research.google.com/drive/1xZNYC-yhwdJxzbpwRekE_rDjTki5CvEv?usp=sharing

	Quick start: contribute training from your environment
	Run this to join training and push your contribution to the shared checkpoint.

	That’s it—share the Colab link, invite contributors, and let the community grow HRM-LLM together.