---
tags:
- mamba
- recursive-flow
- pytorch
- custom-architecture
---
# Recursive-Flow Mamba-2 (1.5B)

This is an experimental language model trained on a single NVIDIA H100 using a custom **Recursive-Flow Mamba** architecture, in which each physical layer is applied multiple times in a loop.
## Architecture Details

- **Base:** Mamba-2 (state space model)
- **Parameters:** ~1.5 billion
- **Physical Layers:** 24
- **Recursive Depth:** 3 loops per layer (effective depth: 72)
- **Training Data:** OpenMathInstruct-2 (math/logic focus)
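The recursion scheme above can be sketched as a weight-tied residual loop. This is a minimal illustration, not the actual model code: the class name `RecursiveFlowBlock` is hypothetical, and a plain linear layer stands in for the real Mamba-2 mixer (which would come from the `mamba_ssm` package). The key property it demonstrates is that looping reuses the same weights, so 24 physical layers give an effective depth of 72 without adding parameters.

```python
import torch
import torch.nn as nn

class RecursiveFlowBlock(nn.Module):
    """One physical layer whose weights are reused for several loops.

    Hypothetical sketch: the `mixer` here is a stand-in Linear layer,
    not a real Mamba-2 state-space mixer.
    """
    def __init__(self, d_model: int, n_loops: int = 3):
        super().__init__()
        self.n_loops = n_loops
        self.norm = nn.LayerNorm(d_model)
        self.mixer = nn.Linear(d_model, d_model)  # stand-in for a Mamba-2 mixer

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Pass through the same weights n_loops times (weight tying):
        # each pass is a pre-norm residual update.
        for _ in range(self.n_loops):
            x = x + self.mixer(self.norm(x))
        return x

# 24 physical layers, each looped 3 times -> effective depth 72.
model = nn.Sequential(*[RecursiveFlowBlock(d_model=256, n_loops=3) for _ in range(24)])
x = torch.randn(2, 16, 256)  # (batch, sequence, d_model)
y = model(x)
```

Because the loop reuses the block's parameters, setting `n_loops=1` or `n_loops=3` leaves the parameter count unchanged; only compute and effective depth grow.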
## How to Run

This model requires custom code to handle the recursive loops, so standard loaders will not work out of the box. See the `chat.py` script used during training for how to load the weights and run inference.