File size: 601 Bytes
29ac9ac | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 | ---
tags:
- mamba
- recursive-flow
- pytorch
- custom-architecture
---
# Recursive-Flow Mamba-2 (1.5B)
This is an experimental AI model trained on an H100 using a custom **Recursive-Flow Mamba** architecture.
## Architecture Details
- **Base:** Mamba-2 (State Space Model)
- **Parameters:** ~1.5 Billion
- **Physical Layers:** 24
- **Recursive Depth:** 3 Loops per layer (Effective Depth: 72)
- **Training Data:** OpenMathInstruct-2 (Math Logic Focus)
## How to Run
This model requires custom code to handle the recursive loops.
See the `chat.py` script used during training to load the weights.
|