File size: 601 Bytes
29ac9ac
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
---
tags:
- mamba
- recursive-flow
- pytorch
- custom-architecture
---

# Recursive-Flow Mamba-2 (1.5B)

This is an experimental AI model trained on an H100 using a custom **Recursive-Flow Mamba** architecture.

## Architecture Details
- **Base:** Mamba-2 (State Space Model)
- **Parameters:** ~1.5 Billion
- **Physical Layers:** 24
- **Recursive Depth:** 3 Loops per layer (Effective Depth: 72)
- **Training Data:** OpenMathInstruct-2 (Math Logic Focus)

## How to Run
This model requires custom code to handle the recursive loops. 
See the `chat.py` script used during training to load the weights.