Swarnabha123 commited on
Commit
29ac9ac
·
verified ·
1 Parent(s): cefd30c

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +22 -3
README.md CHANGED
@@ -1,3 +1,22 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - mamba
4
+ - recursive-flow
5
+ - pytorch
6
+ - custom-architecture
7
+ ---
8
+
9
+ # Recursive-Flow Mamba-2 (1.5B)
10
+
11
+ This is an experimental AI model trained on an H100 using a custom **Recursive-Flow Mamba** architecture.
12
+
13
+ ## Architecture Details
14
+ - **Base:** Mamba-2 (State Space Model)
15
+ - **Parameters:** ~1.5 Billion
16
+ - **Physical Layers:** 24
17
+ - **Recursive Depth:** 3 Loops per layer (Effective Depth: 72)
18
+ - **Training Data:** OpenMathInstruct-2 (Math Logic Focus)
19
+
20
+ ## How to Run
21
+ This model requires custom code to handle the recursive loops.
22
+ See the `chat.py` script used during training to load the weights.