KronosLabs/Iapetus-v2-Kernel


KronosLabs committed on Apr 17

Commit c2b5b6a (verified) · 1 Parent(s): 4bddfeb

Update README.md

Files changed (1)

README.md +4 -0


@@ -13,8 +13,12 @@ It was trained on closed source datasets containing synthetically generated and
 The model contains the following layout:
 Size: 80B Parameters, A3B (3 Billion Active)
 Depth: 48 layers
 Hybrid layout: 12 * (3 * (Gated DeltaNet -> MoE) -> 1 * (Gated Attention -> MoE))
 MoE: 512 experts, 10 activated experts, 1 shared expert
 Context Length: 262,144 native
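
As a sanity check on the numbers above, the hybrid layout line can be expanded and counted. This is a minimal sketch, not the model's actual configuration code: the names `expand_layout`, `DELTANET`, and `ATTENTION` are hypothetical labels chosen for illustration, and the ratios are computed only from the figures stated in the README.

```python
from collections import Counter

# Hypothetical labels for the two token-mixer types named in the layout line.
DELTANET = "gated_deltanet"
ATTENTION = "gated_attention"


def expand_layout(groups=12, deltanet_per_group=3, attention_per_group=1):
    """Return the token-mixer type of each layer, in order.

    Per the layout line, every mixer is followed by an MoE block,
    so one list entry corresponds to one (mixer -> MoE) layer.
    """
    group = [DELTANET] * deltanet_per_group + [ATTENTION] * attention_per_group
    return group * groups


layers = expand_layout()
counts = Counter(layers)

print(len(layers))        # total layers: 12 * (3 + 1)
print(counts[DELTANET])   # Gated DeltaNet layers: 12 * 3
print(counts[ATTENTION])  # Gated Attention layers: 12 * 1

# Ratios derived from the stated figures (assumption: "A3B" means
# roughly 3B of the 80B parameters are active per token).
routed_expert_fraction = 10 / 512   # activated routed experts / total experts
active_param_fraction = 3e9 / 80e9  # active parameters / total parameters
print(routed_expert_fraction, active_param_fraction)
```

The expansion gives 48 layers, matching the stated depth, with a 3:1 ratio of Gated DeltaNet to Gated Attention layers.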
