# dragon/optimizers/__init__.py
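"""Optimizer classes re-exported by this package."""
from .Ademamix import AdEMAMix
from .Snoo import Snoo
__all__ = ["AdEMAMix", "Snoo"]  # names exported by `from dragon.optimizers import *`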