v0.5 CRITICAL: Fix patch_size to scale with image size (L ≤ 256 always), reduce d_state for speed, add config table 1c47b5f verified krystv committed 14 days ago
v0.4: Use mambapy.pscan (proven, grad-safe parallel scan); no more torch.associative_scan 099d3c7 verified krystv committed 14 days ago
v0.3: PARALLEL SSM scan (torch.associative_scan), patch_size 4→8, no more Python for-loop 380e43f verified krystv committed 14 days ago
Fix: remove duplicate forward method in LiquidSSMBlock, clean up dead code dab968e verified krystv committed 14 days ago
CRITICAL FIX for OOM on T4: rewrite SSM scan to not materialize 4D tensors, add gradient checkpointing, optimize Liquid CfC memory 82aa8b4 verified krystv committed 14 days ago