Commit History

add _is_space flag, block inference during training, understand clone fix
3c15b8b

Nekochu commited on

fix understand_audio: clone tensors for inference mode, working on GPU (52s)
4b2f4ad

Nekochu commited on

add understand_audio (LM reverse), demucs-infer fix, commit refs, dtype fixes
6bfdc38

Nekochu commited on

major update: PyTorch inference, Gradio 6, session isolation, /understand captioning
ff239f5

Nekochu commited on

truncate long files to fit cap, show which files truncated/skipped
bc97006

Nekochu commited on

accept files until total audio cap reached, skip rest with warning
956dc8c

Nekochu commited on

replace per-file cap with total audio cap (30 min max)
1549b91

Nekochu commited on

add LoRA download button after training (gr.File output, like rvc-beatrice)
2d3c27c

Nekochu commited on

remove ace-server understand proxy, captioning stays librosa + txt sidecars
5b7a56f

Nekochu commited on

add /understand API endpoint for GGUF audio captioning
a4457c3

Nekochu commited on

SDPA first on Blackwell, FA2 only for Ampere/Hopper, txt caption support
04ccf32

Nekochu commited on

add GPU/CUDA auto-detect, mixed precision, flash_attn, txt caption parser
917e4ed

Nekochu commited on

update defaults: LR 3e-4, rank 32, alpha 2x rank (per Side-Step author)
04c031f

Nekochu commited on

save only on cancel/finish, max epochs 1000, default 3
d42aa91

Nekochu commited on

add mid/sas analysis modes (Demucs + ensemble), auto-select by dataset size
b38d0b1

Nekochu commited on

add auto-captioning (BPM/key/signature via librosa), add librosa+mutagen deps
1d42836

Nekochu commited on

switch training to standard turbo (11s/epoch), auto-select standard GGUF for LoRA inference
c0f2a13

Nekochu commited on

add XL variant mappings to _VARIANT_DIR
3c5ce9c

Nekochu commited on

fix: train on XL turbo (matches XL GGUF for inference), add XL checkpoint download
372f08e

Nekochu commited on

add full README with API docs, MCP, CLI, architecture
9d2d424

Nekochu commited on

allow custom LoRA values in dropdown (API clients)
2bd2612

Nekochu commited on

log ace-server restart, show output for adapter debugging
d2ae079

Nekochu commited on

fix: forward adapter to synth request, default LM to 1.7B
b23b6b8

Nekochu commited on

fix: adapter saved to clean dir, LM dropdown no 'Default', on-demand download
e62602f

Nekochu commited on

copy train_engine.py into Docker image
5fe3c53

Nekochu commited on

Side-Step training engine, tested locally on CPU
a07b39d

Nekochu commited on

switch back to 1.7B LM (fastest at 269s, 0.6B was 936s)
5e95353

Nekochu commited on

swap LM to 0.6B Q8_0 for speed test
88b9223

Nekochu commited on

swap LM 4B->1.7B Q8_0 for faster CPU inference
b14d3e8

Nekochu commited on

add LoRA adapter dropdown to inference UI
5c2e4e7

Nekochu commited on

default mp3, remove format selector, increase LM timeout to 900s
882ed5c

Nekochu commited on

remove accelerate (causes meta tensors), clean up patches
153f929

Nekochu commited on

fix meta tensor crash: force low_cpu_mem_usage=False and float32 for CPU
9ed24c7

Nekochu commited on

disable flash_sdp on CPU, force attn_implementation=sdpa for training
13f9406

Nekochu commited on

add granular logging + full stderr to diagnose preprocessing hang
6cee8bd

Nekochu commited on

fix: use float32 not bfloat16 for CPU training (bf16 deadlocks on CPU)
560b5e0

Nekochu commited on

redirect training subprocess stderr to log file for debugging
e69e9ec

Nekochu commited on

add einops + vector_quantize_pytorch for model loading
88ca206

Nekochu commited on

run training as detached subprocess to survive Gradio session timeout
a4a86a8

Nekochu commited on

use bfloat16 precision for training to halve RAM usage
a0e1f4c

Nekochu commited on

stop ace-server during training to free RAM, restart after, add log visibility
c2cb0b9

Nekochu commited on

pre-download training checkpoints at build time
c37e80e

Nekochu commited on

pin diffusers==0.30.3 for torch 2.4.x compat
f59d542

Nekochu commited on

pin torchaudio==2.4.0 (before torchcodec default backend)
148cd6b

Nekochu commited on

force uninstall torchcodec (torchaudio dependency, broken on Ubuntu 22.04)
9822bee

Nekochu commited on

monkey-patch torchaudio.load to use soundfile backend
afccbc0

Nekochu commited on

remove torchcodec (needs FFmpeg 5.x), keep soundfile backend
63d7cb8

Nekochu commited on

add torchcodec + ffmpeg for torchaudio audio loading
f1f383b

Nekochu commited on

add soundfile + libsndfile1 for torchaudio backend
dd7a793

Nekochu commited on

add all training deps: diffusers lightning numpy tensorboard
eb1b926

Nekochu commited on