fix: eliminate API leaks in training loop, set 30k steps for A100 a5c7dd0 garvitsachdeva commited on 20 days ago
perf: force ST to CUDA, disable LLM tasks, reduce to 100k steps b71b25c garvitsachdeva commited on 20 days ago