ML Training Debugger

Inspect logs, config, and gradients to diagnose a failed training run.
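
As a rough illustration of the log-inspection step, the sketch below scans training log lines for the first sign of trouble: a non-finite loss, a non-finite gradient norm, or a gradient norm above a threshold. The log format ("step=... loss=... grad_norm=..."), the function name first_bad_step, and the default limit of 1e3 are all assumptions made for this example, not part of any particular framework; adapt the pattern to whatever the run actually logs.

```python
import math
import re

# Assumed (hypothetical) log format: "step=<int> loss=<float> grad_norm=<float>"
LINE_RE = re.compile(r"step=(\d+)\s+loss=(\S+)\s+grad_norm=(\S+)")


def first_bad_step(log_lines, grad_norm_limit=1e3):
    """Return (step, reason) for the first suspicious log line, or None.

    grad_norm_limit is an illustrative threshold, not a recommended value.
    """
    for line in log_lines:
        match = LINE_RE.search(line)
        if match is None:
            continue  # not a metrics line
        step = int(match.group(1))
        try:
            loss = float(match.group(2))
            grad_norm = float(match.group(3))
        except ValueError:
            continue  # unparseable numbers; skip rather than guess
        if not math.isfinite(loss):
            return step, "non-finite loss"
        if not math.isfinite(grad_norm):
            return step, "non-finite gradient norm"
        if grad_norm > grad_norm_limit:
            return step, "gradient norm above limit"
    return None


if __name__ == "__main__":
    sample = [
        "step=1 loss=2.31 grad_norm=0.9",
        "step=2 loss=2.10 grad_norm=4500.0",
        "step=3 loss=nan grad_norm=nan",
    ]
    print(first_bad_step(sample))
```

Reporting the first offending step, rather than every one, points at where the run started to diverge; the config in effect at that step (learning rate, warmup, clipping) is then the natural next thing to check.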