Qwen2.5-1.5B-think-spkd / run_spkd.log
JungHun's picture
Upload model: spkd_think
b4a6e80 verified
[2025-11-30 01:59:24,307][accelerate.accelerator][WARNING] - Gradient accumulation steps mismatch: GradientAccumulationPlugin has 1, DeepSpeed config has 2. Using DeepSpeed's value.