megatron training support

#7
by study-hjt - opened

ms-swift supports training hy_v3 with transformers/megatron backends.
PR:
https://github.com/modelscope/ms-swift/pull/9198
https://github.com/modelscope/mcore-bridge/pull/53

Tencent org

Thanks for sharing! Great to see ms-swift adding Megatron training support for Hy3-preview. Appreciate the contribution!

Sign up or log in to comment