Kernels
activation / torch-ext

Commit History

fix(rms_norm.py): add assertion for input gradients to handle unsupported cases in backward pass
f19f8f4

wyldecat commited on

feat: support sequence parallel with fused_add_rms_norm
151bb5a

wyldecat commited on

refactor(activation): change fused_add_rms_norm and fused_add_rms_norm_backward to out-place operations
7e4334d

wyldecat commited on

refactor(rms_norm): move RMS normalization logic to a new module for better organization and maintainability
66b3c5e

wyldecat commited on

feat: support sequence parallel with rms_norm
06d6367

wyldecat commited on

feat: make rms_norm as out-place
9d0a235

wyldecat commited on

Fix fused add rms norm (#4)
a1e5ca8
unverified

TaehyunKim TaehyunKimMotif commited on

feat: support reset_parameters()
605f22e

iamwyldecat commited on

feat(rms-norm): Impl fused RMSNorm
f3b99fb

iamwyldecat commited on

refactor(poly-norm): use const for immutable args
e85ecc9

iamwyldecat commited on

chore: use latest build image and misc
f5a7d38

iamwyldecat commited on

feat(poly-norm): add default value for eps argument
afd2a56

iamwyldecat commited on

feat(poly-norm): Add PolyNorm
44e9845

iamwyldecat commited on