LogicGoInfotechSpaces's picture
Fix CUDA operations to check device: only use fused ops when input is on CUDA
80d11e4