microsoft/bloom-deepspeed-inference-fp16

How to split tensors into x shards?

#1 opened about 3 years ago by