Inference on TPU-v3-32

#68

by zhiG - opened Jul 29, 2022

base: refs/heads/main

←

from: refs/pr/68

Discussion Files changed

+1736

-1101

zhiG

Jul 29, 2022

No description provided.

zhiG

Jul 29, 2022

I think maybe using TPU-v3-32 is the most cost-effective way for Bloom inference.

ybelkada

BigScience Workshop org Jul 29, 2022

hi @zhiG !
Thanks for the suggestion ! We already made an effort for BLOOM-tpu inference that you can have a look here: https://github.com/huggingface/bloom-jax-inference
Let us know if you have any more questions :)

ybelkada

BigScience Workshop org Jul 29, 2022

Also could you please move the discussion into a discussion instead of a PR 🙏 Thank you !

ybelkada changed pull request status to closed Jul 29, 2022

zhiG

Jul 29, 2022

Thank you for your information! I am quite new to the Huggingface community. I am sorry that I cannot find a method to move it to the discussion. I can't even delete it .

ybelkada

BigScience Workshop org Jul 29, 2022

No worries at all ! Let me create a discussion and ping you there ;)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment