Why does this take so long to load?

by Robo0890 - opened Nov 17, 2022

Discussion

Robo0890

Nov 17, 2022

Every time I try to run it, it spends so long just to load the model. So long that it times out.

Muennighoff

BigScience Workshop org Nov 18, 2022

•

edited Nov 18, 2022

Every time I try to run it, it spends so long just to load the model. So long that it times out.

If you're referring to the Hosted Inference API, that's because there is no GPU provisioned for this model and the model is huge, so it will take very very long.
If you want to run it, you need to download the model and run it on your own hardware, sorry :(

Here are some guidelines for running it: https://huggingface.co/bigscience/bloomz/discussions/18#636b6ad958a8f9348d0ab82c

TimeRobber

BigScience Workshop org Nov 18, 2022

We should probably disable the widget as it may be confusing then

Robo0890

Nov 18, 2022

•

edited Nov 18, 2022

Ah, got it.
Thanks. I was just attempting to try it out. If it does what I think it does, then it could be just as good if not better than GPT-3.
Nice work, I love open source.
Even if I can’t run it :(

Robo0890 changed discussion status to closed Nov 19, 2022

TimeRobber

BigScience Workshop org Nov 21, 2022

FYI removed the widget to prevent more confusion about this: https://huggingface.co/bigscience/bloomz-p3/commit/51f3d0d7079a37501554eb7ce2558012bb96d062

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment