Instructions for using togethercomputer/m2-bert-80M-32k-retrieval with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use togethercomputer/m2-bert-80M-32k-retrieval with Transformers:
```python
# Load model directly
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "togethercomputer/m2-bert-80M-32k-retrieval",
    trust_remote_code=True,
    dtype="auto",
)
```
- Notebooks
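Once loaded, the model can be run on tokenized text. A minimal inference sketch, with two assumptions to verify against the model card: that the repository ships a compatible tokenizer loadable via `AutoTokenizer`, and that the custom remote code returns the embedding under a `sentence_embedding` key:

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "togethercomputer/m2-bert-80M-32k-retrieval"

# Assumption: the repo provides a tokenizer config; otherwise load the
# base tokenizer named in the model card.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(
    model_id, trust_remote_code=True
)
model.eval()  # disable dropout etc. for inference

inputs = tokenizer(
    "A long document to embed.",
    return_tensors="pt",
    truncation=True,
    max_length=32768,  # the 32k context this checkpoint is trained for
)
# no_grad() skips building the autograd graph, which substantially
# reduces memory use at inference time
with torch.no_grad():
    outputs = model(**inputs)

# Assumption: the remote code exposes the embedding under this key
embedding = outputs["sentence_embedding"]
```

The `torch.no_grad()` context matters here: without it, activations are kept around for backpropagation, which can exhaust GPU memory on long inputs even when no training is happening.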
- Google Colab
- Kaggle
What server configuration is required to run this model?
#2
by hongdouzi - opened
I set max_seq_length=5000, but I still run out of memory when inputting text of that length.
My system RAM is about 32 GB, and my GPU memory is approximately 24 GB.
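One common workaround when a single long input overflows memory is to embed the text in overlapping chunks and combine the per-chunk embeddings (e.g. by averaging). A sketch of the windowing logic alone; the `chunk_ids` helper and its `window`/`stride` parameters are illustrative, not part of the model's API:

```python
def chunk_ids(token_ids, window=4096, stride=3584):
    """Split a token-id sequence into overlapping windows.

    window: tokens per chunk; stride: step between chunk starts,
    so consecutive chunks overlap by (window - stride) tokens.
    """
    if len(token_ids) <= window:
        return [token_ids]
    chunks = []
    start = 0
    while start < len(token_ids):
        chunks.append(token_ids[start:start + window])
        if start + window >= len(token_ids):
            break
        start += stride
    return chunks

# Example: 10,000 ids -> chunks of at most 4096 tokens, 512-token overlap
chunks = chunk_ids(list(range(10_000)))
```

Each chunk can then be passed through the model separately (each forward pass stays within the memory budget), and the resulting embeddings pooled into one vector for the document.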
Do you run the model in inference mode, or wrap the forward pass in a torch.no_grad() context, when running inference?
You could set it up with the Inference Endpoints available on the Hugging Face Hub, but note that they charge for as long as the endpoint is running!
You might be able to get away with running it locally, but honestly, unless you're running very fast GPUs, you're better off using Inference Endpoints, Google Colab, or RunPod.