โš ๏ธ Why the App Takes Time to Load or Respond

#1
by Kakaarot - opened

Hey everyone! ๐Ÿ‘‹ Just a quick heads-up for anyone wondering why the app is taking time for responses little longer,
This app uses the google/gemma-2b language model, which is relatively large and resource-intensive โ€” even though it's one of the smaller variants.

Since this Space runs on the free CPU tier, it comes with some limitations:

Cold Start Delays: Free-tier Spaces go to sleep when inactive and take longer to restart (especially with large models).

Model Loading Time: Even with @st .cache_resource, the first-time model load can take a while โ€” especially for models like Gemma, which need to download and initialize multiple components.

๐Ÿ“ฉ If you get stuck on loading, refresh and wait ~1โ€“2 mins โ€” after that, the app should respond smoothly!

Appreciate your understanding ๐Ÿ™

Using this as prototype to prepare a concrete proposal for GSOC 2025, So yep, Lets see....

Sign up or log in to comment