change model to gemini-flash-lite to avoid rate limit a2c76ca verified jay0911 commited on Jul 31, 2025
removing model.cuda and device=0 since accelerate takes care of this f7c4dd5 verified jay0911 commited on Jul 31, 2025