Spaces:

TharaKavin
/

Web-Rag

Sleeping

App Files Files Community

Web-Rag / scraper.py

TharaKavin's picture

Update scraper.py

1cc2027 verified about 2 months ago

396 Bytes

	from scrapling.fetchers import Fetcher

	def scrape_url(url: str) -> str:
	try:
	page = Fetcher.get(url)

	# ✅ Extract text properly
	texts = page.css("body *::text").getall()

	# Clean text
	cleaned = [t.strip() for t in texts if t.strip()]

	return " ".join(cleaned)

	except Exception as e:
	print("SCRAPING ERROR:", e)
	return ""