
title: Scrape Anythings
emoji: ✨
colorFrom: blue
colorTo: green
sdk: streamlit
sdk_version: 1.35.0
python_version: '3.9'
app_file: app.py

✨ Scrape Anythings

A user-friendly Streamlit web application for extracting data from any website, with dedicated support for YouTube and Instagram.
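To illustrate how one URL box can serve general websites, YouTube, and Instagram, here is a minimal sketch of routing a URL by its host name. The function name `classify_url` and the host lists are assumptions made for this example; the app's actual routing logic is not shown in this README.

```python
from urllib.parse import urlparse

# Hypothetical host lists, chosen for illustration only.
YOUTUBE_HOSTS = {"youtube.com", "youtu.be"}
INSTAGRAM_HOSTS = {"instagram.com"}


def classify_url(url: str) -> str:
    """Return 'youtube', 'instagram', or 'website' based on the URL's host."""
    host = (urlparse(url).hostname or "").lower()
    # Drop common subdomain prefixes so "www.youtube.com" and
    # "m.youtube.com" both match "youtube.com".
    for prefix in ("www.", "m."):
        if host.startswith(prefix):
            host = host[len(prefix):]
    if host in YOUTUBE_HOSTS:
        return "youtube"
    if host in INSTAGRAM_HOSTS:
        return "instagram"
    # Anything else falls through to the general-purpose scraper.
    return "website"
```

A result of `"youtube"` or `"instagram"` would then be handed to the matching specialised scraper, and `"website"` to the general one.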

Scrape Any URL: Paste any website, YouTube, or Instagram URL to start.
Multiple Data Types: Extract text, images, links, tables, numbers, and metadata.
Social Media Support: Scrape YouTube video info & comments, and Instagram profile details & posts.
Rich Data Export: Download your data in JSON, CSV, TXT, and structured Excel (.xlsx) formats.
Modern UI: A clean and simple interface for a smooth user experience.
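As a rough idea of what extracting text, links, and images from a page involves, here is a small sketch that uses only Python's standard-library `html.parser`. It is not the code in `scraper.py`; the names `PageExtractor` and `extract` are hypothetical, and a real scraper would also need to fetch the page and handle tables, numbers, and metadata.

```python
from html.parser import HTMLParser


class PageExtractor(HTMLParser):
    """Collect visible text, link targets, and image sources from HTML."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.images = []
        self.text_parts = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "img" and attrs.get("src"):
            self.images.append(attrs["src"])

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not script or style content.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def extract(html: str) -> dict:
    """Parse an HTML string and return its text, links, and images."""
    parser = PageExtractor()
    parser.feed(html)
    parser.close()
    return {
        "text": " ".join(parser.text_parts),
        "links": parser.links,
        "images": parser.images,
    }
```

Calling `extract(html)` on a downloaded page returns a dictionary with one key per data type, which maps naturally onto the data-type choices offered in the UI.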

Create a Hugging Face Account: If you don't have one, sign up at huggingface.co.
Create a New Space:
- Go to huggingface.co/new-space.
- Enter a Space name (e.g., scrape-anythings).
- Select Streamlit as the Space SDK.
- Choose Create a new repository for this Space.
- Click Create Space.
Upload Your Files:
- In your new Space, go to the Files tab.
- Click Upload files.
- Drag and drop all the files from your project folder:
  - app.py
  - scraper.py
  - youtube_scraper.py
  - instagram_scraper.py
  - instagram_scraper_v2.py
  - requirements.txt
  - README.md
- Commit the files directly to the main branch.
Done! Hugging Face will automatically build and launch your application. You can share the URL of your Space with anyone.
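The upload steps above can also be scripted. The sketch below assumes the `huggingface_hub` package is installed and that you are already authenticated (for example with a saved access token); the helper names `missing_files` and `deploy` are hypothetical. It first checks that the files listed above exist, then creates the Space and uploads the folder.

```python
from pathlib import Path

# Files listed in the upload step of this README.
EXPECTED_FILES = [
    "app.py",
    "scraper.py",
    "youtube_scraper.py",
    "instagram_scraper.py",
    "instagram_scraper_v2.py",
    "requirements.txt",
    "README.md",
]


def missing_files(project_dir) -> list:
    """Return the expected files that are not present in project_dir."""
    root = Path(project_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


def deploy(project_dir, repo_id: str) -> None:
    """Create a Streamlit Space (if needed) and upload the project folder."""
    missing = missing_files(project_dir)
    if missing:
        raise FileNotFoundError(f"Missing files: {', '.join(missing)}")

    # Imported here so the file check above works without the package installed.
    from huggingface_hub import HfApi

    api = HfApi()
    api.create_repo(
        repo_id=repo_id,
        repo_type="space",
        space_sdk="streamlit",
        exist_ok=True,
    )
    api.upload_folder(
        folder_path=str(project_dir),
        repo_id=repo_id,
        repo_type="space",
    )
```

For example, `deploy(".", "your-username/scrape-anythings")` would upload the current folder, where `your-username` is a placeholder for your own account name.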

Enter a URL: Paste the URL of the website, YouTube video, or Instagram profile you want to scrape.
Select Data Types: Choose the data you want to extract.
Click Scrape!: Let the app do the work.
View & Download: See the results directly in the app and download them in your preferred format.
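To show what the download step might produce, here is a minimal sketch that serialises scraped records to JSON and CSV with the standard library. The function names `to_json` and `to_csv` and the record layout (a list of dictionaries) are assumptions for this example, not the app's actual export code.

```python
import csv
import io
import json


def to_json(records: list) -> str:
    """Serialise records as pretty-printed JSON, keeping non-ASCII text."""
    return json.dumps(records, indent=2, ensure_ascii=False)


def to_csv(records: list) -> str:
    """Serialise records as CSV; keys missing from a record become empty cells."""
    # Build the header from every key seen, in first-seen order.
    fieldnames = []
    for record in records:
        for key in record:
            if key not in fieldnames:
                fieldnames.append(key)

    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()
```

In a Streamlit app, strings like these can be passed to `st.download_button` to offer them as files.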

This project is licensed under the MIT License - see the LICENSE file for details.

Made with ❤️ for the AI/ML community