Spaces:

sheikhcoders
/

browser-automation-tool

Running

App Files Files Community

sheikhcoders commited on Nov 6, 2025

Commit

a70da23

verified ·

1 Parent(s): 48ec8b8

Update README with proper YAML configuration

Browse files

Files changed (1) hide show

README.md +53 -185

README.md CHANGED Viewed

@@ -9,199 +9,67 @@ app_file: app.py
 pinned: false
 ---
-# Browser Automation Tool 🌐
-A comprehensive web scraping and browser automation platform - an alternative to browserbase.com. This Hugging Face Space provides powerful tools for web data extraction, screenshot capture, form automation, and multi-URL scraping.
-## Features 🚀
-### 🔍 Single URL Analysis
-- **Screenshot Capture**: Take high-quality screenshots of any webpage
-- **Data Extraction**: Extract text, links, images, forms, and custom elements
-- **Custom Selectors**: Use CSS selectors to extract specific data
-- **Headless/Headed Mode**: Choose between invisible or visible browser operation
-### 📊 Multiple URLs Scraping
-- **Concurrent Scraping**: Process multiple URLs simultaneously
-- **Configurable Workers**: Control the number of concurrent processes
-- **Batch Processing**: Extract data from entire lists of URLs
-- **Structured Output**: Get organized results in JSON format
-### 📋 Form Automation
-- **Smart Form Detection**: Automatically detect form fields
-- **Automated Filling**: Fill forms with provided data
-- **Field Type Recognition**: Handle text, email, password, textarea fields
-- **Form Submission**: Submit forms and capture responses
-### 🎮 Interactive Mode
-- **Real-time Screenshot**: Live view of browser activity
-- **Console Output**: Monitor JavaScript console in real-time
-- **Error Handling**: Graceful handling of failed requests
-- **Progress Tracking**: Visual feedback for long operations
-## How to Use 💡
-### 1. Single URL Analysis
-1. Enter a URL in the "Single URL Analysis" tab
-2. Choose screenshot mode (full page or viewport)
-3. Set wait time for page loading
-4. Click "Run Analysis" and wait for results
-### 2. Multiple URLs Scraping
-1. Select the "Multiple URLs Scraping" tab
-2. Paste multiple URLs (one per line)
-3. Configure number of workers (1-10)
-4. Set wait time per URL
-5. Click "Start Scraping" and monitor progress
-### 3. Form Automation
-1. Go to the "Form Automation" tab
-2. Enter target URL and form data
-3. Specify CSS selectors (or use auto-detection)
-4. Click "Fill and Submit" to automate the process
-### 4. Live Console
-1. Navigate to "Live Console" tab
-2. Enter any URL to monitor
-3. Watch real-time browser console output
-4. See JavaScript errors and logs as they happen
-## Supported Features 🛠️
-### Browser Capabilities
-- **Headless Chrome**: Fast, efficient browser automation
-- **Customizable Viewport**: Set specific screen dimensions
-- **JavaScript Execution**: Full JS support for dynamic content
-- **Error Recovery**: Automatic retry mechanisms
-- **Resource Management**: Optimized memory and CPU usage
-### Data Extraction
-- **Text Content**: All visible and hidden text
-- **Links**: Internal and external links
-- **Images**: All images with source URLs
-- **Forms**: All form elements with attributes
-- **Custom Data**: User-defined CSS selectors
-### Output Formats
-- **JSON**: Structured data with metadata
-- **Raw Text**: Clean text content
-- **Screenshot**: Base64 encoded images
-- **CSV**: Spreadsheet-compatible format
-- **HTML**: Full page HTML source
-## Technical Details 🔧
-### Performance
-- **Concurrent Processing**: Multiple URLs processed in parallel
-- **Rate Limiting**: Built-in delays to respect websites
-- **Memory Optimization**: Efficient resource management
-- **Error Resilience**: Continues processing despite individual failures
-### Security
-- **No Data Storage**: All processing happens in memory
-- **Temporary Files**: Screenshots and data cleared after use
-- **Secure Communication**: HTTPS-only external requests
-- **Input Sanitization**: URL and data validation
-### Limitations
-- **Rate Limits**: Built-in delays to avoid overloading websites
-- **JavaScript-Heavy Sites**: May require additional wait time
-- **Captcha Protection**: Cannot bypass CAPTCHA or bot detection
-- **Authentication**: No built-in login/session management
-## API Usage 🐍
-For developers who want to use this tool programmatically, you can import the main functions:
-```python
-from app import BrowserAutomationTool
-# Initialize the tool
-tool = BrowserAutomationTool()
-# Capture screenshot
-screenshot = tool.capture_screenshot(
-    url="https://example.com",
-    full_page=True,
-    wait_time=3
-)
-# Extract data
-data = tool.scrape_website(
-    url="https://example.com",
-    selectors=["h1", "p", "a"]
-)
-# Batch scraping
-results = tool.scrape_multiple_urls(
-    urls=["https://site1.com", "https://site2.com"],
-    max_workers=2,
-    wait_time=2
-)
-```
-## Examples 📚
-### Example 1: News Website Analysis
-```
-URL: https://news.ycombinator.com
-Screenshot: Full page
-Extraction: Titles, links, comments
-Expected Output: List of current news stories with engagement metrics
-```
-### Example 2: E-commerce Price Comparison
-```
-URLs:
-- https://amazon.com/product/123
-- https://ebay.com/product/456
-- https://walmart.com/product/789
-Extraction: Price, title, availability
-Expected Output: Comparative pricing data
-```
-### Example 3: Form Automation
-```
-URL: https://example.com/contact
-Form Data: name="John Doe", email="john@example.com"
-Action: Fill and submit contact form
-Expected Output: Success confirmation
-```
-## Troubleshooting 🔍
-### Common Issues
-- **"Page took too long to load"**: Increase wait time, check URL accessibility
-- **"Element not found"**: Verify CSS selector, check if page uses dynamic loading
-- **"Screenshot failed"**: Ensure URL is accessible, check for popup blockers
-- **"Scraping timeout"**: Reduce number of workers, increase timeout values
-### Performance Tips
-- **For large lists**: Start with fewer workers and increase gradually
-- **For slow sites**: Increase wait time between requests
-- **For dynamic content**: Use JavaScript execution wait
-- **For memory issues**: Process URLs in smaller batches
-## Support & Contributing 💬
-### Getting Help
-- **Check the examples** in the interface for common use cases
-- **Verify your URLs** are publicly accessible
-- **Test with simpler pages** first to debug issues
-- **Check the console output** for error messages
-### Contributing
-This is an open-source tool. Feel free to:
-- Report bugs and issues
-- Suggest new features
-- Submit improvements
-- Share your use cases
-### License
-MIT License - Feel free to use this tool in your projects.
----
-**Made with ❤️ for the developer community**
-*Alternative to browserbase.com - Bringing powerful browser automation to everyone.*

 pinned: false
 ---
+# Browser Automation Tool
+A comprehensive browser automation platform that combines the power of Gradio for web UI and FastAPI for programmatic access. This tool enables seamless web scraping, browser automation, and integration with AI agents.
+## Features
+- 🌐 **Web Browser Automation**: Control Chrome/Chromium browsers programmatically
+- 📸 **Screenshot Capture**: Take screenshots of any webpage
+- 🔍 **Web Scraping**: Extract data from HTML content
+- 🤖 **AI Agent Integration**: Compatible with Model Context Protocol (MCP)
+- ⚡ **Real-time Streaming**: Server-Sent Events (SSE) for live updates
+- 🚀 **RESTful API**: FastAPI backend for programmatic access
+- 📱 **Web Interface**: User-friendly Gradio UI
+## Quick Start
+### Using the Web Interface
+1. Visit the Space URL above
+2. Enter a URL to navigate
+3. Use built-in tools for automation
+### Using the API
+```python
+import requests
+# Get available sessions
+response = requests.get('https://your-space-url/api/sessions')
+sessions = response.json()
+# Create a new browser session
+session_data = {
+    'headless': True,
+    'window_size': {'width': 1920, 'height': 1080}
+}
+response = requests.post('https://your-space-url/api/sessions', json=session_data)
+session_id = response.json()['session_id']
+# Navigate to a website
+requests.post(f'https://your-space-url/api/sessions/{{session_id}}/navigate',
+              json={{'url': 'https://example.com'}})
+# Take a screenshot
+response = requests.get(f'https://your-space-url/api/sessions/{{session_id}}/screenshot')
+with open('screenshot.png', 'wb') as f:
+    f.write(response.content)
+```
+## API Endpoints
+### Session Management
+- `GET /api/sessions` - List all active sessions
+- `POST /api/sessions` - Create new browser session
+- `DELETE /api/sessions/{{session_id}}` - Close session
+### Browser Control
+- `POST /api/sessions/{{session_id}}/navigate` - Navigate to URL
+- `GET /api/sessions/{{session_id}}/screenshot` - Take screenshot
+- `POST /api/sessions/{{session_id}}/click` - Click element
+- `POST /api/sessions/{{session_id}}/type` - Type text
+- `GET /api/sessions/{{session_id}}/get_page_source` - Get HTML
+## Author
+**MiniMax Agent** - Advanced AI automation tools