Scrapazoid Documentation¶
Welcome to Scrapazoid - a Flask-based web platform for automated web scraping using Playwright.
Overview¶
Scrapazoid provides a user-friendly web interface for writing, executing, and monitoring Playwright-based web scraping scripts. It includes real-time execution monitoring, screenshot capture, and comprehensive logging.
Key Features¶
- ๐ฏ Web-Based Editor - Monaco code editor with Python syntax highlighting
- ๐ Real-Time Execution - Watch your scripts run with live logs and screenshots
- ๐ธ Screenshot Capture - Automatic and manual screenshot capabilities
- ๐ฅ File Downloads - Download files during script execution with security controls
- ๐ฆ Data Import - Import JSON data to parameterize scripts without editing code
- ๐ Debug Logging - Comprehensive logging at every step
- ๐ Execution History - Review past executions with full details
- ๐ Script Versioning - Track every change and see what code produced results
- ๐ก๏ธ Sandboxed Execution - Secure script execution environment
- โฑ๏ธ Automatic Timeouts - Scripts automatically timeout after 5 minutes
- ๐งน Cleanup System - Background cleanup for stuck executions and old files
- ๐ฅ Multi-User Support - Secure authentication with isolated workspaces
Quick Start¶
async def main(page):
# Navigate to a webpage
await page.goto('https://example.com')
# Get page title
title = await page.title()
print(f'Page title: {title}')
# Extract data
scrape_data({'title': title})
Architecture¶
Scrapazoid consists of several key components:
- Flask Application - Web server and API
- SocketIO - Real-time communication for live updates
- Playwright Executor - Sandboxed script execution engine
- Execution Monitor - Tracks and stores execution results
- Cleanup System - Automatically handles stuck executions
Getting Started¶
- Installation Guide - Set up Scrapazoid
- Quick Start - Run your first script
- Configuration - Configure your instance
Support¶
- GitHub Issues: Report bugs or request features
- Documentation: You're reading it!
- Examples: Check the Examples section
License¶
See the LICENSE file in the repository for details.