Selenium Ultimate Scraper Workflow

September 22, 2025

Aladuddin Aladin

This workflow is a powerful web scraping and data extraction automation using Selenium and OpenAI. It allows you to collect structured data from almost any website, whether public or behind a login, while handling anti-bot protections and analyzing scraped pages with AI.

It supports:

  • Running in a Selenium container with optional proxy configuration.
  • Scraping with or without authentication (via session cookies).
  • Automatic screenshot capture and AI-based content extraction.
  • Handling of blocked pages, errors, and fallback logic.

🚀 Features

  1. Webhook Trigger: Accepts JSON input with subject, domain, target URL, and data fields.
  1. Google Search + Smart URL Extraction: Finds the most relevant page from a given domain using query + AI filtering.
  1. Selenium Browser Control:
  • Launches and manages Chrome sessions inside a Dockerized Selenium container.
  • Supports proxy configuration for bypassing restrictions.
  • Can inject cookies for scraping logged-in pages.
  1. Anti-Bot Evasion: Modifies WebDriver fingerprints to avoid detection.
  1. Dynamic Page Handling: Resizes browser window, refreshes pages, and ensures page load stability.
  1. AI-Powered Data Extraction:
  • Uses OpenAI GPT-4o / GPT-4o-mini to analyze screenshots and extract structured data.
  • Extracts multiple attributes (up to 5 custom data points).
  • Handles cases where no relevant data is found.
  1. Error & Block Handling:
  • Returns clear JSON responses if the request is blocked, cookies don’t match, or pages crash.
  • Captures screenshots for debugging when issues occur.
  1. Proxy Debugging: Built-in flow to verify your scraping IP via ip-api.com.

About the author

Alauddin Aladin is an AI Automation expert helping businesses streamline operations, boost productivity, and scale effortlessly using tools like Make.com and n8n. With over a decade of experience in digital systems and automation strategy, Alauddin empowers entrepreneurs to save time and grow smarter through intelligent workflows and AI-driven solutions.

Leave a Comment