✨ Vision-Based AI Agent Scraper - with Google Sheets, ScrapingBee, and Gemini

Go to Workflow
35,213 views
Built by Dataki Dataki
Created on June 05, 2026

Description

Important Notes:
Check Legal Regulations:
This workflow involves scraping, so ensure you comply with the legal regulations in your country before getting started. Better safe than sorry!

Workflow Description:
😮‍💨 Tired of struggling with XPath, CSS selectors, or DOM specificity when scraping ?

This AI-powered solution is here to simplify your workflow! With a vision-based AI Agent, you can extract data effortlessly without worrying about how the DOM is structured.

This workflow leverages a vision-based AI Agent, integrated with Google Sheets, ScrapingBee, and the Gemini-1.5-Pro model, to extract structured data from webpages. The AI Agent primarily uses screenshots for data extraction but switches to HTML scraping when necessary, ensuring high accuracy.

Key Features:
Google Sheets Integration**: Manage URLs to scrape and store structured results.
ScrapingBee**: Capture full-page screenshots and retrieve HTML data for fallback extraction.
AI-Powered Data Parsing**: Use Gemini-1.5-Pro for vision-based scraping and a Structured Output Parser to format extracted data into JSON.
Token Efficiency**: HTML is converted to Markdown to optimize processing costs.

This template is designed for e-commerce scraping but can be customized for various use cases.

Nodes Used (6)

AI Agent
@n8n/n8n-nodes-langchain.agent
Call n8n Workflow Tool
@n8n/n8n-nodes-langchain.toolWorkflow
Google Gemini Chat Model
@n8n/n8n-nodes-langchain.lmChatGoogleGemini
Google Sheets
n8n-nodes-base.googleSheets
HTTP Request
n8n-nodes-base.httpRequest
Structured Output Parser
@n8n/n8n-nodes-langchain.outputParserStructured