Extract Specific Website Data with Form Input, Gemini 2.5 flash and Gmail

Go to Workflow
16 views
Built by Billy Christi Billy Christi
Created on June 06, 2026

Description

What this workflow does

This workflow creates an automated web scraper that accepts form submissions, extracts specific data from any website using AI, and emails the results back to you.

Step by step:
Web Scraper Form Submission provides a web form interface where users submit a URL and specify what data to extract
Get HTML from Source URL fetches the complete HTML content from the provided website
HTML Extractor processes the raw HTML and extracts the body content for analysis
Data Extractor LLM Chain uses Google Gemini AI to intelligently analyze the content and extract only the specific data requested by the user
Structured Output Parser formats the AI response into clean JSON structure with standardized format
Gmail Send Result delivers the extraction results via email including the source URL, extraction request details, and clean extracted results

How to set up

Connect your Google Gemini API to the Google Gemini Chat Model node for AI-powered data extraction
Connect your Gmail account to the Gmail node for sending result emails
Update the recipient email in the Gmail node
Customize the extraction prompt in the Data Extractor LLM Chain node based on your specific requirements

How to customize this workflow to your needs

Switch AI models**: Replace Google Gemini with OpenAI, Claude, or other LLM providers in the Chat Model node based on your accuracy requirements and budget preferences
Change result delivery**: Replace Gmail with Google Sheets for data storage, Outlook for corporate email, Slack for team notifications, or webhook integrations for custom applications
Customize extraction prompts**: Modify the LLM prompt in the Data Extractor Chain to handle specific data types, extraction formats, or industry-specific terminology for your use case

Need help customizing?

Contact me for consulting and support:
📧 [email protected]

Nodes Used (6)

Basic LLM Chain
@n8n/n8n-nodes-langchain.chainLlm
Gmail
n8n-nodes-base.gmail
Google Gemini Chat Model
@n8n/n8n-nodes-langchain.lmChatGoogleGemini
HTML
n8n-nodes-base.html
HTTP Request
n8n-nodes-base.httpRequest
Structured Output Parser
@n8n/n8n-nodes-langchain.outputParserStructured