Extract and Organize Colombian Invoices with Gmail, GPT-4o & Google Workspace

Go to Workflow
2,536 views
Built by Juan Sanchez Juan Sanchez
Created on June 08, 2026

Description

馃Ь Personal Invoice Processor

This N8N workflow automates the extraction and organization of personal invoices in Colombia received via Gmail. It includes the following key steps:

馃攣 Flow Summary

Email Trigger
Polls Gmail every 30 minutes for emails with .zip attachments (assumed to contain invoices).
Expects ZIP file following DIAN standards.

ZIP File Handling
Extracts all files.
Filters only PDF and XML files for processing.

Data Extraction & Processing
Uses LangChain Agent + OpenAI (GPT-4o-mini) to extract:
Tipo de documento (Factura / Nota Cr茅dito)
N煤mero de factura
Fecha de emisi贸n (YYYY-MM-DD)
NIT emisor y receptor (sin d铆gito de verificaci贸n)
Raz贸n social del emisor
Subtotal, IVA, Total
CUFE
Resumen de compra (max 20 words, formatted sentence)

Validation
Ensures Total = Subtotal + IVA using a calculator node.

Storage
Uploads the original PDF to Google Drive.
Renames the file to: YYYY-MM-DD-NUMERO_FACTURA.pdf.
Inserts or updates invoice details in Google Sheets using a unique Key (NIT_Emisor + Numero_Factura) to prevent duplication.

> 鈿欙笍 Designed for personal use with minimal latency tolerance and high automation reliability.

Nodes Used (8)

AI Agent
@n8n/n8n-nodes-langchain.agent
Calculator
@n8n/n8n-nodes-langchain.toolCalculator
Code
n8n-nodes-base.code
Compression
n8n-nodes-base.compression
Google Drive
n8n-nodes-base.googleDrive
Google Sheets
n8n-nodes-base.googleSheets
OpenAI Chat Model
@n8n/n8n-nodes-langchain.lmChatOpenAi
Structured Output Parser
@n8n/n8n-nodes-langchain.outputParserStructured