Convert PDF Documents to AI Podcasts with Google Gemini and Text-to-Speech
Go to WorkflowDescription
Convert PDF documents to AI-generated podcasts with Google Gemini and Text-to-Speech
Transform any PDF document into an engaging, natural-sounding podcast using Google's Gemini AI and advanced Text-to-Speech technology. This automated workflow extracts text content, generates conversational scripts, and produces high-quality audio files.
Who is this for?
This workflow template is perfect for content creators, educators, researchers, and marketing professionals who want to repurpose written content into audio format. Ideal for creating podcast episodes, educational content, or making documents more accessible.
What problem does this solve?
Converting written documents to engaging audio content manually is time-consuming and requires scriptwriting skills. This workflow automates the entire process, turning static PDFs into dynamic, conversational podcasts that sound natural and engaging.
What this workflow does
Extracts text from uploaded PDF documents
Generates podcast script using Google Gemini AI with conversational tone
Converts script to speech using Google's advanced TTS with customizable voices
Processes audio into properly formatted WAV files
Saves final podcast ready for distribution
Setup
Obtain API credentials:
Get Google Gemini API key from AI Studio
Configure credentials in n8n as "Google Gemini(PaLM) Api account"
Configure voice settings:
Choose from available voices: Kore (professional), Aoede (conversational), Laomedeia (energetic)
Customize script generation prompts if needed
Test the workflow:
Upload a sample PDF file
Verify audio output quality
Adjust voice settings as preferred
How to customize this workflow
Modify script style:** Edit the prompt in the "Generate Podcast Script" node to change tone, length, or format
Change voice:** Update the voice name in "Prepare TTS Request" node
Add preprocessing:** Insert text cleaning nodes before script generation
Integrate with storage:** Connect to Google Drive, Dropbox, or other storage services
Add notifications:** Include Slack or email notifications when podcasts are ready
Note: This template requires Google Gemini API access and works best with text-based PDF files under 10MB.