Home/Browse/Convert PDF documents to AI podcasts with Google Gemini and text-to-speech

Convert PDF documents to AI podcasts with Google Gemini and text-to-speech

n8n8 modulesv1.0

SlackGoogle DriveDropboxGemini

Convert PDF documents to AI podcasts with Google Gemini and text-to-speech workflow diagram

About this workflow

Convert PDF documents to AI-generated podcasts with Google Gemini and Text-to-Speech Transform any PDF document into an engaging, natural-sounding podcast using Google's Gemini AI and advanced Text-to-Speech technology. This automated workflow extracts text content, generates conversational scripts, and produces high-quality audio files. Who is this for? This workflow template is perfect for content creators, educators, researchers, and marketing professionals who want to repurpose written content into audio format. Ideal for creating podcast episodes, educational content, or making documents more accessible. What problem does this solve? Converting written documents to engaging audio content manually is time-consuming and requires scriptwriting skills. This workflow automates the entire process, turning static PDFs into dynamic, conversational podcasts that sound natural and engaging. What this workflow does 1. Extracts text from uploaded PDF documents 2. Generates podcast script using Google Gemini AI with conversational tone 3. Converts script to speech using Google's advanced TTS with customizable voices 4. Processes audio into properly formatted WAV files 5. Saves final podcast ready for distribution Setup 1. Obtain API credentials: - Get Google Gemini API key from AI Studio - Configure credentials in n8n as "Google Gemini(PaLM) Api account" 2. Configure voice settings: - Choose from available voices: Kore (professional), Aoede (conversational), Laomedeia (energetic) - Customize script generation prompts if needed 3. Test the workflow: - Upload a sample PDF file - Verify audio output quality - Adjust voice settings as preferred How to customize this workflow - Modify script style: Edit the prompt in the "Generate Podcast Script" node to change tone, length, or format - Change voice: Update the voice name in "Prepare TTS Request" node - Add preprocessing: Insert text cleaning nodes before script generation - Integrate with storage: Connect to Google Drive, Dropbox, or other storage services - Add notifications: Include Slack or email notifications when podcasts are ready Note: This template requires Google Gemini API access and works best with text-based PDF files under 10MB.

How to import this n8n workflow

1
Download the workflow JSON file after purchase.
2
Open n8n → click the menu → Import from File.
3
Select the downloaded JSON and import.
4
Set up credentials for each node that requires them.
5
Click Execute Workflow to test, then activate.

Setup guide

Setup guide included

Purchase to unlock the full step-by-step guide

Reviews

No reviews yet

Be the first to buy and share your experience.

Leave a review

Free

No ratings yet

Create a free account to purchase workflows.

JSON blueprint — instant download
Setup guide PDF included
5 downloads · valid 30 days
Works with n8n

Convert PDF documents to AI podcasts with Google Gemini and text-to-speech

About this workflow

How to import this n8n workflow

Setup guide

Step 1: Connect your apps

Step 2: Import the blueprint

Reviews