Voice Cloning Workflow - Zyphra Zonos API

Who is this for?

This workflow is designed for developers, content creators, and businesses that want to automate high-quality voice synthesis using AI voice cloning.

What problem does this solve?

It generates natural-sounding speech from text using a sample voice file. This removes the need for manual voice recording and gives consistent voice output for applications such as audiobooks, virtual assistants, or content localization.

What this workflow does

The workflow receives text and voice cloning parameters via webhook, reads a sample voice file from your storage, sends the data to Zyphra's Zonos API for voice synthesis, and saves the generated audio file to the output location you specify.

Prerequisites

You'll need:
- A Zyphra API key
- A registered account with Zyphra
- A sample voice file stored on accessible disk or cloud storage
- A running n8n instance with webhook capabilities

Setup

1. Configure your Zyphra API key in the "Call Zyphra Clone API" node under Header Parameters (Name: X-API-Key, Value: your-api-key).
2. Ensure your sample voice files are accessible at the paths you'll specify.
3. Confirm that the webhook endpoint is reachable.

Supported Audio Formats

The API supports multiple output formats through the mimetype parameter:
- WebM (API default) - audio/webm
- Ogg - audio/ogg
- WAV - audio/wav
- MP3 - audio/mp3 or audio/mpeg
- MP4/AAC - audio/mp4 or audio/aac

Usage Example

Endpoint: POST to your n8n webhook URL
Headers: Content-Type: application/json
Request Body:

    {
      "text": "Hello there! This voice sounds just like the sample!",
      "speakingrate": 18,
      "samplevoicepath": "/data/output/sampleVoice.wav",
      "outputpath": "/data/output/",
      "languageisocode": "en-us",
      "mimetype": "audio/wav",
      "model": "zonos-v0.1-transformer",
      "emotion": {
        "happiness": 0.8,
        "neutral": 0.3,
        "sadness": 0.05,
        "disgust": 0.05,
        "fear": 0.05,
        "surprise": 0.05,
        "anger": 0.05,
        "other": 0.5
      }
    }

Parameters

Required Parameters
- text: Text to synthesize into speech
- samplevoicepath: Path to your voice sample file
- outputpath: Directory where the generated audio will be saved

Optional Parameters (with defaults)
- speakingrate: Speech speed (default: 15)
- languageisocode: Language code (default: "en-us")
- mimetype: Output audio format (default in this workflow: "audio/wav")
- model: AI model to use (default: "zonos-v0.1-transformer")
- emotion: Object with emotion levels, each on a 0-1 scale

Importing the workflow

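
As a rough illustration of the request described above, the following Python sketch builds a request body from the documented parameters and posts it to the webhook. It uses only the standard library. The webhook URL, the helper names build_payload and send, and the validation rules are assumptions made for this example. They are not part of the workflow, and the source does not say what the webhook returns.

```python
import json
import urllib.request

# Hypothetical URL: the real webhook path depends on your n8n instance.
WEBHOOK_URL = "http://localhost:5678/webhook/voice-clone"

REQUIRED = ("text", "samplevoicepath", "outputpath")

# Defaults as listed under "Optional Parameters (with defaults)".
DEFAULTS = {
    "speakingrate": 15,
    "languageisocode": "en-us",
    "mimetype": "audio/wav",
    "model": "zonos-v0.1-transformer",
}

# MIME types listed under "Supported Audio Formats".
SUPPORTED_MIMETYPES = {
    "audio/webm", "audio/ogg", "audio/wav",
    "audio/mp3", "audio/mpeg", "audio/mp4", "audio/aac",
}


def build_payload(text, samplevoicepath, outputpath, emotion=None, **overrides):
    """Merge caller values over the documented defaults and check them."""
    payload = dict(DEFAULTS)
    payload.update(overrides)
    payload.update(text=text, samplevoicepath=samplevoicepath, outputpath=outputpath)

    for name in REQUIRED:
        if not payload[name]:
            raise ValueError(f"missing required parameter: {name}")
    if payload["mimetype"] not in SUPPORTED_MIMETYPES:
        raise ValueError(f"unsupported mimetype: {payload['mimetype']}")

    if emotion is not None:
        # Emotion levels are documented as a 0-1 scale.
        for name, level in emotion.items():
            if not 0 <= level <= 1:
                raise ValueError(f"emotion '{name}' must be between 0 and 1")
        payload["emotion"] = dict(emotion)
    return payload


def send(payload, url=WEBHOOK_URL):
    """POST the JSON payload to the webhook; returns (status, raw body)."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status, response.read()
```

A call might look like send(build_payload("Hello there!", "/data/output/sampleVoice.wav", "/data/output/", speakingrate=18)). Validating on the client side is a design choice for this sketch. It catches a bad mimetype or an out-of-range emotion level before the request reaches n8n.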
Download the workflow JSON file after purchase.
Open n8n → click the menu → Import from File.
Select the downloaded JSON and import.
Set up credentials for each node that requires them.
Click Execute Workflow to test, then activate.