AI Voice Chat using Webhook, Memory Manager, OpenAI, Google Gemini & ElevenLabs

Go to Workflow
34,325 views
Built by Ayoub Ayoub
Created on June 08, 2026

Description

Who is this for?
This workflow is designed for businesses or developers looking to integrate voice-based chat applications with dynamic responses and conversational memory.

What problem does this solve?
It automates AI-powered voice conversations, maintaining context between sessions and converting speech-to-text and text-to-speech.

What this workflow does:
The workflow receives audio input, transcribes it using OpenAI, and processes the conversation using Google Gemini Chat Model (you can use OpenAI Chat Model). Responses are converted back to speech using ElevenLabs.

Prerequisites:
You'll need API keys for:
OpenAI (you can obtain it from OpenAI website)
ElevenLabs (you can obtain it from their website)
Google Gemini (You can obtain it from Google AI Studio)

Setup:
Configure you API keys
Ensure that the value (voice_message) in the "Path" parameter in the Webhook node is used as the name of the parameter that will contain the voice message you are sending via the HTTP Post request.

Nodes Used (6)

Basic LLM Chain
@n8n/n8n-nodes-langchain.chainLlm
Chat Memory Manager
@n8n/n8n-nodes-langchain.memoryManager
Google Gemini Chat Model
@n8n/n8n-nodes-langchain.lmChatGoogleGemini
HTTP Request
n8n-nodes-base.httpRequest
OpenAI
@n8n/n8n-nodes-langchain.openAi
Simple Memory
@n8n/n8n-nodes-langchain.memoryBufferWindow