This workflow is designed for businesses or developers looking to integrate voice-based chat applications with dynamic responses and conversational memory.
It automates AI-powered voice conversations, maintaining context between sessions and converting speech to text and text back to speech.
The workflow receives audio input, transcribes it using OpenAI, and processes the conversation using the Google Gemini Chat Model (you can use the OpenAI Chat Model instead). Responses are converted back to speech using ElevenLabs.
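The flow described above (transcribe, chat with memory, synthesize) can be pictured with a small conceptual sketch. This is not how n8n implements it; the workflow uses nodes rather than code. The class name `VoiceChat`, the `session_id` key, and the three injected callables are hypothetical stand-ins for the OpenAI transcription, Gemini chat, and ElevenLabs speech steps.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

Message = Tuple[str, str]  # (role, text)


class VoiceChat:
    """Conceptual sketch: speech -> text -> chat with memory -> speech.

    The three callables are placeholders for external services; no real
    API is called here.
    """

    def __init__(
        self,
        transcribe: Callable[[bytes], str],
        chat: Callable[[List[Message]], str],
        synthesize: Callable[[str], bytes],
    ) -> None:
        self.transcribe = transcribe
        self.chat = chat
        self.synthesize = synthesize
        # One message history per session, so context carries across calls.
        self.memory: Dict[str, List[Message]] = defaultdict(list)

    def handle(self, session_id: str, audio: bytes) -> bytes:
        text = self.transcribe(audio)            # speech-to-text
        history = self.memory[session_id]
        history.append(("user", text))
        reply = self.chat(list(history))         # model sees the full history
        history.append(("assistant", reply))
        return self.synthesize(reply)            # text-to-speech


if __name__ == "__main__":
    # Stub services, for illustration only.
    bot = VoiceChat(
        transcribe=lambda audio: audio.decode("utf-8"),
        chat=lambda history: f"You have sent {len(history) // 2 + 1} message(s).",
        synthesize=lambda text: text.encode("utf-8"),
    )
    print(bot.handle("session-1", b"hello"))
    print(bot.handle("session-1", b"hello again"))
```

Keeping the history keyed by a session identifier is what lets a later message be answered in the context of earlier ones.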
You'll need API keys for:
Google AI Studio
The value (voice_message) in the "Path" parameter of the Webhook node is used as the name of the parameter that will contain the voice message you are sending via the HTTP POST request.
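As a rough illustration of the note above, here is a minimal sketch of a client that sends an audio file in a field named `voice_message`. It assumes the Webhook accepts a multipart/form-data upload; the URL, file name, and MIME type are hypothetical placeholders, and only the field name comes from the workflow description.

```python
import uuid
import urllib.request

# Field name taken from the workflow's Webhook "Path" parameter.
FIELD_NAME = "voice_message"


def build_voice_request(
    url: str,
    audio: bytes,
    filename: str = "message.mp3",      # hypothetical file name
    content_type: str = "audio/mpeg",   # assumed audio format
) -> urllib.request.Request:
    """Build an HTTP POST that carries the audio as a multipart form field."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{FIELD_NAME}"; '
        f'filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        url,
        data=head + audio + tail,
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )


if __name__ == "__main__":
    # Placeholder URL; replace it with your own Webhook URL before sending.
    req = build_voice_request(
        "https://n8n.example.com/webhook/voice_message", b"fake-audio-bytes"
    )
    print(req.get_method(), req.full_url)
    # To actually send the request:
    # with urllib.request.urlopen(req) as resp:
    #     reply_audio = resp.read()
```

If you change the "Path" value in the Webhook node, the field name in the request would need to change with it.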