The Voice Mode action allows your users to have spoken conversations with Onyx using speech-to-text and text-to-speech
providers.
Setting up Voice Mode
Navigate to the Voice Mode Dashboard
Click your user profile icon and select Admin Panel, then select the Voice Mode tab in the sidebar.
Set up Speech-to-Text (STT) provider
Configure a provider to transcribe user speech into text.
OpenAI Whisper
Azure STT
ElevenLabs
Get an OpenAI API key
Navigate to the OpenAI dashboard and create a new API key, or
reuse an existing OpenAI API key already configured for your LLM provider. Enter the API key
Click Connect and enter the API key.
Get Azure Speech credentials
Navigate to the Azure Portal and create a Speech resource. Copy the API key
and region. Enter the credentials
Click Connect and enter the API key, region, and target URI.
Get an ElevenLabs API key
Enter the API key
Click Connect and enter the API key.
Set up Text-to-Speech (TTS) provider
Configure a provider to convert AI responses into spoken audio.
OpenAI TTS-1
OpenAI TTS-1 HD
Azure TTS
ElevenLabs
Get an OpenAI API key
Navigate to the OpenAI dashboard and create a new API key
(or reuse the one from Whisper). Enter the API key
Click Connect and enter the API key.
Get an OpenAI API key
Navigate to the OpenAI dashboard and create a new API key
(or reuse the one from Whisper). Enter the API key
Click Connect and enter the API key.
TTS-1 HD provides higher quality audio than TTS-1 but may have slightly higher latency and cost.
Get Azure Speech credentials
Navigate to the Azure Portal and create a Speech resource. Copy the API key
and region. Enter the credentials
Click Connect and enter the API key, region, and target URI.
Get an ElevenLabs API key
Enter the API key
Click Connect and enter the API key.
Once your TTS provider is configured, select a voice from the dropdown or input a voice ID directly.
Set Default Provider
If you have multiple providers configured, make sure to Set as Default for both your preferred STT and TTS
provider.
Make sure your users have microphone access enabled in their browser to use Voice Mode.