Configuring Voice Settings
Choose voices, enable expressive mode, adjust silence detection, interruption sensitivity, and more.
Overview
Voice settings control how your AI sounds and behaves during calls. Fine-tuning these settings helps create a natural, professional experience for your callers.
Choosing Between Standard and Ultra Voices
AIVO offers two tiers of voice quality:
Standard Voices
- Available on all plans.
- Clear, professional speech.
- Multiple male and female options with various accents.
- Lower latency (faster response time).
Ultra Voices
- Available on Professional and Enterprise plans.
- More natural intonation and rhythm.
- Better handling of complex sentences.
- Slightly higher latency (adds roughly 200ms).
How to switch:
- Go to Voice & AI > Voice Settings.
- Toggle Ultra Quality on or off.
- Browse the voice library - voices marked with a star icon are Ultra.
- Click Preview to hear a sample, then Select to apply.
Expressive Mode Explained
Expressive mode adds natural emotional variation to the AI's speech. Instead of a flat, monotone delivery, the AI adjusts its tone based on context:
- Warm and friendly for greetings.
- Empathetic for apologies or issues.
- Upbeat when confirming appointments.
Enable it: Voice & AI > Voice Settings > toggle Expressive Mode.
Note: Expressive mode works best with Ultra voices. Standard voices support a limited version.
Setting Silence Detection Timeout
Silence detection controls how long the AI waits for the caller to speak before prompting them.
- Default: 5 seconds.
- Short (3 seconds): Good for fast-paced interactions.
- Long (8 seconds): Better for callers who may need extra time (elderly, ESL speakers).
Adjust it: Voice & AI > Advanced > Silence Timeout.
Adjusting Interruption Sensitivity
This controls how easily a caller can interrupt the AI while it is speaking.
- High sensitivity: The AI stops speaking as soon as it detects the caller's voice. Best for conversational, back-and-forth interactions.
- Medium sensitivity (default): The AI pauses after a short burst of caller speech. Balances responsiveness with avoiding false triggers from background noise.
- Low sensitivity: The AI finishes its current sentence before pausing. Best for noisy environments or when the AI is delivering important information.
Adjust it: Voice & AI > Advanced > Interruption Sensitivity.
Max Call Duration Settings
Set a maximum length for calls to prevent unusually long sessions:
- Default: 15 minutes.
- Range: 1 to 60 minutes.
- When the limit is reached, the AI politely wraps up: "I want to make sure I've been helpful. Is there anything else before we end the call?"
Adjust it: Voice & AI > Advanced > Max Duration.
TTS and STT Provider Options
AIVO supports multiple text-to-speech (TTS) and speech-to-text (STT) engines:
TTS Providers
- AIVO Default - Optimized for low latency and natural speech.
- ElevenLabs - Premium voices with superior expressiveness (Enterprise only).
STT Providers
- AIVO Default - Fast, accurate transcription.
- Deepgram - Enhanced accuracy for noisy environments (Enterprise only).
Change providers: Voice & AI > Advanced > TTS Provider / STT Provider.
Most businesses get excellent results with the default providers. Only switch if you have a specific need.
Was this article helpful?