Overview
In 0.32.0 Talemate's TTS (Text-to-Speech) agent has been completely refactored to provide advanced voice capabilities including per-character voice assignment, speaker separation, and support for multiple local and remote APIs. The voice system now includes a comprehensive voice library for managing and organizing voices across all supported providers.
Key Features
- Per-character voice assignment - Each character can have their own unique voice
- Speaker separation - Automatic detection and separation of dialogue from narration
- Voice library management - Centralized management of all voices across providers
- Multiple API support - Support for both local and remote TTS providers
- Director integration - Automatic voice assignment for new characters
Supported APIs
Local APIs
- Kokoro - Fastest generation with predefined voice models and mixing
- F5-TTS - Fast voice cloning with occasional mispronunciations
- Chatterbox - High-quality voice cloning (slower generation)
Remote APIs
- ElevenLabs - Professional voice synthesis with voice cloning
- Google Gemini-TTS - Google's text-to-speech service
- OpenAI - OpenAI's TTS-1 and TTS-1-HD models
Enable the Voice agent
Start by enabling the voice agent, if it is currently disabled.
If your voice agent is disabled - indicated by the grey dot next to the agent - you can enable it by clicking on the agent and checking the Enable
checkbox near the top of the agent settings.
Ctrl click to toggle agent
You can use Ctrl click to toggle the agent on and off.
Voice Library Management
Voices are managed through the Voice Library, accessible from the main application bar. The Voice Library allows you to:
- Add and organize voices from all supported providers
- Assign voices to specific characters
- Create mixed voices (Kokoro)
- Manage both global and scene-specific voice libraries
See the Voice Library Guide for detailed instructions.
Character Voice Assignment
Characters can have individual voices assigned through the Voice Library. When a character has a voice assigned:
- Their dialogue will use their specific voice
- The narrator voice is used for exposition in their messages (with speaker separation enabled)
- If their assigned voice's API is not available, it falls back to the narrator voice
The Voice agent status will show all assigned character voices and their current status.