ElevenLabs TTS Models
Overview
EVE uses ElevenLabs Turbo v2.5 by default for text-to-speech. This model is specifically optimized for real-time conversational AI.
⚠️ Important: V3 Alpha Not Recommended for EVE
According to ElevenLabs documentation:
"Eleven v3 is not made for real-time applications like Agents Platform."
While V3 offers the highest quality, it is:
- ❌ Not optimized for real-time conversation
- ❌ Higher latency - Slower response times
- ❌ Requires multiple generations - Need to generate several versions and pick the best
- ✅ Best for: Audiobooks, character discussions, pre-recorded content
Current Default Model
Default: eleven_turbo_v2_5
This model is optimized for EVE and provides:
- ✅ Fast generation speed
- ✅ High-quality natural voices
- ✅ Low latency for real-time conversation (~100-300ms)
- ✅ Cost-effective
- ✅ Multilingual support
- ✅ Recommended by ElevenLabs for conversational AI
Available Models
ElevenLabs offers several models you can use:
Turbo Models (Recommended)
eleven_turbo_v2_5 (Current Default)
- Latest turbo model
- Excellent quality with fast generation
- Best for conversational AI
- Low latency
eleven_turbo_v2
- Previous turbo version
- Still high quality
- Slightly older technology
Multilingual Models
eleven_multilingual_v2
- Supports 29+ languages
- High quality across languages
- Slower than turbo but more versatile
eleven_multilingual_v1
- Original multilingual model
- Stable and reliable
- Good for non-English content
Monolingual Models
eleven_monolingual_v1
- English only
- High quality
- Original ElevenLabs model
- More expensive than turbo
Flash Models
eleven_flash_v2_5
- Ultra-fast generation
- Lowest latency
- Good quality
- Best for real-time applications
eleven_flash_v2
- Previous flash version
- Very fast
- Lower cost
Changing the Model
The model is configurable in the settings store:
// In settingsStore.ts
ttsModel: 'eleven_turbo_v2_5' // Default
To change:
setTtsModel('eleven_flash_v2_5') // For lower latency
setTtsModel('eleven_multilingual_v2') // For better multilingual support
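As a rough illustration, a typed setter can reject misspelled model IDs at compile time. The sketch below is an assumption about how such a store could look: `TtsModel`, `createSettingsStore` and `getTtsModel` are hypothetical names, and the real settingsStore.ts may be built on a state library and differ in shape.

```typescript
// Hypothetical sketch; the real settingsStore.ts may differ.
// Model IDs are the ones listed in this document.
const TTS_MODELS = [
  "eleven_turbo_v2_5",
  "eleven_turbo_v2",
  "eleven_multilingual_v2",
  "eleven_multilingual_v1",
  "eleven_monolingual_v1",
  "eleven_flash_v2_5",
  "eleven_flash_v2",
] as const;

// Union of the literal IDs above, so a typo fails type-checking.
type TtsModel = (typeof TTS_MODELS)[number];

const DEFAULT_TTS_MODEL: TtsModel = "eleven_turbo_v2_5";

interface SettingsStore {
  getTtsModel(): TtsModel;
  setTtsModel(model: TtsModel): void;
}

function createSettingsStore(): SettingsStore {
  let ttsModel: TtsModel = DEFAULT_TTS_MODEL;
  return {
    getTtsModel: () => ttsModel,
    setTtsModel: (model) => {
      ttsModel = model;
    },
  };
}

const store = createSettingsStore();
store.setTtsModel("eleven_flash_v2_5"); // switch to the lower-latency model
```

With this shape, a call such as `store.setTtsModel("eleven_turbo_v3")` would not compile, which surfaces an unsupported ID before any request is made.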
Model Characteristics
Speed Comparison
- Flash - Fastest (< 300ms)
- Turbo - Very Fast (< 500ms)
- Multilingual - Fast (< 1s)
- Monolingual - Standard (1-2s)
Quality Comparison
- Monolingual - Highest quality
- Turbo v2.5 - Excellent quality
- Multilingual v2 - Great quality
- Flash - Good quality
Cost Comparison
- Flash - Most economical
- Turbo - Cost-effective
- Multilingual - Standard pricing
- Monolingual - Premium pricing
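The three comparisons above can be folded into one lookup table. The following is only a sketch: the ranks are transcribed from the lists in this section (1 = best in that category), simplified to one entry per model family, and `pickFamily` with its weights is a hypothetical helper, not part of EVE.

```typescript
// Ranks transcribed from the comparison lists above (1 = best in category).
// Family names and pickFamily are illustrative, not part of EVE.
type Family = "flash" | "turbo" | "multilingual" | "monolingual";

interface Ranking {
  speed: number;
  quality: number;
  cost: number;
}

const RANKINGS: Record<Family, Ranking> = {
  flash: { speed: 1, quality: 4, cost: 1 },
  turbo: { speed: 2, quality: 2, cost: 2 },
  multilingual: { speed: 3, quality: 3, cost: 3 },
  monolingual: { speed: 4, quality: 1, cost: 4 },
};

// Return the family with the lowest weighted rank sum.
// Ties go to the family listed first in RANKINGS.
function pickFamily(weights: Ranking): Family {
  let best: Family = "turbo";
  let bestScore = Infinity;
  for (const family of Object.keys(RANKINGS) as Family[]) {
    const r = RANKINGS[family];
    const score =
      r.speed * weights.speed +
      r.quality * weights.quality +
      r.cost * weights.cost;
    if (score < bestScore) {
      best = family;
      bestScore = score;
    }
  }
  return best;
}

pickFamily({ speed: 1, quality: 0, cost: 0 }); // "flash"
pickFamily({ speed: 0, quality: 1, cost: 0 }); // "monolingual"
```

A rank sum is a deliberately crude scoring rule; it only reflects the ordering given in this document, not measured latency or price.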
Recommended Use Cases
Real-Time Conversation (Default)
Model: eleven_turbo_v2_5
Speed: 1.0x
Stability: 50%
Clarity: 75%
Best balance for EVE assistant
Ultra-Low Latency
Model: eleven_flash_v2_5
Speed: 1.0x
Stability: 60%
Clarity: 80%
For instant responses
Maximum Quality
Model: eleven_monolingual_v1
Speed: 1.0x
Stability: 70%
Clarity: 85%
For professional content
Multilingual
Model: eleven_multilingual_v2
Speed: 1.0x
Stability: 55%
Clarity: 75%
For non-English languages
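The presets above could be kept as data and converted into request parameters on demand. This sketch assumes that "Clarity" corresponds to `similarity_boost`, which matches the values in the API call in this document (Clarity 75% and `similarity_boost: 0.75`). `PresetName`, `PRESETS` and `toVoiceRequest` are hypothetical names, and how EVE applies the Speed value is not specified here, so it is stored but not sent.

```typescript
// Presets transcribed from the use cases above.
// Assumption: "Clarity" maps to similarity_boost, as the API call
// in this document suggests.
type PresetName = "realtime" | "ultraLowLatency" | "maxQuality" | "multilingual";

interface Preset {
  model: string;
  speed: number; // 1.0 = normal speed; not sent in the request below
  stabilityPct: number;
  clarityPct: number;
}

const PRESETS: Record<PresetName, Preset> = {
  realtime: { model: "eleven_turbo_v2_5", speed: 1.0, stabilityPct: 50, clarityPct: 75 },
  ultraLowLatency: { model: "eleven_flash_v2_5", speed: 1.0, stabilityPct: 60, clarityPct: 80 },
  maxQuality: { model: "eleven_monolingual_v1", speed: 1.0, stabilityPct: 70, clarityPct: 85 },
  multilingual: { model: "eleven_multilingual_v2", speed: 1.0, stabilityPct: 55, clarityPct: 75 },
};

// Convert a preset's percentages into the 0-1 range used by voice_settings.
function toVoiceRequest(name: PresetName) {
  const preset = PRESETS[name];
  return {
    model_id: preset.model,
    voice_settings: {
      stability: preset.stabilityPct / 100,
      similarity_boost: preset.clarityPct / 100,
    },
  };
}

const request = toVoiceRequest("realtime");
// request.model_id === "eleven_turbo_v2_5"
// request.voice_settings: { stability: 0.5, similarity_boost: 0.75 }
```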
Technical Details
API Call
await client.textToSpeech.convert(voiceId, {
  text: "Hello, how can I help you?",
  model_id: "eleven_turbo_v2_5",
  voice_settings: {
    stability: 0.5,
    similarity_boost: 0.75,
    style: 0.0,
    use_speaker_boost: true
  }
})
Model Selection Flow
- User sends message
- EVE responds
- User clicks 🔊 speaker icon
- TTSControls reads ttsModel from settings
- Passes to TTS Manager
- TTS Manager calls ElevenLabs with model ID
- Audio generated and played
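The hand-off in the middle of this flow, where the model ID is read from settings and passed to the TTS Manager, might look roughly like the sketch below. The `Settings` and `TtsManager` interfaces and the `onSpeakerClick` handler are hypothetical shapes chosen for illustration; EVE's actual TTSControls component may be structured differently.

```typescript
// Hypothetical shapes; EVE's actual TTSControls and TTS Manager may differ.
interface Settings {
  ttsModel: string;
}

interface TtsManager {
  speak(text: string, modelId: string): Promise<void>;
}

// Read the configured model and hand it to the manager together with the text.
async function onSpeakerClick(
  text: string,
  settings: Settings,
  manager: TtsManager,
): Promise<void> {
  await manager.speak(text, settings.ttsModel);
}
```

Passing the model ID as an argument, rather than letting the manager read the store itself, keeps the manager easy to test with any ID.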
Fallback Behavior
If the ElevenLabs model fails or is unavailable, EVE:
- Falls back to the browser Web Speech API
- Logs a warning in the console
- Continues with free browser TTS
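One way to express this fallback is a small wrapper around two speak functions. This is a sketch under assumptions: `speakWithFallback` is a hypothetical helper, and both speak functions are injected so the example stays independent of any SDK or browser API. In EVE they would presumably wrap the ElevenLabs call and the Web Speech API.

```typescript
// Hypothetical helper; the two speak functions are injected so the sketch
// does not depend on the ElevenLabs SDK or on browser globals.
type Speak = (text: string) => Promise<void>;

async function speakWithFallback(
  text: string,
  primary: Speak,
  fallback: Speak,
  warn: (message: string) => void = console.warn,
): Promise<"primary" | "fallback"> {
  try {
    await primary(text);
    return "primary";
  } catch (err) {
    // Log a warning, then continue with the fallback engine.
    const reason = err instanceof Error ? err.message : String(err);
    warn(`ElevenLabs TTS failed (${reason}); falling back to browser TTS`);
    await fallback(text);
    return "fallback";
  }
}
```

Returning which engine was used lets the caller show an indicator or record how often the fallback is triggered.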
Future Enhancements
Planned Features
- Model selector in UI - Dropdown to choose model in Settings
- Auto-detect best model - Based on language and use case
- Model presets - Quick selection for different scenarios
- Cost tracking - Show estimated cost per request
- Quality metrics - User feedback on voice quality
Potential Models
As ElevenLabs releases new models, EVE can be updated:
- eleven_turbo_v3 - Next generation turbo
- eleven_flash_v3 - Even faster flash model
- eleven_multilingual_v3 - Improved multilingual
- Specialized models for specific use cases
Troubleshooting
Audio Not Playing
- Check that the ElevenLabs API key is valid
- Verify that the model ID is correct
- Check console for error messages
- Try switching to eleven_turbo_v2 if v2.5 fails
Poor Quality
- Try eleven_monolingual_v1 for better quality
- Adjust stability and clarity settings
- Check voice selection
- Ensure text is well-formatted
Slow Generation
- Switch to eleven_flash_v2_5 for speed
- Reduce text length
- Check network connection
- Verify that the API quota has not been exceeded
Model Not Found Error
Error: Model 'eleven_turbo_v3' not found
- Model ID may be incorrect
- Model might not be available on your plan
- Fall back to eleven_turbo_v2_5
- Check ElevenLabs documentation
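A client-side guard can catch an unknown ID before the request is sent. The sketch below is an assumption, not EVE's actual code: `resolveModelId` is a hypothetical helper, and its list contains only the model IDs named in this document, so it cannot tell whether a model is available on a given plan.

```typescript
// Hypothetical guard; the list mirrors the model IDs named in this document
// and does not reflect plan-specific availability.
const DEFAULT_MODEL = "eleven_turbo_v2_5";

const KNOWN_MODELS = new Set<string>([
  "eleven_turbo_v2_5",
  "eleven_turbo_v2",
  "eleven_multilingual_v2",
  "eleven_multilingual_v1",
  "eleven_monolingual_v1",
  "eleven_flash_v2_5",
  "eleven_flash_v2",
]);

// Return the requested ID if it is known, otherwise the default model.
function resolveModelId(requested: string): string {
  if (KNOWN_MODELS.has(requested)) {
    return requested;
  }
  console.warn(`Unknown TTS model '${requested}', using ${DEFAULT_MODEL}`);
  return DEFAULT_MODEL;
}

resolveModelId("eleven_turbo_v3"); // "eleven_turbo_v2_5"
resolveModelId("eleven_flash_v2_5"); // "eleven_flash_v2_5"
```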
Model Changelog
v2.5 Models (Current)
- Released: 2024
- Improvements: Better quality, faster generation
- Models: eleven_turbo_v2_5, eleven_flash_v2_5
v2 Models
- Released: 2023
- Improvements: Multilingual support, reduced latency
- Models: eleven_turbo_v2, eleven_flash_v2, eleven_multilingual_v2
v1 Models (Legacy)
- Released: 2022-2023
- Original high-quality models
- Models: eleven_monolingual_v1, eleven_multilingual_v1
Current Default: eleven_turbo_v2_5
Status: ✅ Configured
Version: v0.2.0-rc
Date: October 5, 2025