Files

Aodhan Collins 66749a5ce7 Initial commit

2025-10-06 00:33:04 +01:00

4.9 KiB

Raw Blame History

ElevenLabs Voice Integration

Overview

EVE now automatically fetches and displays all available ElevenLabs voices from your account when you configure your API key.

Features

Automatic Voice Discovery

Fetches complete voice list from ElevenLabs API
Updates automatically when API key is configured
Shows loading state while fetching
Graceful error handling if API fails

Voice Details

Each voice includes:

Name - The voice's display name
Voice ID - Unique identifier
Category - Voice category (premade, cloned, etc.)
Labels - Metadata including:
- Accent (e.g., "American", "British")
- Age (e.g., "young", "middle-aged")
- Gender (e.g., "male", "female")
- Use case (e.g., "narration", "conversational")
Description - Voice description
Preview URL - Audio preview (future feature)

Voice Selection UI

Grouped Categories:

ElevenLabs Voices (Premium) - All your ElevenLabs voices with rich details
Browser Voices (Free) - System text-to-speech voices

Display Format:

Rachel - American (young)
Adam - American (middle-aged)
Antoni - British (young)

Automatic Provider Detection

The system automatically detects which provider to use based on voice selection:

Voice IDs prefixed with elevenlabs: → ElevenLabs TTS
Voice IDs prefixed with browser: → Browser TTS
default → Browser TTS fallback

How It Works

1. API Key Configuration

When you enter your ElevenLabs API key in Settings:

API key is saved to settings store
useEffect hook triggers voice fetching
Loading state is shown
Voices are fetched from ElevenLabs API
Voices populate the dropdown

2. Voice Selection

User selects a voice from dropdown
Voice ID is saved with provider prefix (e.g., elevenlabs:21m00Tcm4TlvDq8ikWAM)
Prefix is stored in settings

3. Playback

User clicks speaker icon on message
TTS manager parses voice ID prefix
Correct provider is initialized
Audio is generated and played

Code Architecture

Components

SettingsPanel - Fetches and displays voices
TTSControls - Initializes client and plays audio

Libraries

elevenlabs.ts - ElevenLabs API client with getVoices() method
tts.ts - TTS manager with automatic provider detection

Data Flow

Settings Panel
    ↓
[ElevenLabs API Key Entered]
    ↓
useEffect Hook Triggered
    ↓
getElevenLabsClient(apiKey)
    ↓
client.getVoices()
    ↓
ElevenLabs API
    ↓
Voice List Returned
    ↓
Populate Dropdown
    ↓
User Selects Voice
    ↓
Save with Prefix (elevenlabs:VOICE_ID)
    ↓
TTSControls Plays Message
    ↓
Parse Prefix → Use ElevenLabs
    ↓
Audio Playback

API Response Example

{
  voices: [
    {
      voice_id: "21m00Tcm4TlvDq8ikWAM",
      name: "Rachel",
      category: "premade",
      labels: {
        accent: "American",
        age: "young",
        gender: "female",
        use_case: "narration"
      },
      description: "A calm and professional female voice",
      preview_url: "https://..."
    },
    // ... more voices
  ]
}

Error Handling

No API Key

Shows: "Add ElevenLabs API key above to access premium voices"
Falls back to browser voices

Invalid API Key

Shows: "Failed to load ElevenLabs voices. Check your API key."
Error message in red text
Falls back to browser voices

Network Error

Logs error to console
Shows user-friendly error message
Maintains browser voices as fallback

Future Enhancements

Voice Preview

Click to hear voice sample before selecting
Uses preview_url from API response

Voice Filtering

Filter by accent
Filter by age
Filter by gender
Filter by use case

Custom Voice Upload

Support for cloned voices
Voice cloning interface

Voice Settings per Character

Different voices for different AI personalities
Character-specific voice preferences

Testing

To Test Voice Fetching

Open Settings
Enter valid ElevenLabs API key
Enable TTS
Wait for "Loading voices..." to complete
Open TTS Voice Selection dropdown
Verify ElevenLabs voices appear with details

To Test Voice Playback

Select an ElevenLabs voice
Save settings
Send a message to EVE
Click speaker icon on response
Verify audio plays with selected voice

To Test Fallback

Select an ElevenLabs voice
Remove API key
Click speaker icon
Verify fallback to browser TTS with warning message

Benefits

✅ No Manual Configuration - Voices auto-populate
✅ Always Up-to-Date - Gets latest voices from your account
✅ Rich Information - See voice details before selecting
✅ Smart Fallback - Gracefully handles errors
✅ User-Friendly - Clear feedback at every step
✅ Flexible - Mix ElevenLabs and browser voices

Implementation Complete: October 5, 2025
Status: Production Ready ✅

4.9 KiB Raw Blame History