elevenlabs logo

elevenlabs

Converts text into speech using a voice of your choice and returns audio.

11 APIs 16 Features

APIs

ElevenLabs Text to Speech API

The ElevenLabs Text to Speech API converts text into lifelike spoken audio with nuanced intonation, pacing, and emotional awareness. It supports multiple output formats includin...

ElevenLabs Speech to Text API

The ElevenLabs Speech to Text API provides state-of-the-art transcription capabilities, converting spoken audio into accurate text. It supports multiple audio formats and langua...

ElevenLabs Voice Cloning API

The ElevenLabs Voice Cloning API allows developers to create custom AI voices from audio recordings. Instant Voice Cloning requires as little as 60 seconds of clean audio to gen...

ElevenLabs Voices API

The ElevenLabs Voices API provides management capabilities for the voice library, including listing, retrieving, creating, editing, and deleting voices. Developers can access a ...

ElevenLabs Sound Effects API

The ElevenLabs Sound Effects API generates cinematic sound effects from text descriptions. Developers can describe the desired sound in natural language and receive high-quality...

ElevenLabs Audio Isolation API

The ElevenLabs Audio Isolation API removes background noise from audio recordings, isolating vocal tracks from ambient sounds and interference. This is useful for cleaning up re...

ElevenLabs Dubbing API

The ElevenLabs Dubbing API enables automatic translation and voice-over of audio and video content into different languages. It preserves the original speaker's voice characteri...

ElevenLabs Voice Changer API

The ElevenLabs Voice Changer API performs speech-to-speech conversion, replacing one voice with another while preserving the original speech content, timing, and emotional deliv...

ElevenLabs Music Generation API

The ElevenLabs Music Generation API creates music from text prompts, allowing developers to generate original musical compositions programmatically. Users describe the desired g...

ElevenLabs Conversational AI API

The ElevenLabs Conversational AI API enables developers to build interactive voice agents that can engage in natural, real-time conversations. It combines speech recognition, la...

ElevenLabs Studio API

The ElevenLabs Studio API provides programmatic access to the ElevenLabs Studio project management system. Developers can create, manage, and render long-form audio content proj...

Features

Free: 10k credits/mo with TTS/STT/Sound effects
Starter at $6/mo: 30k credits, commercial license, IVC
Creator at $11/mo: 121k credits, Professional Voice Cloning
Pro at $99/mo: 600k credits, 44.1kHz PCM
Scale at $299/mo: 1.8M credits, team collaboration
Business at $990/mo: 6M credits, low-latency TTS at 5c/min
Enterprise custom: HIPAA BAA, SSO, custom concurrency
Concurrent: 2 Free, 3 Starter, 5 Creator, 10 Pro, 15 Scale/Business
TTS API with multilingual voices
STT (Scribe) API
Dubbing API for video translation
Voice cloning (Instant + Professional)
Sound Effects generation
Music generation
Conversational AI (voice agents)
WebSocket streaming for low-latency synthesis

Event Specifications

ElevenLabs Conversational AI Events

The ElevenLabs Conversational AI WebSocket API enables real-time, interactive voice conversations with AI agents. It supports bidirectional audio streaming, text events, and con...

ASYNCAPI

ElevenLabs Text to Speech Streaming Events

The ElevenLabs Text to Speech WebSocket API enables bidirectional streaming for text-to-speech conversion. Clients send text chunks incrementally and receive audio chunks as the...

ASYNCAPI

ElevenLabs Webhook Events

The ElevenLabs Webhook system delivers event notifications to configured endpoints when specific actions occur within the platform. This includes post-call webhooks from Convers...

ASYNCAPI

Semantic Vocabularies

Elevenlabs Context

0 classes · 11 properties

JSON-LD

Resources

🔗
JSON-LD
JSON-LD
🔗
JSONSchema
JSONSchema
🔗
JSONSchema
JSONSchema
🔗
JSONSchema
JSONSchema

Sources

Raw ↑
aid: elevenlabs
url: https://raw.githubusercontent.com/api-evangelist/elevenlabs/refs/heads/main/apis.yml
modified: '2026-05-04'
common:
  - type: JSON-LD
    url: json-ld/elevenlabs-context.jsonld
  - type: JSONSchema
    url: json-schema/elevenlabs-voice-schema.json
  - type: JSONSchema
    url: json-schema/elevenlabs-agent-schema.json
  - type: JSONSchema
    url: json-schema/elevenlabs-webhook-event-schema.json
  - type: Features
    data:
      - 'Free: 10k credits/mo with TTS/STT/Sound effects'
      - 'Starter at $6/mo: 30k credits, commercial license, IVC'
      - 'Creator at $11/mo: 121k credits, Professional Voice Cloning'
      - 'Pro at $99/mo: 600k credits, 44.1kHz PCM'
      - 'Scale at $299/mo: 1.8M credits, team collaboration'
      - 'Business at $990/mo: 6M credits, low-latency TTS at 5c/min'
      - 'Enterprise custom: HIPAA BAA, SSO, custom concurrency'
      - 'Concurrent: 2 Free, 3 Starter, 5 Creator, 10 Pro, 15 Scale/Business'
      - TTS API with multilingual voices
      - STT (Scribe) API
      - Dubbing API for video translation
      - Voice cloning (Instant + Professional)
      - Sound Effects generation
      - Music generation
      - Conversational AI (voice agents)
      - WebSocket streaming for low-latency synthesis
    sources:
      - https://elevenlabs.io/pricing
    updated: '2026-05-04'
apis:
  - aid: elevenlabs:text-to-speech
    name: ElevenLabs Text to Speech API
    tags:
      - AI
      - Audio
      - Speech Synthesis
      - Text to Speech
      - Voice
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/api-reference/text-to-speech/convert
    properties:
      - url: https://elevenlabs.io/docs/api-reference/text-to-speech/convert
        type: Documentation
      - url: openapi/elevenlabs-text-to-speech-openapi.yml
        type: OpenAPI
      - url: asyncapi/elevenlabs-text-to-speech-streaming-asyncapi.yml
        type: AsyncAPI
    description: The ElevenLabs Text to Speech API converts text into lifelike spoken audio with nuanced
      intonation, pacing, and emotional awareness. It supports multiple output formats including MP3,
      PCM, and mu-law, and offers a range of models such as Flash v2.5 for ultra-low latency real-time
      applications and Multilingual v2 for support across 70+ languages. Developers can select from thousands
      of pre-built voices or use custom cloned voices to generate speech that sounds natural and expressive.
  - aid: elevenlabs:speech-to-text
    name: ElevenLabs Speech to Text API
    tags:
      - AI
      - Audio
      - Speech to Text
      - Transcription
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/api-reference/speech-to-text/convert
    properties:
      - url: https://elevenlabs.io/docs/api-reference/speech-to-text/convert
        type: Documentation
      - url: openapi/elevenlabs-speech-to-text-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Speech to Text API provides state-of-the-art transcription capabilities,
      converting spoken audio into accurate text. It supports multiple audio formats and languages, enabling
      developers to build applications that require reliable audio transcription. The API is designed
      for both real-time and batch processing use cases.
  - aid: elevenlabs:voice-cloning
    name: ElevenLabs Voice Cloning API
    tags:
      - AI
      - Audio
      - Voice
      - Voice Cloning
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/api-reference/voices/ivc/create
    properties:
      - url: https://elevenlabs.io/docs/api-reference/voices/ivc/create
        type: Documentation
      - url: openapi/elevenlabs-voice-cloning-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Voice Cloning API allows developers to create custom AI voices from audio
      recordings. Instant Voice Cloning requires as little as 60 seconds of clean audio to generate a
      usable voice clone, while Professional Voice Cloning produces higher fidelity results from a minimum
      of 30 minutes of recordings. Cloned voices can then be used with the Text to Speech API for generating
      speech that closely matches the original speaker.
  - aid: elevenlabs:voices
    name: ElevenLabs Voices API
    tags:
      - AI
      - Voice Library
      - Voice Management
      - Voices
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/api-reference/voices/get
    properties:
      - url: https://elevenlabs.io/docs/api-reference/voices/get
        type: Documentation
      - url: openapi/elevenlabs-voices-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Voices API provides management capabilities for the voice library, including
      listing, retrieving, creating, editing, and deleting voices. Developers can access a library of
      over 5,000 pre-built voices and manage their own custom voices. The API also supports voice design,
      allowing creation of new AI voices from text descriptions specifying desired characteristics such
      as accent, age, and tone.
  - aid: elevenlabs:sound-effects
    name: ElevenLabs Sound Effects API
    tags:
      - AI
      - Audio Generation
      - Sound Effects
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/api-reference/sound-generation/create
    properties:
      - url: https://elevenlabs.io/docs/api-reference/sound-generation/create
        type: Documentation
      - url: openapi/elevenlabs-sound-effects-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Sound Effects API generates cinematic sound effects from text descriptions.
      Developers can describe the desired sound in natural language and receive high-quality audio output.
      The API supports audio tags for controlling delivery, emotion, emphasis, pauses, and specific sound
      effects, making it suitable for game development, film production, and multimedia content creation.
  - aid: elevenlabs:audio-isolation
    name: ElevenLabs Audio Isolation API
    tags:
      - Audio Isolation
      - Audio Processing
      - Noise Removal
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/api-reference/audio-isolation/audio-isolation
    properties:
      - url: https://elevenlabs.io/docs/api-reference/audio-isolation/audio-isolation
        type: Documentation
      - url: openapi/elevenlabs-audio-isolation-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Audio Isolation API removes background noise from audio recordings, isolating
      vocal tracks from ambient sounds and interference. This is useful for cleaning up recordings, improving
      audio quality for podcasts and interviews, and preparing audio files for further processing such
      as voice cloning or transcription. The API processes audio files and returns cleaned versions with
      the vocal content preserved.
  - aid: elevenlabs:dubbing
    name: ElevenLabs Dubbing API
    tags:
      - Audio
      - Dubbing
      - Localization
      - Translation
      - Video
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/api-reference/dubbing/resources/dub-segment
    properties:
      - url: https://elevenlabs.io/docs/api-reference/dubbing/resources/dub-segment
        type: Documentation
      - url: openapi/elevenlabs-dubbing-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Dubbing API enables automatic translation and voice-over of audio and
      video content into different languages. It preserves the original speaker's voice characteristics
      while translating the spoken content, supporting seamless localization of multimedia content. The
      API handles the full dubbing pipeline including transcription, translation, and speech synthesis
      with lip-sync timing.
  - aid: elevenlabs:voice-changer
    name: ElevenLabs Voice Changer API
    tags:
      - Audio Processing
      - Voice Changer
      - Voice Conversion
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/api-reference/speech-to-speech/convert
    properties:
      - url: https://elevenlabs.io/docs/api-reference/speech-to-speech/convert
        type: Documentation
      - url: openapi/elevenlabs-voice-changer-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Voice Changer API performs speech-to-speech conversion, replacing one
      voice with another while preserving the original speech content, timing, and emotional delivery.
      Developers can transform audio recordings to sound like a different speaker using any voice from
      the ElevenLabs library or a custom cloned voice. This is useful for content creation, privacy protection,
      and character voice generation.
  - aid: elevenlabs:music
    name: ElevenLabs Music Generation API
    tags:
      - AI
      - Audio Generation
      - Music
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/overview/capabilities/music
    properties:
      - url: https://elevenlabs.io/docs/overview/capabilities/music
        type: Documentation
      - url: openapi/elevenlabs-music-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Music Generation API creates music from text prompts, allowing developers
      to generate original musical compositions programmatically. Users describe the desired genre, mood,
      tempo, and instrumentation in natural language and receive generated audio output. The API is designed
      for applications that need background music, jingles, or custom soundtracks without requiring manual
      composition.
  - aid: elevenlabs:conversational-ai
    name: ElevenLabs Conversational AI API
    tags:
      - AI
      - Conversational AI
      - Real-Time
      - Voice Agents
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/overview/capabilities/conversational-ai
    properties:
      - url: https://elevenlabs.io/docs/overview/capabilities/conversational-ai
        type: Documentation
      - url: openapi/elevenlabs-conversational-ai-openapi.yml
        type: OpenAPI
      - url: asyncapi/elevenlabs-conversational-ai-asyncapi.yml
        type: AsyncAPI
      - url: asyncapi/elevenlabs-webhooks-asyncapi.yml
        type: AsyncAPI
    description: The ElevenLabs Conversational AI API enables developers to build interactive voice agents
      that can engage in natural, real-time conversations. It combines speech recognition, language understanding,
      and speech synthesis into a unified interface supporting multi-turn dialogue across 70+ languages.
      The API is designed for building customer service agents, voice assistants, and interactive voice
      response systems with expressive, human-sounding voices.
  - aid: elevenlabs:studio
    name: ElevenLabs Studio API
    tags:
      - Content Management
      - Projects
      - Studio
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    baseURL: https://api.elevenlabs.io
    humanURL: https://elevenlabs.io/docs/overview/capabilities/projects
    properties:
      - url: https://elevenlabs.io/docs/overview/capabilities/projects
        type: Documentation
      - url: openapi/elevenlabs-studio-openapi.yml
        type: OpenAPI
    description: The ElevenLabs Studio API provides programmatic access to the ElevenLabs Studio project
      management system. Developers can create, manage, and render long-form audio content projects through
      the API, organizing text into chapters and assigning different voices to different sections. The
      Studio is designed for producing audiobooks, podcasts, and other long-form audio content at scale.
description: Converts text into speech using a voice of your choice and returns audio.