Speechmatics
Speechmatics is an enterprise-grade speech intelligence platform headquartered in Cambridge, UK, offering highly accurate speech-to-text APIs supporting 55+ languages with batch and real-time transcription modes. The platform provides REST APIs for batch transcription job management and WebSocket APIs for low-latency real-time streaming transcription, along with speaker diarization, speaker identification, custom vocabulary, sentiment analysis, translation, and topic detection. Speechmatics also offers a Text-to-Speech API, a Voice Agent API (early access), and a Management API for programmatic account and API key management. Deployments are available as cloud SaaS, containerized on-premises, Kubernetes, and virtual appliance options.
APIs
Speechmatics Batch Transcription API
REST API for submitting audio files for asynchronous batch transcription, managing jobs, and retrieving transcripts with support for 55+ languages, diarization, translation, and...
Speechmatics Realtime Transcription API
WebSocket API for low-latency real-time streaming transcription supporting live audio input, speaker diarization, and partial/final transcript events.
Speechmatics Management API
REST API for programmatic management of projects, API keys, usage tracking, and account administration.
Speechmatics Text-to-Speech API
REST API for converting text to natural-sounding speech with multiple voice options and language support.