Google Cloud Speech-To-Text

The Google Cloud Speech-to-Text API converts audio to text using deep learning models. It supports more than 125 languages and variants and offers both real-time streaming and batch transcription.

1 API, 0 Features
Tags: Audio Processing, Google Cloud, Machine Learning, Speech Recognition, Transcription

APIs

Google Cloud Speech-to-Text API

The Google Cloud Speech-to-Text API provides speech recognition capabilities to convert audio to text, supporting synchronous recognition, asynchronous batch processing, and rea...
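
As a rough illustration of the synchronous recognition mode mentioned above, the following Python sketch builds (but does not send) a request against the `baseURL` given in this entry's index, `https://speech.googleapis.com/v1`. The `speech:recognize` path and the `config`/`audio` body fields are taken from Google's public v1 REST reference rather than from this listing. The function name `build_recognize_request`, its default values, and the access token are hypothetical placeholders.

```python
import base64
import json
import urllib.request

# baseURL as listed in this entry's apis.yml.
BASE_URL = "https://speech.googleapis.com/v1"


def build_recognize_request(audio_bytes, access_token,
                            language_code="en-US", sample_rate_hertz=16000):
    """Build a POST request for synchronous recognition (illustrative helper).

    Assumes raw 16-bit PCM audio (LINEAR16); adjust the encoding and sample
    rate to match the actual recording.
    """
    body = {
        "config": {
            "encoding": "LINEAR16",
            "sampleRateHertz": sample_rate_hertz,
            "languageCode": language_code,
        },
        # Inline audio is sent as base64-encoded bytes.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/speech:recognize",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_token}",  # placeholder token
            "Content-Type": "application/json; charset=utf-8",
        },
        method="POST",
    )


# Example: build a request for two bytes of dummy audio.
request = build_recognize_request(b"\x00\x01", "TOKEN")
print(request.full_url)
```

Sending the request with `urllib.request.urlopen(request)` would require a valid OAuth 2.0 access token obtained as described on the Authentication page. Longer recordings and live audio would go through the asynchronous and streaming modes instead, which this sketch does not cover.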

Resources

Portal
Getting Started
Documentation
Authentication
Pricing
Terms of Service
Privacy Policy
Status
Support
JSONLDContext

Sources

aid: google-cloud-speech-to-text
name: Google Cloud Speech-To-Text
description: >-
  Google Cloud Speech-to-Text API converts audio to text using advanced deep
  learning models, supporting over 125 languages and variants with real-time
  streaming and batch transcription capabilities.
image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
url: https://raw.githubusercontent.com/api-evangelist/google-cloud-speech-to-text/refs/heads/main/apis.yml
created: '2026-03-13'
modified: '2026-04-28'
specificationVersion: '0.19'
type: Index
tags:
  - Audio Processing
  - Google Cloud
  - Machine Learning
  - Speech Recognition
  - Transcription
apis:
  - name: Google Cloud Speech-to-Text API
    description: >-
      The Google Cloud Speech-to-Text API provides speech recognition
      capabilities to convert audio to text, supporting synchronous recognition,
      asynchronous batch processing, and real-time streaming transcription.
    image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    humanURL: https://cloud.google.com/speech-to-text/docs
    baseURL: https://speech.googleapis.com/v1
    tags:
      - Audio Analysis
      - Speech Recognition
      - Streaming
      - Transcription
    properties:
      - type: Documentation
        url: https://cloud.google.com/speech-to-text/docs/reference/rest
      - type: OpenAPI
        url: openapi/openapi.yml
      - type: Authentication
        url: https://cloud.google.com/docs/authentication
      - type: Getting Started
        url: https://cloud.google.com/speech-to-text/docs/quickstart
      - type: JSONSchema
        url: json-schema/json-schema.yml
      - type: JSONLDContext
        url: json-ld/json-ld.yml
common:
  - type: Portal
    url: https://cloud.google.com/speech-to-text
  - type: Getting Started
    url: https://cloud.google.com/speech-to-text/docs/quickstart
  - type: Documentation
    url: https://cloud.google.com/speech-to-text/docs
  - type: Authentication
    url: https://cloud.google.com/docs/authentication
  - type: Pricing
    url: https://cloud.google.com/speech-to-text/pricing
  - type: Terms of Service
    url: https://cloud.google.com/terms
  - type: Privacy Policy
    url: https://policies.google.com/privacy
  - type: Status
    url: https://status.cloud.google.com/
  - type: Support
    url: https://cloud.google.com/speech-to-text/docs/support
  - type: JSONLDContext
    url: json-ld/json-ld.yml
maintainers:
  - FN: Kin Lane
    email: [email protected]
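
Several `properties` entries in the index above use relative URLs (for example `openapi/openapi.yml`). A minimal Python sketch of how a consumer might resolve them follows, assuming relative paths are interpreted against the location of the index file given in the top-level `url` field. That resolution rule is an assumption, not something the index states. The helper name `resolve_property` is hypothetical, and the `PROPERTIES` list is a hand-copied subset of the entries above rather than a parsed file.

```python
from urllib.parse import urljoin

# Top-level `url` of the index, copied from the YAML above.
INDEX_URL = (
    "https://raw.githubusercontent.com/api-evangelist/"
    "google-cloud-speech-to-text/refs/heads/main/apis.yml"
)

# Hand-copied subset of the API's `properties` list.
PROPERTIES = [
    {"type": "Documentation",
     "url": "https://cloud.google.com/speech-to-text/docs/reference/rest"},
    {"type": "OpenAPI", "url": "openapi/openapi.yml"},
    {"type": "JSONSchema", "url": "json-schema/json-schema.yml"},
    {"type": "JSONLDContext", "url": "json-ld/json-ld.yml"},
]


def resolve_property(properties, prop_type, index_url=INDEX_URL):
    """Return the absolute URL of the first property with the given type.

    Absolute URLs pass through unchanged; relative ones are joined against
    the index URL (assumed resolution rule). Returns None if no match.
    """
    for prop in properties:
        if prop["type"] == prop_type:
            return urljoin(index_url, prop["url"])
    return None


print(resolve_property(PROPERTIES, "OpenAPI"))
```

Under that assumption, the OpenAPI entry resolves to a path next to `apis.yml` in the same repository branch, while the Documentation entry is returned as-is because it is already absolute.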