Airbyte

Airbyte is an open-source data integration platform that moves and consolidates data from databases, APIs, and other third-party applications into a central destination for analysis. It is available in both self-hosted and cloud-hosted forms, with a catalog of hundreds of pre-built connectors.

1 API · 16 Features
Tags: Data Integration, ETL, ELT, Open Source, Data Pipeline, Connectors, Data

APIs

Airbyte

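Airbyte is an open-source data integration platform that enables businesses to move and consolidate data from various sources into centralized destinations. The Airbyte API provides programmatic control over sources, destinations, connections, jobs, workspaces, and organizations for both Airbyte Cloud and self-managed deployments.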
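
As a rough illustration of calling the API, the sketch below builds (without sending) an authenticated request against the base URL given in this entry's source listing, `https://api.airbyte.com/v1`. The `/connections` path, the `workspaceIds` and `limit` query parameters, and the function name are assumptions made for illustration; the API reference remains the authority on the actual endpoints.

```python
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "https://api.airbyte.com/v1"  # baseURL from the catalog entry


def build_list_connections_request(token: str, workspace_id: str, limit: int = 20) -> Request:
    """Build (but do not send) a GET request for a workspace's connections.

    Assumption: the `/connections` path and the `workspaceIds` / `limit`
    query parameters are illustrative; check the API reference before use.
    """
    query = urlencode({"workspaceIds": workspace_id, "limit": limit})
    return Request(
        f"{BASE_URL}/connections?{query}",
        headers={
            "Authorization": f"Bearer {token}",  # bearer access token
            "Accept": "application/json",
        },
        method="GET",
    )


# Hypothetical token and workspace ID, used only to show the resulting URL.
request = build_list_connections_request("example-token", "ws-123")
print(request.full_url)
```

Building the request separately from sending it keeps the example free of network calls; passing the object to `urllib.request.urlopen` would perform the call.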

Features

Core: free open-source self-managed
Standard from $10/mo: fully managed, volume-based
Plus: bulk-credit discounts, accelerated support
Pro: Data Workers capacity pricing, SSO, RBAC
Enterprise Flex: custom limits, dedicated SA
600+ pre-built connectors
Public API at 60 req/min/workspace
OAuth 2.0 + workspace API keys
Webhooks for sync events
Connector Builder for custom sources
dbt Cloud integration for transformations
Multiple workspaces (Pro+)
SSO and RBAC (Pro+)
Self-hosted Enterprise option
Bring your own custom connector
Data Activation (Reverse ETL) capabilities
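
The 60 requests per minute per workspace limit listed above suggests throttling on the client side. The following is a minimal sketch of a sliding-window limiter, assuming one limiter instance per workspace; the class name and its methods are hypothetical, and how the server actually measures the window is not stated in this listing.

```python
import time
from collections import deque
from typing import Callable, Deque


class SlidingWindowLimiter:
    """Allow at most `max_calls` calls in any `period`-second window."""

    def __init__(self, max_calls: int = 60, period: float = 60.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.max_calls = max_calls
        self.period = period
        self._clock = clock
        self._calls: Deque[float] = deque()  # timestamps of recent calls

    def wait_time(self) -> float:
        """Seconds to wait before the next call is allowed (0.0 if allowed now)."""
        now = self._clock()
        # Drop timestamps that have left the window.
        while self._calls and now - self._calls[0] >= self.period:
            self._calls.popleft()
        if len(self._calls) < self.max_calls:
            return 0.0
        # The window is full: wait until the oldest call expires.
        return self.period - (now - self._calls[0])

    def record(self) -> None:
        """Record that a call is being made now."""
        self._calls.append(self._clock())

    def acquire(self, sleep: Callable[[float], None] = time.sleep) -> None:
        """Block until a call is allowed, then record it."""
        delay = self.wait_time()
        if delay > 0:
            sleep(delay)
        self.record()


# One limiter per workspace; call limiter.acquire() before each API request.
limiter = SlidingWindowLimiter(max_calls=60, period=60.0)
```

The clock and sleep functions are injectable so the limiter can be exercised in tests without real waiting.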

Use Cases

Data Warehouse Loading

Sync operational data to Snowflake, BigQuery, Redshift, or other warehouses.

Data Lake Ingestion

Land raw data into S3, GCS, or Azure data lakes.

Analytics Pipelines

Build ELT pipelines for business intelligence and analytics.

AI/ML Data Preparation

Aggregate training data from multiple sources for machine learning.

API Data Sync

Pull data from SaaS APIs (Salesforce, HubSpot, Stripe) into your data stack.

Database Replication

Replicate relational databases with change data capture (CDC).

Vector Database Population

Load and embed data into vector stores for AI search and retrieval.

Integrations

Apache Airflow

Orchestrate Airbyte syncs from Airflow DAGs.

dbt

Transform data after Airbyte syncs with dbt models.

Snowflake

Load data into Snowflake data warehouse.

BigQuery

Sync data to Google BigQuery.

Redshift

Load data into Amazon Redshift.

Databricks

Ingest data into Databricks lakehouse.

Terraform

Infrastructure-as-code support for Airbyte resources.

Kubernetes / Helm

Deploy Airbyte on Kubernetes using official Helm charts.

Semantic Vocabularies

Airbyte Context

108 classes · 129 properties

JSON-LD

API Governance Rules

Airbyte API Rules

28 rules · 9 errors 12 warnings 7 info

SPECTRAL

Resources

ArazzoWorkflows
LinkedIn
Portal
Console
SignUp
Pricing
GitHubOrganization
GitHubRepository
GettingStarted
StatusPage
Blog
Tutorials
Support
PrivacyPolicy
TermsOfService
Newsletter
ChangeLog
RoadMap
PyAirbyte (SDK)
Airbyte CLI (abctl) (CLI)
Python Connector CDK (SDK)
Agent SDK (SDK)
Helm Chart (SDK)
Airbyte Spectral Rules (SpectralRules)
Airbyte Vocabulary (Vocabulary)
AgentSkill
LLMsTxt

Sources

aid: airbyte
url: https://raw.githubusercontent.com/api-evangelist/airbyte/refs/heads/main/apis.yml
apis:
  - aid: airbyte:airbyte
    name: Airbyte
    tags:
      - Data Integration
      - ETL
      - ELT
      - Open Source
      - Data Pipeline
      - Connectors
    humanURL: https://airbyte.com
    properties:
      - url: https://docs.airbyte.com
        type: Documentation
      - url: https://reference.airbyte.com/reference/getting-started
        type: APIReference
      - url: https://raw.githubusercontent.com/api-evangelist/airbyte/refs/heads/main/openapi/airbyte-openapi.yml
        type: OpenAPI
      - url: https://docs.airbyte.com/api-documentation
        type: Authentication
      - url: https://github.com/airbytehq/airbyte-api-python-sdk
        type: SDK
        title: Python SDK
      - url: https://github.com/airbytehq/airbyte-api-java-sdk
        type: SDK
        title: Java SDK
      - url: https://pypi.org/project/airbyte-api/
        type: SDK
        title: Python SDK PyPI
      - url: https://github.com/airbytehq/terraform-provider-airbyte
        type: SDK
        title: Terraform Provider
    description: >-
      Airbyte is an open-source data integration platform that enables businesses to move and consolidate data from
      various sources into centralized destinations. The Airbyte API provides programmatic control over sources,
      destinations, connections, jobs, workspaces, and organizations for both Airbyte Cloud and self-managed
      deployments.
    baseURL: https://api.airbyte.com/v1
name: Airbyte
tags:
  - Data Integration
  - ETL
  - ELT
  - Open Source
  - Data Pipeline
  - Connectors
  - Data
type: Index
image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
access: 3rd-Party
common:
  - type: ArazzoWorkflows
    url: arazzo/
    workflows:
      - url: arazzo/airbyte-application-token-bootstrap-workflow.yml
        name: Airbyte Bootstrap an Application and Access Token
        summary: >-
          Create an API application to mint client credentials, read it back, then exchange those credentials for a
          bearer access token.
      - url: arazzo/airbyte-cancel-running-job-workflow.yml
        name: Airbyte Find and Cancel a Running Job
        summary: >-
          List the running jobs for a connection, branch on whether any are running, then cancel the first one and
          confirm the cancellation.
      - url: arazzo/airbyte-destination-create-verify-workflow.yml
        name: Airbyte Create and Verify a Destination
        summary: >-
          Create a destination connector, read it back to confirm it persisted, and list the workspace's destinations to
          confirm it appears.
      - url: arazzo/airbyte-find-connection-and-sync-workflow.yml
        name: Airbyte Find a Connection and Trigger Its Sync
        summary: >-
          List the connections in a workspace, branch on whether any exist, then trigger and confirm a sync for the
          first connection.
      - url: arazzo/airbyte-provision-pipeline-workflow.yml
        name: Airbyte Provision a Full Data Pipeline
        summary: >-
          Stand up a source, a destination, inspect the available streams, wire them into a connection, and kick off the
          first sync.
      - url: arazzo/airbyte-reset-and-resync-workflow.yml
        name: Airbyte Reset a Connection and Re-Sync
        summary: >-
          Trigger a reset job to clear destination data, poll it to completion, then branch into a fresh sync when the
          reset succeeds.
      - url: arazzo/airbyte-source-create-verify-workflow.yml
        name: Airbyte Create and Verify a Source
        summary: >-
          Create a source connector, read it back to confirm it persisted, and list the workspace's sources to confirm
          it appears.
      - url: arazzo/airbyte-tag-connection-workflow.yml
        name: Airbyte Create a Tag and Apply It to a Connection
        summary: >-
          Create an organizing tag in a workspace, find the first connection, and patch that connection to carry the new
          tag.
      - url: arazzo/airbyte-trigger-sync-and-poll-workflow.yml
        name: Airbyte Trigger a Sync and Poll to Completion
        summary: Kick off a sync job for a connection and poll its status until it succeeds, fails, or is cancelled.
      - url: arazzo/airbyte-workspace-bootstrap-workflow.yml
        name: Airbyte Bootstrap a Workspace
        summary: >-
          Create a workspace, read it back, create an organizing tag inside it, and confirm the workspace starts empty
          of connections.
  - type: LinkedIn
    url: https://www.linkedin.com/company/airbytehq
  - url: https://airbyte.com
    type: Portal
  - url: https://cloud.airbyte.io
    type: Console
  - url: https://cloud.airbyte.io
    type: SignUp
  - url: https://airbyte.com/pricing
    type: Pricing
  - url: https://github.com/airbytehq
    type: GitHubOrganization
  - url: https://github.com/airbytehq/airbyte
    type: GitHubRepository
  - url: https://docs.airbyte.com
    type: GettingStarted
  - url: https://status.airbyte.com
    type: StatusPage
  - url: https://airbyte.com/blog
    type: Blog
  - url: https://airbyte.com/tutorials
    type: Tutorials
  - url: https://support.airbyte.com/hc/en-us
    type: Support
  - url: https://airbyte.com/company/privacy-policy
    type: PrivacyPolicy
  - url: https://airbyte.com/company/terms
    type: TermsOfService
  - url: https://airbyte.com/community/newsletter
    type: Newsletter
  - url: https://docs.airbyte.com/category/release-notes/
    type: ChangeLog
  - url: https://github.com/orgs/airbytehq/projects/37/views/1
    type: RoadMap
  - url: https://airbyte.com/connectors
    type: Integrations
  - url: https://github.com/airbytehq/PyAirbyte
    type: SDK
    title: PyAirbyte
  - url: https://github.com/airbytehq/abctl
    type: CLI
    title: Airbyte CLI (abctl)
  - url: https://github.com/airbytehq/airbyte-python-cdk
    type: SDK
    title: Python Connector CDK
  - url: https://github.com/airbytehq/airbyte-agent-sdk
    type: SDK
    title: Agent SDK
  - url: https://artifacthub.io/packages/helm/airbyte/airbyte
    type: SDK
    title: Helm Chart
  - type: Features
    data:
      - 'Core: free open-source self-managed'
      - 'Standard from $10/mo: fully managed, volume-based'
      - 'Plus: bulk-credit discounts, accelerated support'
      - 'Pro: Data Workers capacity pricing, SSO, RBAC'
      - 'Enterprise Flex: custom limits, dedicated SA'
      - 600+ pre-built connectors
      - Public API at 60 req/min/workspace
      - OAuth 2.0 + workspace API keys
      - Webhooks for sync events
      - Connector Builder for custom sources
      - dbt Cloud integration for transformations
      - Multiple workspaces (Pro+)
      - SSO and RBAC (Pro+)
      - Self-hosted Enterprise option
      - Bring your own custom connector
      - Data Activation (Reverse ETL) capabilities
    sources:
      - https://airbyte.com/pricing
    updated: '2026-05-04'
  - type: UseCases
    data:
      - name: Data Warehouse Loading
        description: Sync operational data to Snowflake, BigQuery, Redshift, or other warehouses.
      - name: Data Lake Ingestion
        description: Land raw data into S3, GCS, or Azure data lakes.
      - name: Analytics Pipelines
        description: Build ELT pipelines for business intelligence and analytics.
      - name: AI/ML Data Preparation
        description: Aggregate training data from multiple sources for machine learning.
      - name: API Data Sync
        description: Pull data from SaaS APIs (Salesforce, HubSpot, Stripe) into your data stack.
      - name: Database Replication
        description: Replicate relational databases with change data capture (CDC).
      - name: Vector Database Population
        description: Load and embed data into vector stores for AI search and retrieval.
  - type: Integrations
    data:
      - name: Apache Airflow
        description: Orchestrate Airbyte syncs from Airflow DAGs.
      - name: dbt
        description: Transform data after Airbyte syncs with dbt models.
      - name: Snowflake
        description: Load data into Snowflake data warehouse.
      - name: BigQuery
        description: Sync data to Google BigQuery.
      - name: Redshift
        description: Load data into Amazon Redshift.
      - name: Databricks
        description: Ingest data into Databricks lakehouse.
      - name: Terraform
        description: Infrastructure-as-code support for Airbyte resources.
      - name: Kubernetes / Helm
        description: Deploy Airbyte on Kubernetes using official Helm charts.
  - url: https://raw.githubusercontent.com/api-evangelist/airbyte/refs/heads/main/rules/airbyte-spectral-rules.yml
    type: SpectralRules
    title: Airbyte Spectral Rules
  - url: https://raw.githubusercontent.com/api-evangelist/airbyte/refs/heads/main/vocabulary/airbyte-vocabulary.yaml
    type: Vocabulary
    title: Airbyte Vocabulary
  - name: Agent Skills
    url: https://github.com/airbytehq/airbyte-claude-skills
    type: AgentSkill
  - type: LLMsTxt
    url: https://docs.airbyte.com/llms.txt
created: '2025-01-08'
modified: '2026-05-19'
position: Consumer
description: >-
  Airbyte is an open-source data integration platform that enables businesses to easily and efficiently move and
  consolidate their data from various sources into one centralized location. With Airbyte, organizations can seamlessly
  connect and synchronize data from sources such as databases, APIs, and other third-party applications, allowing for
  real-time insights and analysis. Airbyte offers both self-hosted and cloud-hosted options, with a catalog of hundreds
  of pre-built connectors.
maintainers:
  - FN: Kin Lane
    email: [email protected]
specificationVersion: '0.19'
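
The "Trigger a Sync and Poll to Completion" workflow in the listing above polls a job until it succeeds, fails, or is cancelled. A minimal sketch of that polling loop follows, assuming the status strings `succeeded`, `failed`, and `cancelled`; the `poll_job` helper and its `get_status` callable are hypothetical and stand in for whatever call fetches the job's current status.

```python
import time
from typing import Callable

# Assumed terminal status strings, taken from the workflow summary's wording.
TERMINAL_STATUSES = {"succeeded", "failed", "cancelled"}


def poll_job(get_status: Callable[[], str],
             interval: float = 10.0,
             timeout: float = 3600.0,
             sleep: Callable[[float], None] = time.sleep,
             clock: Callable[[], float] = time.monotonic) -> str:
    """Call `get_status` until it returns a terminal status or `timeout` elapses."""
    deadline = clock() + timeout
    while True:
        status = get_status()
        if status in TERMINAL_STATUSES:
            return status
        if clock() >= deadline:
            raise TimeoutError(f"job still {status!r} after {timeout} seconds")
        sleep(interval)


# Simulated status sequence in place of real API calls; sleeping is disabled.
statuses = iter(["pending", "running", "running", "succeeded"])
result = poll_job(lambda: next(statuses), sleep=lambda _: None)
print(result)
```

In real use, `get_status` would wrap a request for the job and the default `time.sleep` would space out the polls, which also helps stay within the request rate limit.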