Apache SeaTunnel logo

Apache SeaTunnel

Apache SeaTunnel is a high-performance, distributed data integration platform that supports real-time and batch data synchronization. It provides a connector API with support for over 100 data sources and sinks.

1 APIs 1 Capabilities 6 Features
Data IntegrationETLELTBatchStreamingApacheOpen Source

APIs

Apache SeaTunnel REST API

SeaTunnel provides a REST API for job management and monitoring, a Connector API for building custom data sources and sinks, and a Transform API for data transformation, support...

Capabilities

Features

200+ Connectors

Over 200 built-in connectors for databases, warehouses, and file systems

Batch and Streaming

Unified API for both batch ETL and real-time streaming jobs

Schema Evolution

Automatic schema detection and evolution support

Distributed Execution

Zeta execution engine with no external dependencies

CDC Support

Change Data Capture for real-time database synchronization

Transform Layer

Built-in SQL and custom transform functions

Use Cases

Database Migration

Migrate data between databases with schema mapping

Data Warehouse Loading

Load and sync data into data warehouses

Real-Time Synchronization

CDC-based real-time sync between source and target systems

Data Lake Ingestion

Ingest data from multiple sources into a data lake

Integrations

Apache Kafka

Kafka source and sink connector for streaming pipelines

Apache Flink

Run SeaTunnel jobs on Flink execution engine

Apache Spark

Run SeaTunnel jobs on Spark execution engine

ClickHouse

High-performance ClickHouse sink connector

Doris

Apache Doris connector for analytical workloads

Semantic Vocabularies

Apache Seatunnel Context

10 classes · 32 properties

JSON-LD

API Governance Rules

Apache SeaTunnel API Rules

6 rules · 4 errors 2 warnings

SPECTRAL

Resources

👥
GitHubOrganization
GitHubOrganization
🔗
Documentation
Documentation
🔗
SpectralRules
SpectralRules
🔗
Vocabulary
Vocabulary
🔗
NaftikoCapability
NaftikoCapability
🔗
JSON-LD
JSON-LD