Apache Kafka

Apache Kafka is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. This listing covers the Kafka REST Proxy API, the Kafka Connect REST API, and an AsyncAPI specification of the core messaging protocol.

3 APIs · 1 Capability · 7 Features
Distributed Systems · Event Streaming · Messaging · Open Source · Pub-Sub

APIs

Kafka REST Proxy API

The Kafka REST Proxy provides a RESTful interface to a Kafka cluster for producing and consuming messages, managing topics, partitions, consumer groups, and viewing cluster stat...
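
As a rough illustration of producing through the REST interface, the sketch below builds (but does not send) a produce request in the style of the REST Proxy v2 API. The base URL `http://localhost:8082`, the topic name `orders`, and the helper `build_produce_request` are assumptions made for this example, not part of the listing.

```python
import json
import urllib.request

# Assumption: a REST Proxy reachable at this address; adjust for your deployment.
REST_PROXY_URL = "http://localhost:8082"


def build_produce_request(topic, records, base_url=REST_PROXY_URL):
    """Build, without sending, a v2 produce request carrying JSON records."""
    body = json.dumps({"records": records}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/topics/{topic}",
        data=body,
        method="POST",
        headers={
            # v2 embedded-format media type for JSON-encoded keys and values.
            "Content-Type": "application/vnd.kafka.json.v2+json",
            "Accept": "application/vnd.kafka.v2+json",
        },
    )


# Hypothetical topic and payload, for illustration only.
request = build_produce_request(
    "orders", [{"key": "order-1", "value": {"total": 42}}]
)
# To send it against a running proxy: urllib.request.urlopen(request)
```

Building the request separately from sending it keeps the example runnable without a cluster and makes the payload shape easy to inspect.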

Kafka Connect REST API

Kafka Connect REST API for managing connectors, their configurations, tasks, and offsets for integrating Kafka with external data systems including databases, object stores, and...
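
A minimal sketch of how connector management might look from a client: one request that creates a connector and one that reads its status. The worker address `http://localhost:8083`, the connector name `local-file-source`, the file path, and both helper functions are assumptions for illustration; the requests are only constructed here, not sent.

```python
import json
import urllib.request

# Assumption: a Kafka Connect worker reachable at this address.
CONNECT_URL = "http://localhost:8083"


def build_create_connector_request(name, config, base_url=CONNECT_URL):
    """Build a POST /connectors request with the connector name and config."""
    body = json.dumps({"name": name, "config": config}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/connectors",
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def build_status_request(name, base_url=CONNECT_URL):
    """Build a GET /connectors/{name}/status request."""
    return urllib.request.Request(
        url=f"{base_url}/connectors/{name}/status", method="GET"
    )


# Hypothetical connector: the file source example connector shipped with Kafka.
create_request = build_create_connector_request(
    "local-file-source",
    {
        "connector.class": "org.apache.kafka.connect.file.FileStreamSourceConnector",
        "tasks.max": "1",
        "file": "/tmp/test.txt",
        "topic": "connect-test",
    },
)
status_request = build_status_request("local-file-source")
# To send either one: urllib.request.urlopen(create_request)
```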

Apache Kafka Messaging API

The core Kafka messaging protocol for producing and consuming records to/from topics using the native Kafka binary protocol, supporting exactly-once semantics, compaction, and p...

Capabilities

Apache Kafka Event Streaming

Workflow capability for data engineers and platform architects to manage Kafka topics, produce and consume messages, and manage connectors via Kafka REST Proxy and Connect APIs.


Features

High Throughput

Handle millions of messages per second with low latency at massive scale.

Exactly-Once Semantics

Support exactly-once processing semantics through idempotent producers and the transactional API.
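
The idea behind an idempotent producer can be sketched with a toy model: the log remembers the last sequence number accepted from each producer and ignores a retried write that carries a sequence it has already seen. `ToyPartitionLog` is a hypothetical, heavily simplified stand-in for illustration, not Kafka's broker implementation (which also tracks producer epochs and works on batches).

```python
class ToyPartitionLog:
    """Toy single-partition log that deduplicates retried writes."""

    def __init__(self):
        self.records = []
        self._last_seq = {}  # producer_id -> last accepted sequence number

    def append(self, producer_id, sequence, value):
        """Append value unless this sequence was already written.

        Returns True if the record was appended, False if it was a duplicate.
        """
        last = self._last_seq.get(producer_id, -1)
        if sequence <= last:
            return False  # retry of an already-written record: skip it
        if sequence != last + 1:
            raise ValueError(f"out-of-order sequence {sequence}, expected {last + 1}")
        self.records.append(value)
        self._last_seq[producer_id] = sequence
        return True


log = ToyPartitionLog()
log.append(producer_id=7, sequence=0, value="a")
log.append(producer_id=7, sequence=1, value="b")
# The producer never saw the acknowledgement for sequence 1 and retries it.
retried = log.append(producer_id=7, sequence=1, value="b")
```

After the retry, `log.records` still holds each value once, which is the property that makes producer retries safe.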

Distributed Replication

Automatic replication across brokers for fault tolerance and high availability.

Stream Processing

Real-time stream processing via the Kafka Streams library and ksqlDB (formerly KSQL).

Connector Ecosystem

200+ pre-built Kafka Connect connectors for databases, clouds, and SaaS.

Log Compaction

Retain the latest value for each key with topic log compaction.
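
A small sketch of what compaction retains, assuming records are given as `(key, value)` pairs in offset order and that a `None` value acts as a tombstone marking the key as deleted. The `compact` function is a hypothetical illustration; a real broker compacts segment by segment in the background and keeps tombstones for a configurable period before dropping them.

```python
def compact(records):
    """Return the surviving (offset, key, value) triples after compaction.

    records: list of (key, value) pairs in offset order; value None is a
    tombstone that removes the key.
    """
    latest = {}
    for offset, (key, value) in enumerate(records):
        latest[key] = (offset, value)  # later offsets overwrite earlier ones
    survivors = [
        (offset, key, value)
        for key, (offset, value) in latest.items()
        if value is not None
    ]
    return sorted(survivors)  # original offsets are kept, in order


# Hypothetical keyed records, for illustration only.
records = [
    ("u1", "a@example.com"),
    ("u2", "b@example.com"),
    ("u1", "c@example.com"),  # supersedes offset 0
    ("u2", None),             # tombstone: u2 is deleted
    ("u3", "d@example.com"),
]
compacted = compact(records)
```

Here `compacted` keeps only offset 2 for `u1` and offset 4 for `u3`; the surviving records keep their original offsets rather than being renumbered.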

Consumer Groups

Horizontally scalable consumers with automatic partition rebalancing.
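
To make rebalancing concrete, the sketch below spreads partitions over group members in round-robin order and recomputes the assignment when a member joins. `assign_round_robin` and the member names are hypothetical; this mimics the idea of a round-robin assignment strategy for a single topic and is not the group coordination protocol itself.

```python
def assign_round_robin(partitions, members):
    """Spread partitions over members one at a time, in sorted order."""
    if not members:
        raise ValueError("a consumer group needs at least one member")
    ordered = sorted(members)
    assignment = {member: [] for member in ordered}
    for i, partition in enumerate(sorted(partitions)):
        assignment[ordered[i % len(ordered)]].append(partition)
    return assignment


partitions = range(6)  # a hypothetical topic with six partitions

before = assign_round_robin(partitions, ["c1", "c2"])
# A third consumer joins the group, so the partitions are redistributed.
after = assign_round_robin(partitions, ["c1", "c2", "c3"])
```

With two members each consumer owns three partitions; once `c3` joins, each owns two. Every partition has exactly one owner in both cases, so adding consumers beyond the partition count would leave the extras idle.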

Use Cases

Event-Driven Architecture

Build event-driven microservices with reliable message delivery.

Data Pipeline

Move data between systems at scale with exactly-once delivery guarantees.

Real-Time Analytics

Process and analyze event streams in real time with Kafka Streams.

Log Aggregation

Centralize application and infrastructure logs for analysis and alerting.

CDC (Change Data Capture)

Capture database changes and stream them to data warehouses and caches.

Integrations

Apache Spark

Spark Structured Streaming integration for batch and streaming analytics.

Apache Flink

Native Flink Kafka connector for low-latency stream processing.

Debezium

CDC platform using Kafka Connect to capture database change events.
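
As an illustration of consuming change events, the sketch below applies Debezium-style event payloads (with `op`, `before`, and `after` fields) to an in-memory cache. The `apply_change` helper, the `id` key field, and the sample rows are assumptions for this example; real events also carry schema and source metadata that is ignored here.

```python
def apply_change(cache, event, key_field="id"):
    """Apply one change event payload to a dict keyed by the row's key field."""
    op = event["op"]
    if op in ("c", "r", "u"):  # create, snapshot read, update
        row = event["after"]
        cache[row[key_field]] = row
    elif op == "d":  # delete: the old row image identifies the key
        cache.pop(event["before"][key_field], None)
    else:
        raise ValueError(f"unknown operation {op!r}")
    return cache


# Hypothetical change events for a customers table, for illustration only.
events = [
    {"op": "c", "before": None, "after": {"id": 1, "name": "Ada"}},
    {"op": "c", "before": None, "after": {"id": 2, "name": "Bob"}},
    {"op": "u", "before": {"id": 1, "name": "Ada"}, "after": {"id": 1, "name": "Ada L."}},
    {"op": "d", "before": {"id": 2, "name": "Bob"}, "after": None},
]

cache = {}
for event in events:
    apply_change(cache, event)
```

After replaying the four events the cache holds only the updated row for `id` 1.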

Elasticsearch

Kafka Connect Elasticsearch sink for indexing event data.

Amazon S3

Kafka Connect S3 sink for archiving event streams to object storage.

Apache Hadoop

HDFS sink connector for streaming data into a Hadoop data lake.

Event Specifications

Apache Kafka Messaging API

Apache Kafka is a distributed event streaming platform capable of handling trillions of events a day. This spec describes the core messaging protocol for producing and consuming...

ASYNCAPI

Semantic Vocabularies

Apache Kafka Kafka Connect Config Context

2 classes · 3 properties

JSON-LD

Apache Kafka Kafka Connect Connector Context

5 classes · 7 properties

JSON-LD

Apache Kafka Kafka Connect Create Context

2 classes · 1 property

JSON-LD

Apache Kafka Kafka Connect Task Context

2 classes · 7 properties

JSON-LD

Apache Kafka Kafka Rest Proxy Context

10 classes · 41 properties

JSON-LD

API Governance Rules

Apache Kafka API Rules

16 rules · 9 errors · 3 warnings · 4 info

SPECTRAL

Resources

GitHub Organization
GitHub Repository
Documentation
Getting Started
Terms of Service
Versioning
Spectral Rules
Vocabulary
Naftiko Capability