Apache Pinot
Apache Pinot is a real-time distributed OLAP datastore designed to deliver scalable real-time analytics with low latency. It ingests data from batch and streaming sources and provides fast analytical queries for user-facing applications.
APIs
Apache Pinot REST API
The Pinot API provides REST endpoints for SQL queries, schema management, table management, segment management, cluster administration, and task management, along with a JDBC dr...
Capabilities
Apache Pinot Analytics Workflow
Workflow for executing real-time OLAP queries, managing schemas and tables, and monitoring the Pinot cluster.
Run with NaftikoFeatures
Sub-second analytical queries over real-time and historical data
Standard SQL query interface with Pinot-specific extensions
Real-time data ingestion from Kafka, Kinesis, and Pulsar
Offline data ingestion from HDFS, S3, GCS, and local files
Column-oriented storage with bitmap indexing for fast queries
Tenant isolation for broker and server resources
Pre-aggregated star-tree index for metric rollup queries
Use Cases
Power user-facing dashboards like LinkedIn Who Viewed Profile
Business intelligence dashboards over streaming data
Real-time anomaly detection over metric time series
Real-time experiment analysis and statistical significance
Integrations
Real-time stream ingestion from Kafka topics
Flink connector for streaming data into Pinot
Visual analytics and dashboards via SQL
Federated query access to Pinot via Presto connector
Grafana data source plugin for Pinot metrics