Apache Kylin
Apache Kylin is an open-source distributed analytics engine designed to provide a SQL interface and multi-dimensional analysis (OLAP) on large-scale datasets. It delivers sub-second query latency on trillion-record datasets by pre-computing cubes, and it runs on top of Hadoop, Spark, and cloud storage.
APIs
Apache Kylin REST API
The Kylin REST API provides endpoints for SQL query execution, model management, project management, job management, and table management for administering the Kylin OLAP engine.
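As a rough illustration of SQL query execution over REST, the sketch below builds (but does not send) a query request in Python. It assumes the `/kylin/api/query` path, HTTP Basic authentication, and the `sql`, `project`, `offset`, and `limit` payload fields used by commonly documented Kylin releases; check these against the version you run. The host, project, table, and credentials are placeholders, and `build_query_request` is a hypothetical helper, not part of Kylin.

```python
import base64
import json
import urllib.request


def build_query_request(base_url, project, sql, user, password, limit=100):
    """Build (but do not send) a POST request for Kylin's SQL query endpoint."""
    payload = {
        "sql": sql,
        "project": project,  # Kylin project to run the query in
        "offset": 0,
        "limit": limit,
    }
    # HTTP Basic auth token: base64 of "user:password".
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/kylin/api/query",  # assumed endpoint path
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": "Basic " + token,
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder host, project, table, and credentials (all assumptions).
request = build_query_request(
    "http://localhost:7070",
    project="demo_project",
    sql="SELECT part_dt, SUM(price) FROM sales GROUP BY part_dt",
    user="analyst",
    password="secret",
)

# To run against a live server, uncomment:
# with urllib.request.urlopen(request) as resp:
#     body = json.load(resp)
```

Building the request separately from sending it keeps the sketch runnable without a Kylin server and makes the payload easy to inspect.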
Apache Kylin JDBC Driver
The Kylin JDBC driver provides SQL-over-Kylin access for BI tools and SQL clients, enabling standard JDBC connectivity to Kylin OLAP cubes.
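For orientation, a BI tool or SQL client typically needs two values to use the driver: a driver class name and a connection URL. The sketch below assumes the `jdbc:kylin://<host>:<port>/<project>` URL form and the `org.apache.kylin.jdbc.Driver` class name; confirm both against the driver version you install. The helper `kylin_jdbc_url` and the example host and project are hypothetical.

```python
# Driver class name to enter in a BI tool or SQL client (assumed; verify
# against the installed driver).
KYLIN_JDBC_DRIVER_CLASS = "org.apache.kylin.jdbc.Driver"


def kylin_jdbc_url(host, port, project):
    """Return a Kylin JDBC connection URL for the given project."""
    return f"jdbc:kylin://{host}:{port}/{project}"


# Placeholder host, port, and project name.
url = kylin_jdbc_url("localhost", 7070, "demo_project")
print(url)
```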
Capabilities
Apache Kylin OLAP Analytics
A workflow capability that lets data analysts and BI engineers execute OLAP queries, manage projects, and monitor cube build jobs in Apache Kylin.
Features
Pre-computed cubes enable sub-second query response on trillion-record datasets.
ANSI SQL interface for business analysts using existing SQL skills.
Build cubes with aggregates pre-calculated for instant query response.
Works on top of Hadoop, Spark, and cloud object storage.
Standard JDBC and ODBC drivers for BI tool integration.
Incremental cube building with date-range segment management.
Project-based multi-tenancy for isolating datasets and access.
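The idea behind pre-calculated aggregates can be sketched with a toy example. The code below is not Kylin's implementation; it is a minimal in-memory illustration that pre-aggregates a SUM for every combination of dimensions, so that a later GROUP BY becomes a lookup rather than a scan of the raw rows. The function names, the sample rows, and the choice of SUM as the only measure are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def build_cuboids(rows, dimensions, measure):
    """Pre-aggregate SUM(measure) for every subset of the dimensions."""
    cuboids = {}
    for size in range(len(dimensions) + 1):
        for dims in combinations(dimensions, size):
            totals = defaultdict(float)
            for row in rows:
                key = tuple(row[d] for d in dims)
                totals[key] += row[measure]
            cuboids[dims] = dict(totals)
    return cuboids


def query(cuboids, group_by):
    """Answer a GROUP BY with a lookup instead of scanning the raw rows.

    The dimensions in group_by must be listed in the same order that was
    used when the cuboids were built.
    """
    return cuboids[tuple(group_by)]


# Hypothetical fact rows.
rows = [
    {"region": "EU", "year": 2023, "sales": 10.0},
    {"region": "EU", "year": 2024, "sales": 5.0},
    {"region": "US", "year": 2024, "sales": 7.0},
]

cube = build_cuboids(rows, dimensions=("region", "year"), measure="sales")
by_region = query(cube, ["region"])  # totals per region
grand_total = query(cube, [])        # single total across all rows
```

The trade-off is visible even at this scale: two dimensions produce four cuboids, and the count doubles with each added dimension, which is why build time and storage are spent up front in exchange for fast queries.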
Use Cases
Accelerate slow Hive or Spark queries with Kylin cube pre-computation.
Connect Tableau, Power BI, and Superset to Kylin via JDBC for analytics.
Stream data into Kylin incrementally for near-real-time OLAP analytics.
Generate business reports over trillion-record datasets in seconds.
Integrations
Reads from HDFS and executes MapReduce cube builds on Hadoop.
Spark-based cube building for faster and more efficient data processing.
Hive metastore integration for table schema and metadata.
HBase storage for pre-computed cube data.
Native Tableau connector via Kylin JDBC driver.
Apache Superset integration via JDBC for self-service analytics.