Apache Zeppelin

Apache Zeppelin is a web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala, Python, R, and more. It provides built-in data visualization, collaboration features, and interpreter integration with Apache Spark, JDBC, Python, R, Shell, and 20+ other backends. Zeppelin exposes a REST API for notebook management, interpreter configuration, and job execution. It is maintained by the Apache Software Foundation.

Data Analytics, Interactive Computing, Notebook, Visualization, Open Source

APIs

Apache Zeppelin REST API

The Zeppelin REST API provides HTTP endpoints for managing notebooks, notes, paragraphs, interpreters, and credentials. Key endpoints include: GET/POST /api/notebook for noteboo...
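
As a minimal sketch of calling the /api/notebook endpoint named above, the following Python builds (but does not send) a GET and a POST request. The base URL http://localhost:8080, the helper names, and the {"name": ...} request body are assumptions for illustration; verify them against the REST API documentation for your Zeppelin version.

```python
import json
import urllib.request

# Assumed base URL; replace with the address of your own Zeppelin server.
ZEPPELIN_URL = "http://localhost:8080"


def list_notes_request(base_url=ZEPPELIN_URL):
    """Build a GET /api/notebook request (hypothetical helper)."""
    return urllib.request.Request(f"{base_url}/api/notebook", method="GET")


def create_note_request(name, base_url=ZEPPELIN_URL):
    """Build a POST /api/notebook request (hypothetical helper).

    The {"name": ...} body is an assumption about the expected payload.
    """
    body = json.dumps({"name": name}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/api/notebook",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = create_note_request("demo note")
    print(req.get_method(), req.full_url)
    # To send it against a running server, uncomment:
    # with urllib.request.urlopen(req) as resp:
    #     print(json.load(resp))
```

Building the request separately from sending it keeps the example runnable without a server; pass the result to urllib.request.urlopen once a Zeppelin instance is reachable.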

Features

Multi-Language Support

Execute code in Scala, Python, R, SQL, Shell, and 20+ other interpreter backends in the same notebook.

Built-In Visualization

Bar, line, pie, scatter, and map charts from query results without additional tools.

Collaborative Notebooks

Real-time collaborative editing of notebooks with user management and permissions.

Spark Integration

Native Apache Spark interpreter for Scala, Python (PySpark), and SQL queries.

JDBC Interpreter

Generic JDBC interpreter for any database with a JDBC driver, including MySQL, PostgreSQL, and Hive.

Paragraph Scheduling

Schedule a note to run its paragraphs automatically on a cron expression.
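
To illustrate what a schedule might look like, here is a rough Python sketch. It assumes Zeppelin accepts Quartz-style cron expressions (six or seven fields, starting with seconds) and that a schedule is registered by posting a {"cron": ...} body to /api/notebook/cron/{noteId}; the helper names, the payload key, the endpoint path, and the note ID are assumptions, not confirmed by this page.

```python
import json


def is_quartz_like(expr):
    """Rough shape check only: Quartz-style expressions have 6 or 7 fields."""
    return len(expr.split()) in (6, 7)


def cron_path(note_id):
    """Assumed REST path for setting a note's schedule."""
    return f"/api/notebook/cron/{note_id}"


def cron_payload(expr):
    """Serialize the schedule as JSON; the 'cron' key is an assumption."""
    if not is_quartz_like(expr):
        raise ValueError(f"expected 6 or 7 cron fields, got: {expr!r}")
    return json.dumps({"cron": expr})


if __name__ == "__main__":
    # Fields: second minute hour day-of-month month day-of-week
    daily_at_six = "0 0 6 * * ?"
    # "2ABCDEFGH" is a made-up note ID used only for illustration.
    print(cron_path("2ABCDEFGH"), cron_payload(daily_at_six))
```

The field-count check catches the common mistake of pasting a five-field Unix crontab expression; it does not validate the individual fields.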

Dynamic Forms

Interactive input forms within notebook paragraphs for parameterized execution.
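
The idea behind parameterized execution can be sketched in Python. The snippet below is a simplified emulation, not Zeppelin's implementation: it assumes a text-input placeholder of the form ${name=default} inside the paragraph text and substitutes either a user-supplied value or the default. The function name, the regular expression, and the sample query are hypothetical, and other form types such as select lists and checkboxes are not handled.

```python
import re

# Matches ${name} or ${name=default}; a simplification for illustration.
_FORM = re.compile(r"\$\{(\w+)(?:=([^}]*))?\}")


def render_paragraph(template, values):
    """Replace each placeholder with the supplied value or its default."""

    def substitute(match):
        name = match.group(1)
        default = match.group(2) or ""
        return values.get(name, default)

    return _FORM.sub(substitute, template)


if __name__ == "__main__":
    query = "SELECT * FROM bank WHERE age < ${maxAge=30}"
    print(render_paragraph(query, {}))                # falls back to the default
    print(render_paragraph(query, {"maxAge": "45"}))  # uses the form input
```

In a real notebook the value comes from the input widget rendered above the paragraph, and changing it re-runs the paragraph with the new value.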

Use Cases

Interactive Data Exploration

Exploratory data analysis with Spark SQL, Python, and R in a collaborative notebook.

Data Science Prototyping

Rapid ML prototyping and model development with live results visualization.

SQL Analytics

Interactive SQL queries against Hive, Spark SQL, or any JDBC database.

Automated Reporting

Scheduled notebook execution for automated data report generation.

Integrations

Apache Spark

Native Spark interpreter for Scala, PySpark, and SparkSQL workloads.

Apache Hive

Hive JDBC and HiveQL interpreter for Hive data warehouse queries.

Apache Flink

Apache Flink interpreter for stream processing in Zeppelin notebooks.

Kubernetes

Zeppelin on Kubernetes with per-notebook pod isolation for interpreter processes.

Elasticsearch

Elasticsearch interpreter for indexing and querying Elasticsearch data.

Resources

GitHub Repository
Documentation
Portal
Getting Started
Release Notes
Support
Terms of Service