Apache Zeppelin
Apache Zeppelin is a web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala, Python, R, and more. It provides built-in data visualization, collaboration features, and interpreter integration with Apache Spark, JDBC, Python, R, Shell, and 20+ other backends. Zeppelin exposes a REST API for notebook management, interpreter configuration, and job execution. It is maintained by the Apache Software Foundation.
APIs
Apache Zeppelin REST API
The Zeppelin REST API provides HTTP endpoints for managing notebooks, notes, paragraphs, interpreters, and credentials. Key endpoints include: GET/POST /api/notebook for noteboo...
Features
Execute code in Scala, Python, R, SQL, Shell, and 20+ languages in the same notebook.
Bar, line, pie, scatter, and map charts from query results without additional tools.
Real-time collaborative editing of notebooks with user management and permissions.
Native Apache Spark interpreter for Scala, Python (PySpark), and SQL queries.
Universal JDBC interpreter for any SQL database including MySQL, PostgreSQL, Hive.
Schedule notebook paragraphs with cron expressions for automated execution.
Interactive input forms within notebook paragraphs for parameterized execution.
Use Cases
Exploratory data analysis with Spark SQL, Python, and R in a collaborative notebook.
Rapid ML prototyping and model development with live results visualization.
Interactive SQL queries against Hive, Spark SQL, or any JDBC database.
Scheduled notebook execution for automated data report generation.
Integrations
Native Spark interpreter for Scala, PySpark, and SparkSQL workloads.
Hive JDBC and HiveQL interpreter for Hive data warehouse queries.
Apache Flink interpreter for stream processing in Zeppelin notebooks.
Zeppelin on Kubernetes with per-notebook pod isolation for interpreter processes.
Elasticsearch interpreter for indexing and querying Elasticsearch data.