Apache Livy logo

Apache Livy

Apache Livy is a service that enables easy interaction with a Spark cluster over a REST interface. It allows submitting Spark jobs or snippets of Spark code, retrieving results synchronously or asynchronously, and managing Spark contexts across multiple users. Licensed under Apache 2.0.

1 APIs 1 Capabilities 7 Features
Big DataInteractive ComputingOpen SourceRESTSpark

APIs

Apache Livy REST API

The Livy REST API provides endpoints for creating and managing interactive Spark sessions, submitting batch Spark jobs, executing code statements (Python, Scala, R, SQL), and re...

Capabilities

Apache Livy Spark Job Management

Workflow capability for data engineers and data scientists to manage interactive Spark sessions and submit batch Spark jobs via Apache Livy REST API.

Run with Naftiko

Features

Interactive Spark Sessions

Create persistent Spark contexts for interactive code execution in Python, Scala, R, and SQL.

Batch Job Submission

Submit batch Spark jobs without creating an interactive session.

Multi-Language Support

Execute code in PySpark, Spark (Scala), SparkR, and SQL.

Multi-User Impersonation

Proxy user support for multi-tenant Spark cluster access.

Asynchronous Execution

Submit jobs and poll for results asynchronously.

Log Access

Retrieve driver and executor logs for debugging.

REST Interface

Simple HTTP REST API for Spark cluster interaction without native clients.

Use Cases

Notebook Integration

Power Jupyter, Zeppelin, and other notebooks with Spark backends via Livy.

Data Engineering Pipelines

Submit Spark batch jobs from orchestration tools like Airflow and Oozie.

Interactive Data Exploration

Execute ad-hoc Spark code for exploratory data analysis.

Multi-Tenant Spark Access

Enable multiple users to share a Spark cluster with isolation via Livy sessions.

Integrations

Apache Spark

Livy requires a Spark cluster and acts as the REST gateway to Spark.

Apache Zeppelin

Zeppelin notebook backend using Livy for distributed Spark execution.

Jupyter Notebook

Jupyter sparkmagic extension uses Livy for remote Spark kernel access.

Apache Airflow

Airflow LivyOperator for submitting Spark batch jobs from DAGs.

Amazon EMR

Livy is available as an EMR application for REST-based Spark access.

Semantic Vocabularies

Apache Livy Rest Api Context

12 classes · 31 properties

JSON-LD

API Governance Rules

Apache Livy API Rules

13 rules · 9 errors 3 warnings 1 info

SPECTRAL

Resources

👥
GitHubOrganization
GitHubOrganization
👥
GitHubRepository
GitHubRepository
🔗
Documentation
Documentation
🚀
GettingStarted
GettingStarted
📜
TermsOfService
TermsOfService
🔗
Versioning
Versioning
🔗
SpectralRules
SpectralRules
🔗
Vocabulary
Vocabulary
🔗
NaftikoCapability
NaftikoCapability