Apache Hadoop
Apache Hadoop is an open-source framework for distributed storage and processing of large datasets across clusters of computers using simple programming models. It includes HDFS for distributed storage, YARN for resource management, and MapReduce for parallel data processing.
4 APIs
0 Features
Big DataData ProcessingDistributed ComputingHDFSMapReduceOpen Source
APIs
HDFS REST API (WebHDFS)
RESTful API for Hadoop Distributed File System operations including file operations, directory operations, and file status queries.
YARN REST API
RESTful API for Yet Another Resource Negotiator (YARN) for cluster resource management, application submission, and monitoring.
MapReduce History Server REST API
REST API for accessing MapReduce job history and statistics.
HttpFS REST API
HTTP REST API gateway supporting both webhdfs and httpfs operations for HDFS access.
Resources
🔗
Website
Website
🔗
Documentation
Documentation
🚀
Getting Started
Getting Started
👥
GitHub Organization
GitHub Organization
🔗
Community
Community
📄
Change Log
Change Log
📜
Terms of Service
Terms of Service