Ray
Ray is an open-source unified compute framework, stewarded by Anyscale, that scales Python and AI workloads from a laptop to a cluster. It consists of Ray Core (a distributed runtime) and a set of AI libraries (Ray Train, Ray Data, Ray Tune, Ray Serve, RLlib) for training, batch inference, hyperparameter search, and model serving. Ray clusters expose a Dashboard and Jobs REST API on the head node (default port 8265) for submitting jobs, inspecting actors and tasks, and serving deployed applications via Ray Serve HTTP endpoints.
APIs
Ray Jobs REST API
REST API on the Ray head node for submitting, listing, inspecting, and stopping Ray jobs, plus streaming logs. Default base URL is http://
Ray Dashboard API
Internal REST API powering the Ray Dashboard, exposing endpoints for nodes, actors, tasks, placement groups, runtime environments, and cluster events. Same base URL as the Jobs ...
Ray Serve HTTP API
HTTP interface for invoking models and applications deployed via Ray Serve. Each deployed application is exposed as an HTTP endpoint on the Serve HTTP proxy (default port 8000);...