CoreWeave
CoreWeave is a specialized GPU cloud purpose-built for AI workloads, offering managed Kubernetes (CKS), Slurm-on-Kubernetes (SUNK), dedicated and serverless inference, AI Object Storage, distributed VAST file storage, HPC InfiniBand interconnect, and a Sandbox product. CoreWeave's control plane is Kubernetes-native and exposes APIs for CKS clusters, Inference deployments and gateways, VPCs, Object Storage, and Sandbox control.
CoreWeave publishes 5 APIs on the APIs.io network. Tagged areas include AI, Cloud, GPU, HPC, and Inference.
CoreWeave’s developer surface includes documentation, developer portal, pricing, engineering blog, support, SDKs, and 9 more developer resources.
APIs
CoreWeave Kubernetes Service API
The CKS API provisions and manages CoreWeave Kubernetes Service clusters and node pools on bare-metal GPU and CPU hardware. It exposes operations for cluster lifecycle, node poo...
CoreWeave Inference API
The CoreWeave Inference API manages Deployments, Gateways, and Capacity Claims for serverless and dedicated AI inference. It is used to create, update, and route to managed mode...
CoreWeave VPC API
The VPC API creates and manages Virtual Private Clouds on CoreWeave, including network configuration, routing, and isolation for CKS clusters and other compute resources.
CoreWeave AI Object Storage API
CoreWeave AI Object Storage (CAIOS) is an S3-compatible object storage service optimized for AI dataset and model storage. It supports standard S3 operations alongside CoreWeave...
CoreWeave Sandbox Control Plane API
The Sandbox Control Plane API provisions ephemeral compute sandboxes for short-lived, isolated workloads on CoreWeave infrastructure.
Features
Managed Kubernetes on bare-metal GPU and CPU nodes for training, inference, and HPC.
Slurm running on Kubernetes for batch and burst training workloads alongside K8s services.
Dedicated and serverless inference offerings with managed deployments, gateways, and capacity claims.
S3-compatible object storage purpose-built for AI dataset and model workloads.
High-performance VAST Data file storage for large-scale training pipelines.
InfiniBand-based HPC fabric with GPUDirect RDMA for multi-node training.
Ephemeral compute sandboxes for short-lived, isolated workloads.
Integrations
Native Kubernetes API surface across CKS clusters with standard kubectl and Helm workflows.
Slurm workload manager integrated with Kubernetes through SUNK.
Official CoreWeave Terraform provider for CKS clusters, VPCs, and object storage buckets.
NVIDIA GPU Operator and InfiniBand fabric integration for accelerated workloads.