Nebius
Nebius is an AI-focused cloud platform spun out of Yandex, offering NVIDIA GPU virtual machines and clusters (GB300, GB200, B300, B200, H200, H100) connected over InfiniBand, managed Kubernetes and Slurm (Soperator), S3-compatible storage, managed PostgreSQL, container registry, MLflow, JupyterLab, vLLM, and the Token Factory inference platform. Nebius exposes a gRPC control plane API, a Terraform provider, a nebius CLI, and Go and TypeScript SDKs.
Nebius publishes 6 APIs on the APIs.io network. Tagged areas include AI, Cloud, Compute, GPU, and HPC.
Nebius’ developer surface includes documentation, developer portal, pricing, engineering blog, support, SDKs, and 10 more developer resources.
APIs
Nebius Compute API
The Nebius Compute API provisions and manages virtual machines and GPU clusters with NVIDIA GPUs and InfiniBand interconnect for ML and AI workloads. Exposed over gRPC and acces...
Nebius Managed Kubernetes API
The Managed Kubernetes API provisions Kubernetes clusters with GPU and InfiniBand support for distributed training and inference workloads.
Nebius Storage API
The Nebius Storage API exposes AWS S3-compatible object storage for ML/AI datasets and model artifacts.
Nebius IAM API
The Nebius Identity and Access Management API controls users, service accounts, projects, and resource-level access policies.
Nebius Managed Applications API
The Managed Applications API deploys and manages JupyterLab, vLLM, Open WebUI, MLflow, and other ready-made apps on Nebius infrastructure.
Nebius Token Factory
Nebius Token Factory is the AI model inference platform offering OpenAI-compatible endpoints for serving open-source LLMs on Nebius GPU infrastructure.
Features
Virtual machines and clusters with NVIDIA GB300, GB200, B300, B200, H200, and H100 GPUs.
High-bandwidth InfiniBand interconnect for large-scale distributed training.
Kubernetes clusters with GPU and InfiniBand support.
Slurm workload manager running on Kubernetes via the open-source Soperator project.
Object storage optimized for ML datasets and model artifacts.
One-click JupyterLab, vLLM, Open WebUI, and MLflow deployments.
OpenAI-compatible inference endpoints for open-source LLMs.
Integrations
Official Terraform provider for Nebius infrastructure.
Standard Kubernetes API across managed clusters.
Slurm workload manager via the open-source Soperator project.
Managed MLflow for experiment tracking.
Managed PostgreSQL database clusters.