Scale AI logo

Scale AI

Scale AI is the data engine for AI. The company turns raw data into training data by combining ML-powered pre-labeling with multi-tier human review, and ships an extensive REST API and SDKs for managing labeling, evaluation, and generative-AI data pipelines. The product portfolio spans the Scale Data Engine (foundational labeling and review), the GenAI Data Engine (data for foundation-model training and tuning), the Scale GenAI Platform (deployment and orchestration for generative AI), the Automotive Data Engine (LiDAR, sensor fusion, customer dashboards, Nucleus), and Donovan (Scale's defense / public-sector AI product). The REST API lives at api.scale.com/v1, supports live and sandbox modes, and is wrapped by official Python (scaleapi) and JavaScript (scaleapi) SDKs. The company serves enterprise, insurance, healthcare, and U.S. and global public-sector verticals.

6 APIs 11 Features
Data EngineLabelingRLHFGenAI PlatformDonovanDefense AILiDARSensor FusionREST API

Scale AI publishes 6 APIs on the APIs.io network. Tagged areas include Data Engine, Labeling, RLHF, GenAI Platform, and Donovan.

Scale AI’s developer surface includes developer portal, documentation, API reference, getting-started guide, authentication, engineering blog, signup flow, and 6 more developer resources.

APIs

Scale REST API

The Scale REST API is the unified programmatic surface for Scale's data engine. It is built on REST principles with resource-oriented URLs, form-encoded request bodies, JSON res...

Scale GenAI Data Engine

The GenAI Data Engine is Scale's product surface for generating, curating, and reviewing data used to train and tune generative-AI foundation models, including RLHF, SFT, evalua...

Scale GenAI Platform

The Scale GenAI Platform is the deployment and orchestration product for generative-AI applications, used by enterprise and public-sector customers to deliver agentic and genera...

Scale Automotive Data Engine

Scale's Automotive Data Engine covers autonomy-grade data needs including LiDAR labeling, sensor fusion, multi-stage annotation, the customer dashboard, data hosting, and Nucleu...

Scale Nucleus

Nucleus is Scale's dataset management product for browsing, querying, and curating ML datasets at scale.

Scale Donovan

Donovan is Scale's AI platform for defense and public-sector use cases, delivering decision-support and analytic capabilities to U.S. and allied government customers.

Features

REST API at api.scale.com/v1

Resource-oriented REST API with JSON responses, live and sandbox modes, and versioned v1 endpoints.

Tasks API

Create, retrieve, cancel, and tag individual labeling tasks with unique identifiers and metadata.

Batches API

Create, finalize, prioritize, list, and retrieve status for batches of tasks.

Projects API

Create and manage labeling projects, including taxonomy service management.

Specialized Annotation

Image and video, sensor fusion, LiDAR, and multi-stage annotation task types.

GenAI Data Engine

RLHF, SFT, evaluation, and red-team data for generative AI foundation models.

GenAI Platform

Deployment and orchestration product for enterprise and public-sector generative-AI workflows.

Donovan

Scale's defense and public-sector AI product line.

Nucleus

Dataset management for browsing, querying, and curating ML datasets.

Cloud Storage Integration

Integrates with AWS S3, Azure, and Google Cloud Storage for data ingest and delivery.

Callbacks

Asynchronous task completion callbacks and secure result URLs.

Use Cases

Foundation Model Training Data

RLHF, SFT, evaluation, and red-team datasets for frontier model labs.

Autonomous Vehicle Data

LiDAR, camera, and sensor-fusion labeling for AV programs.

Enterprise GenAI Deployment

Build and deploy generative-AI applications on the GenAI Platform.

Public Sector Decision Support

Deliver Donovan-based analytic and decision-support workflows to defense and government customers.

Dataset Curation

Browse, query, and curate ML datasets at scale with Nucleus.

Integrations

AWS S3, Azure Blob, Google Cloud Storage

Cloud storage ingest and delivery for labeling jobs.

Python SDK (scaleapi)

Official Python client published on PyPI.

JavaScript SDK (scaleapi)

Official Node.js client published on npm.

Sandbox Mode

Test integrations safely against a sandbox environment that mirrors live behavior.

Resources

🌐
Portal
Portal
🔗
Documentation
Documentation
🔗
APIReference
APIReference
🚀
GettingStarted
GettingStarted
🔑
Authentication
Authentication
📰
Blog
Blog
📝
SignUp
SignUp
🌐
Console
Console
💰
Pricing
Pricing
📦
SDK
SDK
📦
SDK
SDK
💬
Support
Support
🔗
LinkedIn
LinkedIn

Sources

apis.yml Raw ↑
aid: scale-ai
name: Scale AI
description: Scale AI is the data engine for AI. The company turns raw data into training data by combining ML-powered pre-labeling
  with multi-tier human review, and ships an extensive REST API and SDKs for managing labeling, evaluation, and generative-AI
  data pipelines. The product portfolio spans the Scale Data Engine (foundational labeling and review), the GenAI Data Engine
  (data for foundation-model training and tuning), the Scale GenAI Platform (deployment and orchestration for generative AI),
  the Automotive Data Engine (LiDAR, sensor fusion, customer dashboards, Nucleus), and Donovan (Scale's defense / public-sector
  AI product). The REST API lives at api.scale.com/v1, supports live and sandbox modes, and is wrapped by official Python
  (scaleapi) and JavaScript (scaleapi) SDKs. The company serves enterprise, insurance, healthcare, and U.S. and global public-sector
  verticals.
type: Index
image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
tags:
- Data Engine
- Labeling
- RLHF
- GenAI Platform
- Donovan
- Defense AI
- LiDAR
- Sensor Fusion
- REST API
url: https://raw.githubusercontent.com/api-evangelist/scale-ai/refs/heads/main/apis.yml
created: '2026-05-23'
modified: '2026-05-23'
specificationVersion: '0.19'
apis:
- aid: scale-ai:scale-rest-api
  name: Scale REST API
  description: The Scale REST API is the unified programmatic surface for Scale's data engine. It is built on REST principles
    with resource-oriented URLs, form-encoded request bodies, JSON responses, and standard HTTP status codes. Endpoints cover
    tasks (create, retrieve, cancel, set metadata, manage tags and unique identifiers), batches (create, finalize, list,
    prioritize, retrieve status), projects (creation and management), specialized annotation tasks (image and video, sensor
    fusion, LiDAR, multi-stage tasks), data retrieval and downloads, taxonomy service management, and callbacks for async
    operations. The API is versioned (v1), supports live and sandbox modes, and operates on one object per request (no bulk
    updates).
  humanURL: https://api-reference.scale.com
  baseURL: https://api.scale.com
  tags:
  - REST API
  - Labeling
  - Tasks
  - Batches
  properties:
  - type: Documentation
    url: https://scale.com/docs/
  - type: APIReference
    url: https://api-reference.scale.com
  - type: Authentication
    url: https://api-reference.scale.com/docs/api-reference/authentication.md
  - type: GettingStarted
    url: https://api-reference.scale.com/docs/api-reference/introduction-to-scale-api.md
- aid: scale-ai:scale-genai-data-engine
  name: Scale GenAI Data Engine
  description: The GenAI Data Engine is Scale's product surface for generating, curating, and reviewing data used to train
    and tune generative-AI foundation models, including RLHF, SFT, evaluation, and red-team data.
  humanURL: https://scale.com/docs/
  baseURL: https://api.scale.com
  tags:
  - GenAI
  - RLHF
  - SFT
  - Foundation Models
  properties:
  - type: Documentation
    url: https://scale.com/docs/
  - type: APIReference
    url: https://api-reference.scale.com
- aid: scale-ai:scale-genai-platform
  name: Scale GenAI Platform
  description: The Scale GenAI Platform is the deployment and orchestration product for generative-AI applications, used
    by enterprise and public-sector customers to deliver agentic and generative workflows on top of Scale's data engine.
  humanURL: https://scale.com/docs/
  tags:
  - GenAI Platform
  - Agentic AI
  - Enterprise
  properties:
  - type: Documentation
    url: https://scale.com/docs/
- aid: scale-ai:scale-automotive-data-engine
  name: Scale Automotive Data Engine
  description: Scale's Automotive Data Engine covers autonomy-grade data needs including LiDAR labeling, sensor fusion, multi-stage
    annotation, the customer dashboard, data hosting, and Nucleus for dataset management.
  humanURL: https://scale.com/docs/
  baseURL: https://api.scale.com
  tags:
  - Automotive
  - LiDAR
  - Sensor Fusion
  - Nucleus
  properties:
  - type: Documentation
    url: https://scale.com/docs/
- aid: scale-ai:scale-nucleus
  name: Scale Nucleus
  description: Nucleus is Scale's dataset management product for browsing, querying, and curating ML datasets at scale.
  humanURL: https://scale.com/docs/
  tags:
  - Nucleus
  - Dataset Management
  - ML Ops
  properties:
  - type: Documentation
    url: https://scale.com/docs/
- aid: scale-ai:scale-donovan
  name: Scale Donovan
  description: Donovan is Scale's AI platform for defense and public-sector use cases, delivering decision-support and analytic
    capabilities to U.S. and allied government customers.
  humanURL: https://scale.com/donovan
  tags:
  - Donovan
  - Defense
  - Public Sector
  properties:
  - type: Documentation
    url: https://scale.com/donovan
common:
- type: Portal
  url: https://scale.com
- type: Documentation
  url: https://scale.com/docs/
- type: APIReference
  url: https://api-reference.scale.com
- type: GettingStarted
  url: https://api-reference.scale.com/docs/api-reference/introduction-to-scale-api.md
- type: Authentication
  url: https://api-reference.scale.com/docs/api-reference/authentication.md
- type: Blog
  url: https://scale.com/blog
- type: SignUp
  url: https://dashboard.scale.com
- type: Console
  url: https://dashboard.scale.com
- type: Pricing
  url: https://scale.com
- type: SDK
  url: https://pypi.org/project/scaleapi/
  name: Scale Python SDK (scaleapi on PyPI)
- type: SDK
  url: https://www.npmjs.com/package/scaleapi
  name: Scale JavaScript SDK (scaleapi on npm)
- type: Support
  url: https://scale.com
- type: LinkedIn
  url: https://www.linkedin.com/company/scaleai
- type: Features
  data:
  - name: REST API at api.scale.com/v1
    description: Resource-oriented REST API with JSON responses, live and sandbox modes, and versioned v1 endpoints.
  - name: Tasks API
    description: Create, retrieve, cancel, and tag individual labeling tasks with unique identifiers and metadata.
  - name: Batches API
    description: Create, finalize, prioritize, list, and retrieve status for batches of tasks.
  - name: Projects API
    description: Create and manage labeling projects, including taxonomy service management.
  - name: Specialized Annotation
    description: Image and video, sensor fusion, LiDAR, and multi-stage annotation task types.
  - name: GenAI Data Engine
    description: RLHF, SFT, evaluation, and red-team data for generative AI foundation models.
  - name: GenAI Platform
    description: Deployment and orchestration product for enterprise and public-sector generative-AI workflows.
  - name: Donovan
    description: Scale's defense and public-sector AI product line.
  - name: Nucleus
    description: Dataset management for browsing, querying, and curating ML datasets.
  - name: Cloud Storage Integration
    description: Integrates with AWS S3, Azure, and Google Cloud Storage for data ingest and delivery.
  - name: Callbacks
    description: Asynchronous task completion callbacks and secure result URLs.
- type: UseCases
  data:
  - name: Foundation Model Training Data
    description: RLHF, SFT, evaluation, and red-team datasets for frontier model labs.
  - name: Autonomous Vehicle Data
    description: LiDAR, camera, and sensor-fusion labeling for AV programs.
  - name: Enterprise GenAI Deployment
    description: Build and deploy generative-AI applications on the GenAI Platform.
  - name: Public Sector Decision Support
    description: Deliver Donovan-based analytic and decision-support workflows to defense and government customers.
  - name: Dataset Curation
    description: Browse, query, and curate ML datasets at scale with Nucleus.
- type: Integrations
  data:
  - name: AWS S3, Azure Blob, Google Cloud Storage
    description: Cloud storage ingest and delivery for labeling jobs.
  - name: Python SDK (scaleapi)
    description: Official Python client published on PyPI.
  - name: JavaScript SDK (scaleapi)
    description: Official Node.js client published on npm.
  - name: Sandbox Mode
    description: Test integrations safely against a sandbox environment that mirrors live behavior.
maintainers:
- FN: Kin Lane
  url: http://apievangelist.com
  email: [email protected]