Baseten logo

Baseten

Baseten is a production inference platform for deploying and serving custom and pre-trained ML models. Offers a Model APIs catalog with OpenAI-compatible endpoints (DeepSeek, Qwen, GLM, Nemotron), dedicated deployments via Truss, autoscaling GPU compute, async/queue inference, training, chains (multi-model workflows), and management APIs.

3 APIs 0 Features
AIMLInferenceDeploymentMLOpsOpenAI CompatibleAnthropic CompatibleTruss

APIs

Baseten LLM Inference API

OpenAI-compatible chat completions for Baseten's Model APIs catalog (DeepSeek V4, Qwen, GLM, Nemotron, etc.). Per-million-token pricing.

Baseten Anthropic-Compatible Messages API

Anthropic Messages-compatible inference for compatible Model APIs models.

Baseten Management & Async API

Deployment management, async/queued inference, chain calls (multi-model workflows), training, dedicated-deployment lifecycle, async result polling, and webhook delivery.

Resources

🔗
Website
Website
🔗
Documentation
Documentation
💰
Pricing
Pricing
🔗
Plans
Plans
🔗
RateLimits
RateLimits
🔗
FinOps
FinOps

Sources

Raw ↑
aid: baseten
url: https://raw.githubusercontent.com/api-evangelist/baseten/refs/heads/main/apis.yml
name: Baseten
x-type: company
description: >-
  Baseten is a production inference platform for deploying and serving custom and pre-trained
  ML models. Offers a Model APIs catalog with OpenAI-compatible endpoints (DeepSeek, Qwen,
  GLM, Nemotron), dedicated deployments via Truss, autoscaling GPU compute, async/queue
  inference, training, chains (multi-model workflows), and management APIs.
image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
tags:
  - AI
  - ML
  - Inference
  - Deployment
  - MLOps
  - OpenAI Compatible
  - Anthropic Compatible
  - Truss
created: '2026-05-08'
modified: '2026-05-08'
specificationVersion: '0.19'
apis:
  - aid: baseten:llm-inference
    name: Baseten LLM Inference API
    description: >-
      OpenAI-compatible chat completions for Baseten's Model APIs catalog (DeepSeek V4,
      Qwen, GLM, Nemotron, etc.). Per-million-token pricing.
    image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    humanURL: https://docs.baseten.co/reference/inference-api/llm-openapi-spec
    baseURL: https://inference.baseten.co/v1
    tags:
      - AI
      - LLM
      - Chat Completions
      - OpenAI Compatible
    properties:
      - type: Documentation
        url: https://docs.baseten.co/
      - type: OpenAPI
        url: openapi/baseten-llm-openapi.json
      - type: Pricing
        url: https://www.baseten.co/pricing/
  - aid: baseten:messages
    name: Baseten Anthropic-Compatible Messages API
    description: >-
      Anthropic Messages-compatible inference for compatible Model APIs models.
    image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    humanURL: https://docs.baseten.co/reference/inference-api/messages-openapi-spec
    baseURL: https://inference.baseten.co
    tags:
      - AI
      - LLM
      - Anthropic Compatible
      - Messages
    properties:
      - type: Documentation
        url: https://docs.baseten.co/
      - type: OpenAPI
        url: openapi/baseten-messages-openapi.json
  - aid: baseten:management
    name: Baseten Management & Async API
    description: >-
      Deployment management, async/queued inference, chain calls (multi-model workflows),
      training, dedicated-deployment lifecycle, async result polling, and webhook delivery.
    image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
    humanURL: https://docs.baseten.co/api-reference
    baseURL: https://api.baseten.co
    tags:
      - Management
      - Async
      - Webhooks
      - Deployment
      - Training
    properties:
      - type: Documentation
        url: https://docs.baseten.co/api-reference
      - type: TrussCLI
        url: https://docs.baseten.co/
common:
  - type: Website
    url: https://www.baseten.co/
  - type: Documentation
    url: https://docs.baseten.co/
  - type: Pricing
    url: https://www.baseten.co/pricing/
  - type: Plans
    url: plans/baseten-plans-pricing.yml
  - type: RateLimits
    url: rate-limits/baseten-rate-limits.yml
  - type: FinOps
    url: finops/baseten-finops.yml
maintainers:
  - FN: Kin Lane
    email: [email protected]