Cloudflare AI Gateway is a managed LLM proxy that sits in front of 23+ AI providers (OpenAI, Anthropic, Google AI Studio, Google Vertex AI, Amazon Bedrock, Azure OpenAI, Workers AI, Mistral, Cohere, Groq, DeepSeek, Cerebras, xAI, Perplexity, Replicate, HuggingFace, OpenRouter, ElevenLabs, Deepgram, Cartesia, Ideogram, Fal AI, Baseten, Parallel) and provides analytics, request and error logging, response caching, rate limiting, request retries, model fallback, guardrails, and evaluations. A unified REST API launched May 21, 2026 lets developers call any model through a single endpoint. The gateway integrates with Workers AI, the Secrets Store, and Cloudflare CASB's Claude Compliance API support. This is a standalone product profile; the broader Cloudflare provider profile lives at github.com/api-evangelist/cloudflare.
4 APIs10 Features
AI GatewayLLMObservabilityCachingRate LimitingWorkers AICloudflare
Cloudflare AI Gateway publishes 4 APIs on the APIs.io network. Tagged areas include AI Gateway, LLM, Observability, Caching, and Rate Limiting.
Cloudflare AI Gateway’s developer surface includes developer portal, documentation, getting-started guide, API reference, authentication, changelog, engineering blog, and 11 more developer resources.
The AI Gateway proxy endpoint accepts requests in each provider's native API format and forwards them through Cloudflare's edge with analytics, caching, retries, rate limiting, ...
Unified REST API launched May 21, 2026 that lets developers call any supported model through a single endpoint instead of formatting requests for each provider individually. Sit...
The Cloudflare API surface for managing AI Gateway resources — creating gateways, listing them, retrieving request logs, and configuring caching, rate limiting, and authenticati...
aid: cloudflare-ai-gateway
name: Cloudflare AI Gateway
description: Cloudflare AI Gateway is a managed LLM proxy that sits in front of 23+ AI providers (OpenAI, Anthropic, Google
AI Studio, Google Vertex AI, Amazon Bedrock, Azure OpenAI, Workers AI, Mistral, Cohere, Groq, DeepSeek, Cerebras, xAI, Perplexity,
Replicate, HuggingFace, OpenRouter, ElevenLabs, Deepgram, Cartesia, Ideogram, Fal AI, Baseten, Parallel) and provides analytics,
request and error logging, response caching, rate limiting, request retries, model fallback, guardrails, and evaluations.
A unified REST API launched May 21, 2026 lets developers call any model through a single endpoint. The gateway integrates
with Workers AI, the Secrets Store, and Cloudflare CASB's Claude Compliance API support. This is a standalone product profile;
the broader Cloudflare provider profile lives at github.com/api-evangelist/cloudflare.
type: Index
image: https://kinlane-productions.s3.amazonaws.com/apis-json/apis-json-logo.jpg
tags:
- AI Gateway
- LLM
- Observability
- Caching
- Rate Limiting
- Workers AI
- Cloudflare
url: https://raw.githubusercontent.com/api-evangelist/cloudflare-ai-gateway/refs/heads/main/apis.yml
created: '2026-05-23'
modified: '2026-05-23'
specificationVersion: '0.19'
apis:
- aid: cloudflare-ai-gateway:cloudflare-ai-gateway-proxy
name: Cloudflare AI Gateway Proxy
description: The AI Gateway proxy endpoint accepts requests in each provider's native API format and forwards them through
Cloudflare's edge with analytics, caching, retries, rate limiting, fallback, and guardrails applied. URL pattern is https://gateway.ai.cloudflare.com/v1/{account_id}/{gateway_id}/{provider}.
Authentication uses a Cloudflare API token with AI Gateway Read/Edit and Workers AI Read permissions; provider credentials
can be sent as headers (BYOK) or stored via Unified Billing.
humanURL: https://developers.cloudflare.com/ai-gateway/
baseURL: https://gateway.ai.cloudflare.com
tags:
- LLM Proxy
- REST API
- Multi-Provider
properties:
- type: Documentation
url: https://developers.cloudflare.com/ai-gateway/
- type: GettingStarted
url: https://developers.cloudflare.com/ai-gateway/get-started/
- type: APIReference
url: https://developers.cloudflare.com/ai-gateway/usage/providers/
- type: Authentication
url: https://developers.cloudflare.com/ai-gateway/get-started/
- aid: cloudflare-ai-gateway:cloudflare-ai-gateway-unified-api
name: Cloudflare AI Gateway Unified REST API
description: Unified REST API launched May 21, 2026 that lets developers call any supported model through a single endpoint
instead of formatting requests for each provider individually. Sits alongside the provider-native proxy mode and uses
the same gateway routing, caching, observability, and guardrails.
humanURL: https://developers.cloudflare.com/ai-gateway/
baseURL: https://gateway.ai.cloudflare.com
tags:
- Unified API
- LLM
- Multi-Provider
properties:
- type: Documentation
url: https://developers.cloudflare.com/ai-gateway/
- type: APIReference
url: https://developers.cloudflare.com/ai-gateway/usage/providers/
- aid: cloudflare-ai-gateway-management-api
name: Cloudflare AI Gateway Management API
description: The Cloudflare API surface for managing AI Gateway resources — creating gateways, listing them, retrieving
request logs, and configuring caching, rate limiting, and authentication. Exposed under the standard Cloudflare REST API
at api.cloudflare.com.
humanURL: https://developers.cloudflare.com/api/resources/ai_gateway/
baseURL: https://api.cloudflare.com
tags:
- Management API
- REST API
- Configuration
properties:
- type: Documentation
url: https://developers.cloudflare.com/api/resources/ai_gateway/
- type: APIReference
url: https://developers.cloudflare.com/api/resources/ai_gateway/
- aid: cloudflare-ai-gateway:ai-gateway-mcp
name: AI Gateway MCP Server
description: Cloudflare-hosted remote MCP server that exposes AI Gateway control-plane operations to MCP-compatible AI agents.
humanURL: https://ai-gateway.mcp.cloudflare.com/mcp
tags:
- MCP
- Agentic AI
properties:
- type: MCPServer
url: https://ai-gateway.mcp.cloudflare.com/mcp
common:
- type: Portal
url: https://developers.cloudflare.com/ai-gateway/
- type: Documentation
url: https://developers.cloudflare.com/ai-gateway/
- type: GettingStarted
url: https://developers.cloudflare.com/ai-gateway/get-started/
- type: APIReference
url: https://developers.cloudflare.com/ai-gateway/usage/providers/
- type: Authentication
url: https://developers.cloudflare.com/fundamentals/api/get-started/create-token/
- type: ChangeLog
url: https://developers.cloudflare.com/changelog/
- type: Blog
url: https://blog.cloudflare.com/
- type: SignUp
url: https://dash.cloudflare.com/sign-up
- type: Console
url: https://dash.cloudflare.com/
- type: Pricing
url: https://www.cloudflare.com/plans/
- type: SDK
url: https://github.com/cloudflare/cloudflare-typescript
name: Cloudflare TypeScript SDK
- type: SDK
url: https://github.com/cloudflare/cloudflare-python
name: Cloudflare Python SDK
- type: SDK
url: https://github.com/cloudflare/cloudflare-go
name: Cloudflare Go SDK
- type: CLI
url: https://developers.cloudflare.com/workers/wrangler/
name: Wrangler CLI
- type: MCPServer
url: https://ai-gateway.mcp.cloudflare.com/mcp
name: AI Gateway MCP Server
- type: GitHubOrganization
url: https://github.com/cloudflare
- type: StatusPage
url: https://www.cloudflarestatus.com/
- type: Support
url: https://support.cloudflare.com/
- type: Features
data:
- name: 23+ Provider Coverage
description: OpenAI, Anthropic, Google AI Studio, Google Vertex AI, Amazon Bedrock, Azure OpenAI, Workers AI, Mistral,
Cohere, Groq, DeepSeek, Cerebras, xAI, Perplexity, Replicate, HuggingFace, OpenRouter, ElevenLabs, Deepgram, Cartesia,
Ideogram, Fal AI, Baseten, Parallel.
- name: Unified REST API
description: Single endpoint that can call any supported model (launched May 21, 2026), alongside provider-native pass-through
mode.
- name: Analytics and Logging
description: Per-request analytics for tokens, cost, latency, and error rates with full request and response logging.
- name: Edge Caching
description: Cache LLM responses at Cloudflare's edge to cut latency and provider spend.
- name: Rate Limiting
description: Enforce request and token rate limits per gateway, per application.
- name: Retries and Model Fallback
description: Automatically retry failing calls and fall back to alternate models or providers.
- name: Guardrails and Evaluations
description: Apply guardrails to prompts and responses and run evaluations against captured traffic.
- name: BYOK and Unified Billing
description: Bring your own provider keys per gateway or route spend through Cloudflare's Unified Billing.
- name: Workers AI Integration
description: Native integration with Cloudflare Workers AI models via the cf-aig-gateway-id header.
- name: Secrets Store Integration
description: Provider keys can be sourced from Cloudflare's Secrets Store rather than embedded in client code.
- type: UseCases
data:
- name: Observe LLM Traffic
description: Capture per-request tokens, latency, and cost across every provider in one place.
- name: Cut LLM Costs
description: Cache repeated responses at the edge and rate limit runaway workloads.
- name: Fail Over Between Providers
description: Configure retries and model fallback so a single provider outage does not take down the app.
- name: Govern Prompts and Responses
description: Apply guardrails and run evaluations against production AI traffic.
- name: Unify Multi-Provider Access
description: Use a single REST endpoint to address any supported model from any provider.
- type: Integrations
data:
- name: Workers AI
description: First-class integration with Cloudflare Workers AI inference catalog.
- name: Cloudflare Workers
description: Call AI Gateway directly from Workers using AI bindings configured in wrangler.jsonc.
- name: Cloudflare Secrets Store
description: Store and rotate provider API keys without redeploying.
- name: Cloudflare CASB - Claude Compliance API
description: CASB support for Anthropic's Claude Compliance API announced May 2026.
- name: Claude Managed Agents on Cloudflare
description: Partnership offering announced May 19, 2026 that runs Anthropic-managed Claude agents on Cloudflare with
AI Gateway in the path.
- name: MCP Server
description: Remote MCP server at ai-gateway.mcp.cloudflare.com/mcp for AI agent access to AI Gateway control plane.
maintainers:
- FN: Kin Lane
url: http://apievangelist.com
email: [email protected]