fal logo

fal

fal (Features and Labels, Inc.) is a generative media platform providing the world's fastest API for running image, video, audio, and multimodal generative AI models. Through a unified queue-based REST API at https://queue.fal.run, plus realtime WebSocket and SSE streaming surfaces, fal serves 1,000+ production models — including FLUX, Veo 3, Kling, Wan, Seedream, Nano Banana, and Stable Diffusion — on autoscaling GPU infrastructure. fal Serverless lets developers ship custom models with `@fal.function` / `fal.App` / BYO containers, while fal Compute provides dedicated H100/H200/A100/B200 instances. Trusted by Canva, Perplexity, Poe, and 1.5M+ developers; Series D funded ($140M, Sequoia-led, December 2025); SOC 2 with 99.99% uptime.

9 APIs 7 Capabilities 18 Features
AIArtificial IntelligenceGenerative AIGenerative MediaImage GenerationVideo GenerationAudio GenerationInferenceServerlessGPUMCP

fal publishes 3 APIs on the APIs.io network: Model APIs, Storage API, and Serverless Platform API. Tagged areas include AI, Artificial Intelligence, Generative AI, Generative Media, and Image Generation.

The fal catalog on APIs.io includes 7 machine-runnable capabilities, 1 JSON-LD context, and 1 Spectral governance ruleset.

fal’s developer surface includes developer portal, documentation, getting-started guide, engineering blog, signup flow, pricing, support, and 33 more developer resources.

APIs

fal Model APIs

Unified queue-based REST API for invoking 1,000+ generative image, video, audio, and multimodal models hosted on fal's inference infrastructure. Submit a request to `https://que...

fal Realtime API

WebSocket-based realtime inference for ultra-low latency interactive generative experiences such as LCM/SDXL sketch-to-image, live-portrait, and realtime upscaling. Bi-direction...

fal Streaming API

HTTP streaming endpoint (`/{model-id}/stream`) that emits progressive partial outputs as a model runs — used for LLM/VLM token streams, incremental video frames, and step-by-ste...

fal Storage API

REST endpoints for uploading binary inputs (images, audio clips, reference frames, control maps) to fal's CDN so they can be referenced by URL when invoking model APIs. Issues s...

fal Serverless Platform API

Programmatic management of custom fal Serverless applications — list, inspect, deploy, scale, and monitor user-defined GPU functions deployed with `@fal.function`, `fal.App`, or...

fal Models Catalog API

Read-only discovery endpoints for browsing fal's 1,000+ production model catalog, including model metadata, capability tags, pricing per output, supported parameters, example in...

fal Compute API

Provision and manage dedicated GPU instances (H100, H200, A100, B200) with full SSH access for training, fine-tuning, and persistent workloads. Hourly or per-second billing with...

fal API Keys API

Manage fal API keys — create, list, scope, and revoke keys used to authenticate against the Model, Storage, Serverless, and Compute APIs via the Authorization: Key $FAL_KEY header.

fal Usage and Billing API

Programmatic access to usage metrics, per-model spend, GPU-second consumption, and invoicing history. Surfaces the same data shown on the fal dashboard so platform teams can pip...

Capabilities

fal Compute API — Instances

fal Compute API — Instances. Provision and manage dedicated GPU instances (H100, H200, A100, B200) with full SSH access for training, fine-tuning, and persistent workloads.

Run with Naftiko

fal API Keys API — Management

fal API Keys API — Management. Create, list, scope, and revoke API keys used to authenticate against fal's Model, Storage, Serverless, and Compute APIs.

Run with Naftiko

fal Model APIs — Queue

fal Model APIs — Queue. 4 operations (submit, status, result, cancel) for invoking 1,000+ generative models through the unified https://queue.fal.run REST surface. Self-containe...

Run with Naftiko

fal Models Catalog API — Discovery

fal Models Catalog API — Discovery. Read-only listing of fal's 1,000+ generative model gallery including pricing, parameters, and per-model OpenAPI schemas.

Run with Naftiko

fal Serverless Platform API — Apps

fal Serverless Platform API — Apps. 4 operations covering listing, inspecting, secret management, and file listing for custom GPU functions deployed with @fal.function / fal.App...

Run with Naftiko

fal Storage API — Files

fal Storage API — Files. 1 operation. Lead operation: Initiate Asset Upload. Self-contained Naftiko capability for uploading binary inputs (images, audio, video, control maps) t...

Run with Naftiko

fal Usage and Billing API — Reporting

fal Usage and Billing API — Reporting. Programmatic access to per-model spend, GPU-second consumption, and invoicing history that powers the fal dashboard.

Run with Naftiko

Features

Unified queue-based REST API at https://queue.fal.run/{model-id} for 1,000+ generative models
Image generation models — FLUX (Schnell, Dev, Pro, Kontext Pro), Seedream V4, Nano Banana, Qwen, SDXL, SD3, Ideogram, Recraft
Video generation models — Veo 3, Kling 2.5 Turbo Pro, Wan 2.5, Seedance 2.0, Ovi, Hunyuan, Sora-class
Audio and voice models — Inworld TTS-1.5, ElevenLabs, MMAudio, MusicGen, Stable Audio
3D and multimodal models — TripoSR, Hunyuan3D, LivePortrait, FaceChain
Synchronous, asynchronous queue, server-sent streaming, and WebSocket realtime invocation modes
Webhook callbacks for queue completion with HMAC signature verification
File uploads / CDN storage at https://v3.fal.media with signed upload URLs
fal Serverless — `@fal.function`, `fal.App`, BYO container deployment with autoscaling from 0 to thousands of GPUs
fal Compute — dedicated H100/H200/A100/B200 instances with SSH and per-second billing
Per-output billing (image, video second, audio minute) plus per-second GPU billing for custom deployments
99.99% uptime SLA, SOC 2 compliance, private endpoints, and enterprise support
Proprietary Inference Engine — up to 10x faster than reference implementations
Official SDKs for Python (fal-client), JavaScript/TypeScript (@fal-ai/client), Swift, Java/Kotlin, Dart
fal CLI for serverless deploy / run / apps / secrets / auth
fal MCP Server exposing all 1,000+ models to AI assistants via the Model Context Protocol
ComfyUI and Blender extensions, plus Terraform provider for infra-as-code
Day-zero launch partner for major model releases (FLUX, Veo, Kling, Seedance, Wan, etc.)

Semantic Vocabularies

Fal Ai Context

0 classes · 9 properties

JSON-LD

API Governance Rules

fal API Rules

8 rules · 1 errors 5 warnings 2 info

SPECTRAL

Resources

🌐
Portal
Portal
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🚀
GettingStarted
GettingStarted
🟢
StatusPage
StatusPage
📰
Blog
Blog
📝
SignUp
SignUp
💰
Pricing
Pricing
💬
Support
Support
🔗
Forum
Forum
📜
TermsOfService
TermsOfService
📜
PrivacyPolicy
PrivacyPolicy
🔗
TrustCenter
TrustCenter
🔗
LinkedIn
LinkedIn
🔗
Twitter
Twitter
👥
GitHubOrganization
GitHubOrganization
📦
SDK
SDK
📦
SDK
SDK
📦
SDK
SDK
📦
SDK
SDK
📦
SDK
SDK
📦
SDK
SDK
🔧
Tool
Tool
🔧
Tool
Tool
🔧
Tool
Tool
💻
CodeExamples
CodeExamples
💻
CodeExamples
CodeExamples
💻
CodeExamples
CodeExamples
🔗
Documentation
Documentation
🔗
Documentation
Documentation
🔗
Plans
Plans
🔗
RateLimits
RateLimits
🔗
FinOps
FinOps

Sources

Raw ↑
aid: fal-ai
url: https://raw.githubusercontent.com/api-evangelist/fal-ai/refs/heads/main/apis.yml
apis:
- aid: fal-ai:fal-model-apis
  name: fal Model APIs
  tags:
  - AI
  - Generative AI
  - Image Generation
  - Video Generation
  - Audio Generation
  - Multimodal
  - Inference
  humanURL: https://fal.ai/docs/model-apis/quickstart
  baseURL: https://queue.fal.run
  properties:
  - url: https://fal.ai/docs/model-apis/quickstart
    type: Documentation
  - url: https://fal.ai/models
    type: Documentation
    name: Model Gallery
  - url: openapi/fal-model-apis-openapi.yml
    type: OpenAPI
  - url: json-schema/fal-queue-request-schema.json
    type: JSONSchema
  - url: json-schema/fal-queue-status-schema.json
    type: JSONSchema
  - url: json-ld/fal-ai-context.jsonld
    type: JSONLD
  - type: NaftikoCapability
    url: capabilities/model-apis-queue.yaml
  description: Unified queue-based REST API for invoking 1,000+ generative image, video, audio, and multimodal
    models hosted on fal's inference infrastructure. Submit a request to `https://queue.fal.run/{model-id}`, poll
    `/requests/{request_id}/status` or `/requests/{request_id}` for progress and results, or subscribe to webhook
    callbacks. Supports synchronous responses, asynchronous queueing, server-sent streaming progress, and request
    cancellation. Powers flagship models including FLUX, Veo 3, Kling 2.5, Wan 2.5, Seedream, Nano Banana, Qwen,
    SDXL, and Stable Diffusion variants.
- aid: fal-ai:fal-realtime-api
  name: fal Realtime API
  tags:
  - AI
  - Generative AI
  - Realtime
  - WebSocket
  - Streaming
  - Inference
  humanURL: https://fal.ai/docs/model-apis/real-time
  baseURL: wss://realtime.fal.run
  properties:
  - url: https://fal.ai/docs/model-apis/real-time
    type: Documentation
  - url: https://github.com/fal-ai/real-time-demo-app
    type: CodeExamples
  description: WebSocket-based realtime inference for ultra-low latency interactive generative experiences such as
    LCM/SDXL sketch-to-image, live-portrait, and realtime upscaling. Bi-directional binary/JSON messaging keeps a
    persistent connection open so each frame, prompt, or pose adjustment is processed in milliseconds. Powers
    fal.realtime client utilities used in canvas apps, drawing tools, AR experiences, and live video pipelines.
- aid: fal-ai:fal-streaming-api
  name: fal Streaming API
  tags:
  - AI
  - Generative AI
  - Streaming
  - Server-Sent Events
  - Inference
  humanURL: https://fal.ai/docs/model-apis/streaming
  baseURL: https://queue.fal.run
  properties:
  - url: https://fal.ai/docs/model-apis/streaming
    type: Documentation
  description: HTTP streaming endpoint (`/{model-id}/stream`) that emits progressive partial outputs as a model
    runs — used for LLM/VLM token streams, incremental video frames, and step-by-step image diffusion previews.
    Compatible with Server-Sent Events parsers in the official fal-client SDKs.
- aid: fal-ai:fal-storage-api
  name: fal Storage API
  tags:
  - AI
  - Generative AI
  - File Upload
  - Storage
  - CDN
  humanURL: https://fal.ai/docs/model-apis/file-uploads
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/docs/model-apis/file-uploads
    type: Documentation
  - url: openapi/fal-storage-api-openapi.yml
    type: OpenAPI
  - type: NaftikoCapability
    url: capabilities/storage-files.yaml
  description: REST endpoints for uploading binary inputs (images, audio clips, reference frames, control maps) to
    fal's CDN so they can be referenced by URL when invoking model APIs. Issues short-lived signed upload URLs via
    `/storage/upload/initiate` and serves the resulting assets from `https://v3.fal.media`.
- aid: fal-ai:fal-serverless-platform-api
  name: fal Serverless Platform API
  tags:
  - AI
  - Serverless
  - GPU
  - Deployments
  - Platform
  humanURL: https://fal.ai/docs/private-serverless-models
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/docs/private-serverless-models
    type: Documentation
  - url: https://github.com/fal-ai/fal
    type: SDK
    name: fal Python SDK and CLI
  - url: openapi/fal-serverless-platform-api-openapi.yml
    type: OpenAPI
  - type: NaftikoCapability
    url: capabilities/serverless-apps.yaml
  description: Programmatic management of custom fal Serverless applications — list, inspect, deploy, scale, and
    monitor user-defined GPU functions deployed with `@fal.function`, `fal.App`, or BYO containers. Covers app
    metadata, secrets, file volumes, scaling parameters (`keep_alive`, `min_concurrency`), and execution analytics.
- aid: fal-ai:fal-models-catalog-api
  name: fal Models Catalog API
  tags:
  - AI
  - Generative AI
  - Catalog
  - Discovery
  humanURL: https://fal.ai/models
  baseURL: https://fal.ai
  properties:
  - url: https://fal.ai/models
    type: Documentation
  - type: NaftikoCapability
    url: capabilities/models-catalog.yaml
  description: Read-only discovery endpoints for browsing fal's 1,000+ production model catalog, including model
    metadata, capability tags, pricing per output, supported parameters, example inputs, and OpenAPI schemas per
    model. Backs the model gallery, search, and SDK tooling.
- aid: fal-ai:fal-compute-api
  name: fal Compute API
  tags:
  - AI
  - GPU
  - Compute
  - Infrastructure
  - Dedicated
  humanURL: https://fal.ai/compute
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/compute
    type: Documentation
  - type: NaftikoCapability
    url: capabilities/compute-instances.yaml
  description: Provision and manage dedicated GPU instances (H100, H200, A100, B200) with full SSH access for
    training, fine-tuning, and persistent workloads. Hourly or per-second billing with no lock-in.
- aid: fal-ai:fal-keys-api
  name: fal API Keys API
  tags:
  - AI
  - Administration
  - Authentication
  - API Keys
  humanURL: https://fal.ai/dashboard/keys
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/dashboard/keys
    type: Documentation
  - type: NaftikoCapability
    url: capabilities/keys-management.yaml
  description: 'Manage fal API keys — create, list, scope, and revoke keys used to authenticate against the Model,
    Storage, Serverless, and Compute APIs via the Authorization: Key $FAL_KEY header.'
- aid: fal-ai:fal-usage-billing-api
  name: fal Usage and Billing API
  tags:
  - AI
  - Administration
  - Usage
  - Billing
  - FinOps
  humanURL: https://fal.ai/dashboard/usage
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/dashboard/usage
    type: Documentation
  - type: NaftikoCapability
    url: capabilities/usage-billing.yaml
  description: Programmatic access to usage metrics, per-model spend, GPU-second consumption, and invoicing
    history. Surfaces the same data shown on the fal dashboard so platform teams can pipe inference cost into
    internal FinOps tooling.
name: fal
tags:
- AI
- Artificial Intelligence
- Generative AI
- Generative Media
- Image Generation
- Video Generation
- Audio Generation
- Inference
- Serverless
- GPU
- MCP
kind: contract
image: https://kinlane-productions2.s3.amazonaws.com/apis-json/apis-json-logo.jpg
access: 3rd-Party
common:
- type: Portal
  url: https://fal.ai
- type: Documentation
  url: https://fal.ai/docs
- type: Documentation
  name: Model APIs Quickstart
  url: https://fal.ai/docs/model-apis/quickstart
- type: Documentation
  name: Model Gallery
  url: https://fal.ai/models
- type: Documentation
  name: Authentication
  url: https://fal.ai/docs/authentication
- type: Documentation
  name: Webhooks
  url: https://fal.ai/docs/model-apis/webhooks
- type: Documentation
  name: Realtime
  url: https://fal.ai/docs/model-apis/real-time
- type: Documentation
  name: Streaming
  url: https://fal.ai/docs/model-apis/streaming
- type: Documentation
  name: File Uploads
  url: https://fal.ai/docs/model-apis/file-uploads
- type: Documentation
  name: Private Serverless Models
  url: https://fal.ai/docs/private-serverless-models
- type: GettingStarted
  url: https://fal.ai/docs/model-apis/quickstart
- type: StatusPage
  url: https://status.fal.ai
- type: Blog
  url: https://blog.fal.ai
- type: SignUp
  url: https://fal.ai/login
- type: Pricing
  url: https://fal.ai/pricing
- type: Support
  name: Discord
  url: https://discord.gg/fal-ai
- type: Forum
  url: https://discord.gg/fal-ai
- type: TermsOfService
  url: https://fal.ai/legal/terms-of-service
- type: PrivacyPolicy
  url: https://fal.ai/legal/privacy-policy
- type: TrustCenter
  url: https://trust.fal.ai
- type: LinkedIn
  url: https://www.linkedin.com/company/featuresandlabels
- type: Twitter
  url: https://twitter.com/fal
- type: GitHubOrganization
  url: https://github.com/fal-ai
- type: SDK
  name: fal Python Client
  url: https://github.com/fal-ai/fal-client-python
- type: SDK
  name: fal JavaScript Client
  url: https://github.com/fal-ai/fal-js
- type: SDK
  name: fal Swift Client
  url: https://github.com/fal-ai/fal-swift
- type: SDK
  name: fal Java/Kotlin Client
  url: https://github.com/fal-ai/fal-java
- type: SDK
  name: fal Dart/Flutter Client
  url: https://github.com/fal-ai/fal-dart
- type: SDK
  name: fal Python SDK / Serverless
  url: https://github.com/fal-ai/fal
- type: Tool
  name: fal Terraform Provider
  url: https://github.com/fal-ai/terraform-provider-fal
- type: Tool
  name: fal Blender Extension
  url: https://github.com/fal-ai/fal-blender-extension
- type: Tool
  name: fal VS Code Extension (Serverless)
  url: https://github.com/fal-ai/serverless-vscode
- type: CodeExamples
  name: Awesome fal
  url: https://github.com/fal-ai/awesome
- type: CodeExamples
  name: Real-Time Demo App
  url: https://github.com/fal-ai/real-time-demo-app
- type: CodeExamples
  name: fal Next.js Template
  url: https://github.com/fal-ai/fal-nextjs-template
- type: Documentation
  name: MCP Server
  url: https://fal.ai/docs/mcp-server
- type: Documentation
  name: ComfyUI Integration
  url: https://fal.ai/docs/comfyui
- url: plans/fal-ai-plans-pricing.yml
  type: Plans
- url: rate-limits/fal-ai-rate-limits.yml
  type: RateLimits
- url: finops/fal-ai-finops.yml
  type: FinOps
- type: Features
  data:
  - Unified queue-based REST API at https://queue.fal.run/{model-id} for 1,000+ generative models
  - Image generation models — FLUX (Schnell, Dev, Pro, Kontext Pro), Seedream V4, Nano Banana, Qwen, SDXL, SD3,
    Ideogram, Recraft
  - Video generation models — Veo 3, Kling 2.5 Turbo Pro, Wan 2.5, Seedance 2.0, Ovi, Hunyuan, Sora-class
  - Audio and voice models — Inworld TTS-1.5, ElevenLabs, MMAudio, MusicGen, Stable Audio
  - 3D and multimodal models — TripoSR, Hunyuan3D, LivePortrait, FaceChain
  - Synchronous, asynchronous queue, server-sent streaming, and WebSocket realtime invocation modes
  - Webhook callbacks for queue completion with HMAC signature verification
  - File uploads / CDN storage at https://v3.fal.media with signed upload URLs
  - fal Serverless — `@fal.function`, `fal.App`, BYO container deployment with autoscaling from 0 to thousands of
    GPUs
  - fal Compute — dedicated H100/H200/A100/B200 instances with SSH and per-second billing
  - Per-output billing (image, video second, audio minute) plus per-second GPU billing for custom deployments
  - 99.99% uptime SLA, SOC 2 compliance, private endpoints, and enterprise support
  - Proprietary Inference Engine — up to 10x faster than reference implementations
  - Official SDKs for Python (fal-client), JavaScript/TypeScript (@fal-ai/client), Swift, Java/Kotlin, Dart
  - fal CLI for serverless deploy / run / apps / secrets / auth
  - fal MCP Server exposing all 1,000+ models to AI assistants via the Model Context Protocol
  - ComfyUI and Blender extensions, plus Terraform provider for infra-as-code
  - Day-zero launch partner for major model releases (FLUX, Veo, Kling, Seedance, Wan, etc.)
  sources:
  - https://fal.ai
  - https://fal.ai/docs
  - https://fal.ai/pricing
  - https://fal.ai/models
  - https://github.com/fal-ai
  - https://blog.fal.ai
  updated: '2026-05-25'
created: '2026-05-25'
modified: '2026-05-25'
position: Consuming
description: fal (Features and Labels, Inc.) is a generative media platform providing the world's fastest API for
  running image, video, audio, and multimodal generative AI models. Through a unified queue-based REST API at
  https://queue.fal.run, plus realtime WebSocket and SSE streaming surfaces, fal serves 1,000+ production models
  — including FLUX, Veo 3, Kling, Wan, Seedream, Nano Banana, and Stable Diffusion — on autoscaling GPU
  infrastructure. fal Serverless lets developers ship custom models with `@fal.function` / `fal.App` / BYO
  containers, while fal Compute provides dedicated H100/H200/A100/B200 instances. Trusted by Canva, Perplexity,
  Poe, and 1.5M+ developers; Series D funded ($140M, Sequoia-led, December 2025); SOC 2 with 99.99% uptime.
maintainers:
- FN: Kin Lane
  email: [email protected]
  X: apievangelist
  url: https://apievangelist.com
specificationVersion: '0.16'