Mercor
Mercor is an AI-powered talent and human-intelligence marketplace that organizes expert humans to power frontier AI work. The platform routes specialized professionals (software engineers, finance and investment-banking experts, clinicians, attorneys, generalist consultants) to AI labs and enterprises for RLHF data, SFT data, agent training, evals, frontier research, and managed data pipelines. Mercor also ships APEX, a public research and benchmarking suite — APEX Benchmarks (AI Productivity Index), APEX-Agents Leaderboard, and APEX-SWE Leaderboard. Mercor maintains documentation hubs at mercor.com/docs and talent.docs.mercor.com (the expert-facing help center), and exposes a developer-facing documentation surface at mercor.com/docs/api. Public API endpoint details are not advertised on the open web at this time; this profile documents the product and documentation surfaces rather than endpoint shapes.
Mercor publishes 7 APIs on the APIs.io network. Tagged areas include Talent Marketplace, Human Intelligence, RLHF, SFT, and AI Evals.
Mercor’s developer surface includes developer portal, documentation, API reference, engineering blog, signup flow, support, and 4 more developer resources.
APIs
Mercor Talent Marketplace
The core Mercor platform that matches expert humans to AI lab and enterprise demand for RLHF, SFT, evals, agent training, and frontier research projects. Domains covered include...
Mercor Data Pipelines
Mercor's managed data-pipeline product for designing and operating large, expert-driven labeling and evaluation pipelines for AI training data.
Mercor API
Mercor's developer-facing API documentation surface. Endpoint shapes and authentication details are not currently published openly; access is via Mercor's enterprise sales process.
APEX Benchmarks (AI Productivity Index)
Mercor's public AI productivity benchmark and research surface. APEX measures how well AI models perform real expert-grade work.
APEX-Agents Leaderboard
Public leaderboard for AI agent performance run by Mercor's research team.
APEX-SWE Leaderboard
Public leaderboard for AI software-engineering performance run by Mercor's research team.
Terminal-Bench
Public benchmark / task-submission framework published by Mercor (terminal-bench-3 on GitHub) for evaluating AI agents on terminal-based engineering tasks.
Features
Routes expert humans across software engineering, finance, healthcare, legal, and consulting into AI lab and enterprise projects.
Provides preference, reward, and demonstration data for foundation-model training.
Specialist data for training and evaluating AI agents.
End-to-end design and operation of expert-driven labeling and evaluation pipelines.
Public benchmarks (AI Productivity Index, APEX-Agents, APEX-SWE) measuring real-world AI performance.
Open-source benchmark and task-submission framework for AI agents on terminal engineering tasks.
Use Cases
Source preference and reward data from domain experts for RLHF.
Capture demonstrations of expert workflows for supervised fine-tuning.
Benchmark agent performance against expert-graded tasks via APEX and Terminal-Bench.
Stand up managed labeling and evaluation pipelines for enterprise AI programs.
Integrations
Expert workflows and team coordination run over Slack channels.
Engineering experts integrate with customer GitHub repositories for code-related work.
Mercor designs custom ingest and delivery pipelines per customer engagement.