WhyLabs
WhyLabs was an AI observability platform focused on data and model monitoring for both classical ML and LLM workloads. It built and maintained whylogs, an open-source data logging library that produces statistical profiles of tabular and unstructured data, and LangKit, an open-source toolkit for LLM telemetry covering relevance, toxicity, prompt injection signals, and quality metrics. WhyLabs, Inc. has announced it is discontinuing operations and has open-sourced its platform; the whylogs and LangKit projects remain available on GitHub for community use and research.
WhyLabs publishes 3 APIs on the APIs.io network. Tagged areas include AI Observability, ML Monitoring, LLM Monitoring, Open Source, and whylogs.
APIs
whylogs
whylogs is an open-source data logging library that creates approximate statistical profiles of datasets, enabling drift detection, data quality monitoring, and bias analysis fo...
LangKit
LangKit is an open-source toolkit that extracts telemetry from LLM prompts and responses including relevance, sentiment, toxicity, prompt injection signals, jailbreak similarity...
WhyLabs Observability Platform
WhyLabs Observability is the historical commercial SaaS that ingested whylogs profiles and LangKit telemetry for dashboards, drift alerts, and constraint monitoring. WhyLabs, In...
Features
Privacy-preserving statistical profiles of tabular, text, image, and embedding data.
Out-of-the-box metrics for relevance, toxicity, prompt injection signals, and refusal patterns.
Compare profiles over time to detect data and concept drift.
Constraint-based checks on schema, ranges, missingness, and distribution properties.
Profile-driven analysis of model inputs and outputs across protected groups.
Core libraries remain available under permissive licenses on GitHub.
Use Cases
Monitor training and inference datasets for schema drift and quality issues.
Instrument LLM applications with LangKit metrics to track safety and quality over time.
Detect distribution shifts in features and predictions for production ML models.
Share statistical profiles between teams and environments without exposing raw data.
Integrations
Profile pandas DataFrames directly with whylogs.
Generate whylogs profiles from PySpark and Spark Scala jobs.
Profile Snowflake tables for drift and quality monitoring.
Read and write whylogs profiles to S3 for distributed pipelines.
Log whylogs profiles alongside MLflow runs and models.
Apply LangKit metrics to Hugging Face model outputs.