Gutendex
Gutendex is a simple, self-hosted JSON-based web API for serving book catalog information from Project Gutenberg, providing structured metadata for over 70,000 free ebooks including titles, authors, subjects, bookshelves, languages, copyright status, media types, downloadable formats, and download counts. The hosted instance at gutendex.com runs the open-source Django project by Gareth B. Johnson under the MIT license.
APIs
Gutendex Books API
Returns paginated metadata for Project Gutenberg ebooks with filters for author birth/death year, copyright status, IDs, languages, MIME type, free-text search, topic, and sort ...
Capabilities
Gutendex API — Books
Gutendex API — Books. 2 operations: List Books and Get Book. Self-contained Naftiko capability covering the full Gutendex public API surface for Project Gutenberg ebook metadata.
Run with NaftikoFeatures
No API key, no signup, no rate limit documented; gutendex.com is openly available.
Mirrors the full Project Gutenberg catalog of free public-domain ebooks.
Filter by author birth/death year, copyright status, language, MIME type, IDs, search keywords, and topic.
Sort results by popularity (default), ascending ID, or descending ID.
Each book exposes a Format object mapping MIME types (HTML, EPUB, MOBI, plain text, JPEG) to direct download URLs.
Books are tagged with Library of Congress subject headings and curated Project Gutenberg bookshelves.
Books are indexed by two-character ISO language codes; filter by multiple languages at once.
MIT-licensed Django app that operators can run on their own infrastructure with nightly Project Gutenberg catalog sync.
Use Cases
Power large-scale analysis of public-domain literature by ingesting the structured Project Gutenberg catalog.
Build classroom or self-study apps that surface public-domain texts by subject, author, or reading level.
Use the download_count and bookshelf data to recommend popular classics or themed reading lists.
Power library catalog interfaces and ebook reader apps with a clean JSON facade over Project Gutenberg's XML archives.
Use the catalog to discover bulk text downloads for training language models on public-domain corpora.
Power author/title lookup widgets in citation managers and academic writing tools.
Integrations
Upstream source. Gutendex nightly-ingests the official Project Gutenberg XML catalog and republishes it as JSON.
Built on Django + Django REST Framework. Operators can extend the API with additional viewsets.
Community-maintained MCP servers expose Project Gutenberg / Gutendex to LLM agents (bobbyhouse, vellankis-space, nasimcoderex).
Default Django storage backends used by self-hosted instances for catalog persistence.