Apache POI
Apache POI is a Java library for reading and writing file formats based on the Office Open XML standards (OOXML) and Microsoft's OLE2 Compound Document format (OLE2). It covers Excel, Word, and PowerPoint files, with more limited support for Visio and Outlook files.
APIs
Apache POI
POI provides Java APIs for reading and writing Microsoft Office formats including Excel (HSSF/XSSF), Word (HWPF/XWPF), PowerPoint (HSLF/XSLF), Visio (HDGF/XDGF), and Outlook (HS...
Capabilities
Features
Read and write Excel files in legacy XLS (HSSF) and modern XLSX (XSSF) formats
Read and write Word documents in legacy DOC (HWPF) and modern DOCX (XWPF) formats
Create and manipulate PowerPoint presentations in legacy PPT (HSLF) and modern PPTX (XSLF) formats
Evaluate Excel formulas and compute cell values programmatically
Low-memory streaming API (SXSSF) for writing large Excel files
Create and modify charts in Excel workbooks and PowerPoint slides
Sign OOXML documents with XML digital signatures
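As a rough illustration of the spreadsheet and formula-evaluation features listed above, the following sketch builds a small XLSX workbook in memory, adds a SUM formula, and evaluates it. It assumes POI 4.0 or later with the poi and poi-ooxml artifacts on the classpath; the class name FormulaSketch, the sheet name, and the sample values are made up for illustration.

```java
import org.apache.poi.ss.usermodel.Cell;
import org.apache.poi.ss.usermodel.CellValue;
import org.apache.poi.ss.usermodel.FormulaEvaluator;
import org.apache.poi.ss.usermodel.Sheet;
import org.apache.poi.ss.usermodel.Workbook;
import org.apache.poi.xssf.usermodel.XSSFWorkbook;

import java.io.ByteArrayOutputStream;
import java.io.IOException;

// Hypothetical example class; not part of POI itself.
class FormulaSketch {

    /**
     * Builds an XLSX workbook with three numbers in A1:A3 and a SUM formula
     * in A4, writes the workbook to the given stream, and returns the
     * evaluated value of the formula.
     */
    static double buildAndEvaluate(ByteArrayOutputStream out) throws IOException {
        try (Workbook workbook = new XSSFWorkbook()) {
            Sheet sheet = workbook.createSheet("Data");

            // Sample values chosen for illustration only.
            double[] values = {10.0, 20.0, 12.5};
            for (int i = 0; i < values.length; i++) {
                sheet.createRow(i).createCell(0).setCellValue(values[i]);
            }

            // Formulas are set without the leading '=' sign.
            Cell total = sheet.createRow(values.length).createCell(0);
            total.setCellFormula("SUM(A1:A3)");

            // Evaluate the formula in memory rather than relying on Excel.
            FormulaEvaluator evaluator =
                    workbook.getCreationHelper().createFormulaEvaluator();
            CellValue result = evaluator.evaluate(total);

            workbook.write(out);
            return result.getNumberValue();
        }
    }

    public static void main(String[] args) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        double sum = buildAndEvaluate(out);
        System.out.println("sum=" + sum + ", bytes=" + out.size());
    }
}
```

For very large outputs, the same Workbook interface is implemented by SXSSFWorkbook, so a sketch like this could be switched to the streaming API. Note that formula evaluation on a streaming workbook only sees rows still held in memory.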
Use Cases
Generate Excel and Word reports programmatically from application data
Import data from Excel spreadsheets and export results back
Fill Office document templates with dynamic data
Convert between legacy Office formats and modern OOXML formats
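The import and export use case above might look roughly like the following round trip, which writes numbers to an XLSX workbook and reads them back through WorkbookFactory. This is a minimal sketch assuming POI 4.0 or later with poi and poi-ooxml on the classpath; the class ImportExportSketch and its method names are hypothetical, and a real application would typically read from a file or upload stream rather than a byte array.

```java
import org.apache.poi.ss.usermodel.Cell;
import org.apache.poi.ss.usermodel.CellType;
import org.apache.poi.ss.usermodel.Row;
import org.apache.poi.ss.usermodel.Sheet;
import org.apache.poi.ss.usermodel.Workbook;
import org.apache.poi.ss.usermodel.WorkbookFactory;
import org.apache.poi.xssf.usermodel.XSSFWorkbook;

import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical example class; not part of POI itself.
class ImportExportSketch {

    /** Writes the amounts into column A of a new XLSX workbook and returns its bytes. */
    static byte[] export(double[] amounts) throws IOException {
        try (Workbook workbook = new XSSFWorkbook();
             ByteArrayOutputStream out = new ByteArrayOutputStream()) {
            Sheet sheet = workbook.createSheet("Amounts");
            for (int i = 0; i < amounts.length; i++) {
                sheet.createRow(i).createCell(0).setCellValue(amounts[i]);
            }
            workbook.write(out);
            return out.toByteArray();
        }
    }

    /** Reads every numeric cell in column A of the first sheet. */
    static List<Double> importColumnA(byte[] spreadsheet) throws IOException {
        List<Double> values = new ArrayList<>();
        // WorkbookFactory detects the format (XLS or XLSX) from the stream.
        try (Workbook workbook =
                     WorkbookFactory.create(new ByteArrayInputStream(spreadsheet))) {
            Sheet sheet = workbook.getSheetAt(0);
            for (Row row : sheet) {
                Cell cell = row.getCell(0);
                // Skip missing cells and anything that is not a plain number.
                if (cell != null && cell.getCellType() == CellType.NUMERIC) {
                    values.add(cell.getNumericCellValue());
                }
            }
        }
        return values;
    }

    public static void main(String[] args) throws IOException {
        byte[] bytes = export(new double[] {1.5, 2.5, 4.0});
        List<Double> imported = importColumnA(bytes);
        System.out.println("imported " + imported.size() + " values: " + imported);
    }
}
```

Coding against the Workbook, Sheet, Row, and Cell interfaces rather than the XSSF classes keeps the import path format-neutral, so the same reader should handle legacy XLS input as well.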
Integrations
Apache Tika uses POI to extract text from Office documents
Works with Spring Boot for web-based document generation
Available as org.apache.poi artifacts on Maven Central
Depends on Apache Commons Collections for data structures and Commons Math for numerical functions