Data Sources
The Epstein Document Archive sources all 207,251 documents exclusively from official U.S. government releases. Below is a complete list of data sources, the number of documents from each, and descriptions of what they contain. We do not alter, fabricate, or editorialize any content.
All Sources
DOJ Epstein Library - Court Records
Court records from the Jeffrey Epstein investigation released through the DOJ Epstein Library. Includes indictments, plea agreements, motions, orders, and related filings from both the Southern District of Florida and the Southern District of New York.
DOJ - EFTA Photos
Photographs transferred via the Electronic File Transfer Application (EFTA) from the DOJ. These include evidence photos from properties, items, and locations related to the investigation.
DOJ - Manual Photos
Manually uploaded photographs from the DOJ release. These complement the EFTA photos and include additional evidence images, property photos, and investigation materials.
DOJ - Unclassified Materials
Unclassified materials from the DOJ release that do not fall into the other specific categories. Includes miscellaneous investigation documents, reports, and correspondence.
DOJ - Data Sets 1-5
Five structured data sets released by the DOJ containing additional investigation records. These data sets were released in batches and cover a range of document types, including financial records, communications, and investigative notes.
House Oversight Committee
Documents released by the U.S. House Oversight Committee related to the Epstein investigation. These include committee reports, transcripts, and related materials from congressional oversight activities.
FBI Vault
FBI investigation files obtained from the FBI Vault, the FBI's electronic reading room. These include FBI reports, memos, and investigative materials related to Epstein that have been declassified and released.
FOIA Releases
Documents obtained through Freedom of Information Act (FOIA) requests. These include records from various federal agencies that were released in response to public records requests related to Epstein.
Data Integrity
Every document in the archive is traceable to its official government source. We maintain SHA-256 file hashes for all original documents so that any copy can be checked against the original. The processing pipeline (OCR, text extraction, entity recognition) is documented on our methodology page.
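As an illustration of how a SHA-256 check of this kind can work, here is a minimal sketch in Python. It is not the archive's actual tooling: the function names `sha256_of_file` and `verify_file` are hypothetical, and it assumes the published hash is a plain hexadecimal string.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of the file at `path`.

    The file is read in chunks so large PDFs or images do not
    have to fit in memory all at once.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel keeps calling fh.read until it returns b"".
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_file(path, expected_hex):
    """Return True if the file's digest matches the expected hex string.

    Assumption: the expected hash is hex text; whitespace and letter
    case are normalized before comparing.
    """
    return sha256_of_file(path) == expected_hex.strip().lower()
```

A downloaded file would then be checked with something like `verify_file("document.pdf", published_hash)`, where both the file name and `published_hash` are placeholders. A mismatch means the local copy differs from the hashed original by at least one byte.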