EDA

Archive Statistics

Comprehensive overview and metrics for the Epstein Document Archive. The archive contains 207,251 documents including 4,050 emails, 3,004 flight log entries, 16,407 photographs, and 886 Amazon purchase orders, with 23,540 identified people and entities.

Overview

Documents by Source

DOJ - Court Records12,521 (6.0%)
DOJ - EFTA Photos7,607 (3.7%)
Manual Photos7,683 (3.7%)
DOJ - Unclassified2,632 (1.3%)
DOJ - Data Set 11,200 (0.6%)
DOJ - Data Set 21,100 (0.5%)
DOJ - Data Set 31,050 (0.5%)
DOJ - Data Set 41,180 (0.6%)
DOJ - Data Set 51,173 (0.6%)
DOJ - Data Set 60 (0.0%)
DOJ - Data Set 70 (0.0%)
DOJ - Data Set 80 (0.0%)
DOJ - Data Set 90 (0.0%)
DOJ - Data Set 100 (0.0%)
DOJ - Data Set 110 (0.0%)
DOJ - Data Set 120 (0.0%)
House Oversight Committee1,124 (0.5%)
FBI Vault850 (0.4%)
FOIA Releases600 (0.3%)
Documents by Type
Documents by Source
Top Mentioned People
Source Distribution

Data Summary

Total indexed records
207,251
PDF files processed
188,585
Data sources
19
Identified entities
23,540
Entity relationships
875
Search index status
Active

Frequently Asked Questions

How many documents are in the Epstein archive?
The archive contains 207,251 documents from the Jeffrey Epstein investigation, including 4,050 seized emails, 3,004 flight log entries, 16,407 photographs, and 886 Amazon purchase orders. There are 23,540 identified people and entities across all files.
Where do the documents come from?
Documents are sourced from 19 official government sources, including the DOJ Epstein Library, FBI Vault, House Oversight Committee releases, and FOIA disclosures. All documents are from publicly released government collections.
How are the statistics calculated?
Statistics are calculated from the indexed records in the archive database. Document counts, entity mentions, and other metrics are updated as new documents are processed and added to the archive.