FiveThirtyEight Index hits Internet Archive
Journalist Ben Welsh launched a searchable index of over 22,000 FiveThirtyEight articles preserved on the Internet Archive's Wayback Machine. The project restores access to 15 years of data-driven reporting after the original site archive was taken offline by corporate owners.
This preservation effort transforms a massive cultural loss into a structured dataset for the data journalism and AI research communities.
- –Provides a comprehensive directory of 22,264 pages spanning 2008 to 2025
- –Full dataset available as a CSV, making it a high-quality resource for RAG benchmarking and statistical analysis
- –Leverages the Internet Archive as a backend, demonstrating the power of metadata-driven finding aids for web preservation
- –Highlights the vulnerability of digital archives and the critical role of open-source tools in salvaging history
DISCOVERED
1h ago
2026-05-20
PUBLISHED
4h ago
2026-05-20
RELEVANCE
AUTHOR
ChocMontePy