Wikipedia dump from August 2022
wordfreq
The wordfreq project is not being updated because of AI and thus remains a resource for human-generated word frequencies.
Internet Archive Image Library
This library contains digital images uploaded by Archive users which range from maps to astronomical imagery to photographs of artwork. Many of these images are available for free download.
Library of Congress Photo Archive
The Prints and Photographs Online Catalog (PPOC) contains catalog records and digital images representing a rich cross-section of still pictures held by the Prints & Photographs Division and, in some cases, other units of the Library of Congress. The Library of Congress offers broad public access to these materials as a contribution to education and scholarship.
Project Gutenberg
You will find the world’s great literature here, with focus on older works for which U.S. copyright has expired. Thousands of volunteers digitized and diligently proofread the eBooks, for you to enjoy.
Arctic Code Vault
The archive is located in a decommissioned coal mine in the Svalbard archipelago, closer to the North Pole than the Arctic Circle. GitHub captured a snapshot of every active public repository on 02/02/2020 and preserved that data in the Arctic Code Vault.