Starting in 1996, Alexa Internet has been donating their crawl data to the Internet Archive. Flowing in every day, these data are added to the Wayback Machine after an embargo period.
If working with an
ISP where all of the files are in
HTML document
space, disable all access to the Interchange catalog directory with
the proper
HTTP access restrictions. Normally that is done by creating
a .htaccess file like this: