The seed for this crawl was a list of every host in the Wayback Machine
This crawl was run at a level 1 (URLs including their embeds, plus the URLs of all outbound links including their embeds)
The WARC files associated with this crawl are not currently available to the general public.
You can be part of this incredible success story by joining the world’s biggest English-language website.
You’re just a few clicks away from discovering the opportunity of a lifetime.