This crawl was run with the Heritrix setting "maxHops=0" (seed URLs plus their embeds, with no further links followed).
Survey 7 is based on a seed list of 339,249,218 URLs: every URL in the Wayback Machine for which we saw a 200 response code in 2017, according to a query we ran on Feb. 1st, 2018.
The WARC files associated with this crawl are not currently available to the general public.