This crawl was run with a Heritrix setting of "maxHops=0" (URLs including their embeds)
Survey 7 is based on a seed list of 339,249,218 URLs which is all the URLs in the Wayback Machine that we saw a 200 response code from in 2017 based on a query we ran on Feb. 1st, 2018.
The WARC files associated with this crawl are not currently available to the general public.
关于甲壳虫 | 甲壳虫招聘 | 联系我们 | 隐私政策
沪网文 0547-059 沪ICP备11018428号
甲壳虫（上海）网络科技有限公司 Copyright©2008-2014 Beetle Inc. All Rights Reserved