The Internet Archive discovers and captures web pages through many different web crawls.
At any given time several distinct crawls are running, some for months, and some every day or longer.
View the web archive through the Wayback Machine.
Web wide crawl with initial seedlist and crawler configuration from September 2012.
CCK11 Participant Blog Posts
Participant Blog Posts CCK11. By Stephen Downes and George Siemens.en-usWed, 13 Apr 2011 06:29:32 -0400Wed, 13 Apr 2011 06:29:32 -0400http://blogs.law.harvard.edu/tech/rssgRSShopperstephen@email@example.com