The Harvest Information Discovery and Access System
Harvest is an integrated set of tools to gather, extract,
organize, search, cache, and replicate relevant information across the
Internet. With modest effort users can tailor Harvest to digest
information in many different formats from many different machines,
and offer custom search services on the web.
Funding for Harvest ended August 1996, and the project is now officially
over, but the software will live on in two ways:
As free software maintained by volunteers.
- A group of Harvest users has released
to fix several bugs in version 1.4, the
final version released by the original team.
- There is a significant expert
user base, and the
Usenet newsgroup comp.infosystems.harvest
will continue to be active and a source for advice and extensions.
- Some of the parts of Harvest, for example
are still maintained and extended (see
by its original team.
As commercial derivatives:
Catalog Server is based on the Harvest design.
The Harvest cache project has spawned a
established to promote cooperating
caching as core Internet functionality. In March 1997, this effort
became part of
which intends to provide
software and appliance versions of the Harvest object cache.
As part of its distributed service architecture, @Home Network replicates content from content partners onto
its Regional Data Centers. @Home will use Netscape's Catalog
Server to provide search capabilities for this content.
You can learn about and experiment with Harvest through the hypertext
If you have questions about configuring and using Harvest, please see
the Usenet newsgroup
searchable archive of messages
Harvest Version 1.5
is now available.
The Harvest Cache is available
as a separate distribution.
Mic Bowman, Peter Danzig, Darren Hardy, Udi Manber, Mike Schwartz, and
September 2, 1996