The Wayback Machine - https://web.archive.org/all/20070607033242/http://www.nla.gov.au/guidelines/webresources.html

Guideline

Safeguarding Australia’s Web Resources: Guidelines for Creators and Publishers

BACKGROUND

On-going access to information resources published on the web is under threat due to changes in the computer technologies that are needed to use them. Dependence on particular hardware and software may result in resources becoming unusable when their support technology becomes obsolete. Australian web resources form a significant part of our documentary heritage and action must be taken now to safeguard them for use into the future.

Preservation solutions for web resources are still evolving. There are as yet no internationally agreed approaches for ensuring the resources will remain available for use over the long term although possible solutions are being explored. However, there are steps that can be taken now to ensure resources are appropriately organised, documented, and managed for needs beyond immediate access. These steps are an essential pre-requisite to later preservation action that might be required when solutions are available.

Creators, publishers, libraries and other information repositories all have a vital role to play in ensuring web resources remain available for use.

These guidelines are intended primarily to assist those creators and publishers who do not already have well established digital data management procedures in place. They could, for instance, be working in the academic, government and not-for-profit sectors, and include independent creators and newly established web publishers. Types of resources include those deemed by their creators to have lasting research or information value, and which are made available for public use - for example, research papers; electronic journal articles; government reports; creative writing; statistical databases and business or service delivery web sites. The resources will be varied in nature, ranging from static, plain text documents to dynamic, interactive, and/or multi-media resources whose content is updated periodically or evolves over time.

It should be noted that government agencies creating web resources, whether primarily for public information or as part of internal business operations, must comply with relevant state, territory or Commonwealth legislation concerning record keeping responsibilities and public accountability requirements. Record keeping and archival standards set by archives organisations should also be followed. Further information is available from Archives of Australia from the National Archives of Australia site.

GUIDELINES

The guidelines provide practical advice on creating, describing, naming and managing web resources to facilitate their on-going use. It is recommended that they become standard practice, built into the planning stage for all web resources created and into existing information management strategies for information in other forms. In this way, it will be easier to carry out preservation action that might be required at a later stage to maintain access to those resources judged to have lasting value.

Creating resources

Before a resource is created thought needs to be given to the technology used to publish it and to view it. This includes software and hardware, file formats, data standards, and how the resource will be described. Unless there are strong reasons to do otherwise, a policy of using very widely accepted standards is highly recommended. This approach will support the widest possible current access as well as making long-term access more likely.

Other points to consider include the following:

A useful source of information on best practices for creating web resources is the National Initiatives for Networked Cultural Heritage site.

Describing resources

Metadata is data that records information about a resource. It is essential to support discovery, use and management of the resource. For instance, information about the title can support searching; information about terms and conditions associated with use of the resource can determine how it can be used; and information about its technical characteristics can support preservation decisions.

It is desirable that metadata schemas in general use or which are recognised standards in particular fields, are used for all purposes. This will achieve consistency of description and searching of resources from different sources, and facilitate the exchange of metadata.

Metadata can be recorded by:

To support resource discovery and use:

To support resource management:

Maintaining links to resources

The name or identifier commonly used for web resources is the URL (Uniform Resource Locator). This identifies the domain name of a server on which a resource is housed and the place of the resource in the file structure of that server. The location-based nature of the URL is its major weakness as a resource identifier as the identifier will change if the resource is relocated. When this happens all existing links to the resource will be broken, including those embedded in other resources or, for instance, found in catalogues, indexes and listings that provide resource discovery services.

To maintain access to web resources over time it is necessary to ensure that links to them are not broken, even if their location on the web changes. The maintenance of links to resources requires careful planning and management of local web sites. There are a number of ways to make sure links continue to work. These can be grouped into three categories:

The National Library is currently investigating options for implementation of a persistent naming scheme that will serve its own business needs and that can also be adopted as a national scheme. A report on a consultancy conducted for the National Library is available at www.nla.gov.au/initiatives/persistence/PIcontents.html

Additional information about managing web resources for persistent linking, is available in a companion document Managing web resources for persistent access

Managing resources

Creators and publishers can make sure their web resources are managed appropriately for ongoing access by:

The conditions needed for managing online resources for ongoing access include:

Creators who are not able to archive their own resources or have them archived by their organisation, are encouraged to make arrangements with an appropriate repository. For instance, the National Library and most State and Territory libraries accept responsibility for online publications that are in-scope for their selection criteria; and the National Archives of Australia accepts responsibility for selected Commonwealth government online records. It is expected that other bodies will accept similar roles over time.

ROLE OF THE NATIONAL LIBRARY

Archiving

The National Library has been archiving selected Australian online publications (individual documents, collections, e-journals and web pages) in its PANDORA Archive since 1996. However, the Library is able to archive only a portion of what is being produced online. It recognises that it must work with others and facilitate action to ensure a wider range of information resources is saved and accessible for future use. It does this by:

Preservation

The National Library accepts a national leadership role in the area of digital preservation and is also a participant in several international projects. At a practical level, it intends to preserve for long term access the digital resources archived in PANDORA. To this end, it is conducting trials to test migration of data to new formats and operating systems, so content can still be used when older formats and technologies become obsolete; and it is testing emulation of obsolete technologies – ie. making new generations of hardware and software function as if they were the technology originally used to provide access.

Developments in digital preservation techniques and relevant standards and best practices are reported on the Library’s PADI (Preserving Access to Digital Information) website.

CONTACT DETAILS

For more information or assistance with any matter raised in these guidelines, please contact:

Manager
Digital Archiving
National Library of Australia
Canberra ACT 2600

email: elecunit@nla.gov.au
fax: 02 6273 4322
phone: 02 6262 1140

June 2002

Top

About This Site | Copyright | Privacy | Accessibility | Site Map | Site Search | Content A-Z | Contact Us