The Wayback Machine - https://web.archive.org/all/20020806115118/http://www.amonline.net.au:80/explore/searching_the_web.htm
australian museum onlineabout the museumresearch and collectionsfeaturesexplore


explore





 




Searching the web

Here are some pointers to help you understand how to use the web for research. There is a wealth of information available.

Contents


How big is the World Wide Web?

In their 2000 review of the World Wide Web, researchers at the Online Computer Library Center, Inc. (OCLC) estimated there were 7.1 million unique web sites, a 50 percent increase over the previous year's total of 4.7 million.

Public web sites constitute 41 percent of the Web, or about 2.9 million sites. Private sites- whose content is subject to explicit access restrictions (eg, Internet Protocol filters or password authentication), or is not intended for public use (eg, web interfaces to privately owned hardware devices such as printers or routers) -comprise 21 percent of the Web, or 1.5 million sites. The remaining 2.7 million sites - or about 38 percent of the Web - are provisional sites: their content is in an unfinished or transitory state (eg, server default pages or "Site under construction" notices).
[Reference: (http://www.oclc.org/home/ Dublin, Ohio, Oct. 16, 2000).

OCLC describes itself as 'a non-profit, membership, library computer service and research organisation dedicated to the public purposes of furthering access to the world's information and reducing information costs'.]


How many Australians use the World Wide Web?

According the Australian Bureau of Statistics as at May 2000, 54% of households (3.8 million) had a computer, an 18% increase over May 1999. About one-third (33%) of households (2.3 million) had Internet access, a 53% increase over May 1999. Estimates for households with a home computer, and those with Internet access at May 2001, (based on the reported intentions of those households without either at May 2000) showed a little over 60% of households are expected to have a home computer and almost 50% are expected to have Internet access.' [From 'Communications and Information Technology -Household use of information technology' Australian Bureau of Statistics website http://www.abs.gov.au/ausstats/]


How do I start to search the Web?

Define you topic. Remember that the more specific your search is, the more likely you will find what you want. If you are just browsing through a subject you could use one of the subject directories (also called search directories). If you are after something more specific use a search engine. If you are after a particular theme or interest area try one of the portals.


What is a subject/search directory?

A directory is a database system. Internet addresses are organised according to subject categories. Subject/search directories data are entered manually.

Directories will remain popular as users will continue searching in broad categories. Directories can help users who haven't defined their search, to navigate through an already organised subject hierarchy. Directory topics and categories have been chosen to help a user browse through the subject area they are interested in.

Examples of subject directories

Australian Museum bookmarks http://www.austmus.gov.au/scripts/bookmark/booksearch.asp
Librarian's Index www.lii.org
Infomine infomine.ucr.edu
Academic Info http://www.academicinfo.net
Yahoo www.yahoo.com
About.com www.about.com
Google Web Directory http://directory.google.com/


What is a search engine?

A search engine is a database system designed to index Internet addresses (URLs, Usenet, ftp, image locations etc). Search engines obtain their data automatically.

Examples of search engines

Australian search engines


What is a portal?

A portal (also known as a 'gateway', 'metasite' or 'umbrella site') is a public website which comprises links to web pages and other websites around a common theme. (From the National Archives of Australia, March 2001)

Examples of subject portals


What kind of information can I expect to find on the web?

The web covers almost every type of information you can imagine. You need to decide if it is the best place or form of information for you or if more traditional sources are appropriate for you. You should be able to find something that suits your enquiry.


Who puts information up on the web?

Information is organised and put onto the Internet by many different individuals, organisations, agencies and groups.


How can I recognise who put the information on the web?

Although it is isn't always the case some of the following categories can help you determine who is responsible for the information you are looking at.

Special interest group
Sponsored by an organisation or group attempting to influence public option.
Check the URL address, it frequently ends in .org

Commercial web pages
Usually a site trying to promote and sell products and services.
Check the URL address, it often ends in .com

Personal web pages
Published by an individual who may or may not be affiliated with an institution.
These pages may have a variety of endings to the URL address - .com, .edu
Often embedded in the URL address is a tilda ~

Information web pages
Main purpose is to present factual information.
The URL address frequently ends in .edu, .gov, or .net
Many of these pages are sponsored by educational institutions or government agencies.

News web page
Their primary purpose is to provide extremely current information.
The URL usually ends in .com


How can I verify a site?

You can try and assess a site using the following criteria.

Accuracy
How reliable and free from error is the information?
Are there editors and fact checkers?

Be aware that almost any one can publish on the web.
Many web resources are not verified by editors or fact checkers.
Standards to achieve accuracy for web publication are not fully developed.

Authority
What are the author's qualifications to write on the subject?
How reputable is the publisher?

It is often difficult to determine authorship of web resources.
If authors names are listed, their qualifications are often absent.
The publisher's responsibility is often not indicated.

Objectivity
Is the information presented with a particular or obvious bias?
To what extent is the information trying to sway the opinion of the audience?

The web often functions as a virtual soapbox.
The goals or aims of the persons or group presenting the material are often not clearly stated.

Currency
Is the content of the work up-to-date?
Is the publication date clearly indicated?

Dates are not always included on web pages and if included, dates may have various meaning.

The date can mean:

  • date the paper etc was created
  • date it was placed on web
  • date of the last revision.

Coverage
What topics are included in the work?
To what depth are the topics explored?

Coverage may differ from print or other media coverage.
It is often hard to determine the extent of the web coverage.


What are some problems with using web resources for research?

One problem is that search engines can retrieve web pages out of context.
You might end up with marketing oriented web pages or the blending of information, entertainment and advertising.
The instability of web pages can be a problem as web pages tend to change or disappear.


How can I cope with these problems?

Out of context
If you can, return to the homepage to determine the source of information.

Marketing oriented pages
On the web this is often blurred. Try and determine if advertising content and information is supplied by the same person.

Web pages may move or disappear without notice
Try and determine the source and documentation to the fullest extent.

Accidental or deliberate alteration
Try verifying the information using other sources.

The ultimate challenge is applying this knowledge when using the web as a research tool.


What are some good sites to start with?

Scientific Literature Searching

  • PubMed/Medline- the National Library of Medicine's search service that provides access to over 10 million citations in MEDLINE, PreMEDLINE, and other related databases, with links to participating online journals.
  • PubSCIENCE - provides users the capability to search across a large compendium of peer reviewed journal literature with a focus on the physical sciences and other disciplines of concern to the Department of Energy (DOE).
  • INIST's catalog of Articles and Monographs - Article@INIST allows you to search for documents (Journal articles, Journal issues, Books, Reports or Conferences, doctoral dissertations) in the catalogue at the Institut de l'Information Scientifique et Technique.
  • BioMedNet - (Full-Text Library) The BioMedNet Library contains over 170 biological and medical journals, including the Trends and Current Opinion review journals.

Electronic Journals

Finding Electronic Journals

Citing information which appears on the web
Library of Congress: Citing Electronic Sources, http://learning.loc.gov/learn/resources/cite/index.html

Suggestions for the citation of multimedia electronic resources.

Other links

  • Science Gems - An annotated directory of more than 2,500 science resources sorted by categories.
  • WebElements - The Periodic Table of Elements and more.
  • On-line Dictionaries - An Internet directory of dictionaries, more than 800 dictionaries of over 150 different languages.
  • Australian Libraries Gateway - The Australian Libraries Gateway (ALG) is a free World Wide Web-based directory service which improves access to information about more than 5,100 Australian libraries.
  • The Invisible Web - The World's Largest Directory of Searchable Information Sources on the Internet. You've heard the pitch about how big the Web is -- some 380 million pages and counting. But did you know that there is a virtual treasure trove of information that is not visible through the traditional search engines? Did you know that there are thousands of searchable databases, archives and other information sources delivering highly targeted information for a much improved searching experience?