A major restriction of archie was its limitation to pattern matching on file
names rather than the actual content of the files. The Wide Area Information
Server (WAIS) project [KM91] implemented a more powerful concept by
searching through the text of documents in addition to their file names or titles.
Suppose you are interested in finding articles on Michael Jordan’s second come-
back to professional basketball, and you perform an archie search using
“Jordan” as your keyword. Even if the file named “NBA-News-September-
2001.txt” includes a story covering Jordan’s comeback, it would not turn up
under an archie search. As WAIS digs through the entire text of the article, that
file would appear with a WAIS search. Moreover, the WAIS mechanism pro-
vided a scored response, ranking retrieved information based on the quantity of
keyword appearances in the text and on how close to the document’s beginning
they turned up. WAIS was originally developed at the beginning of the 1990s by
a consortium of companies that included Thinking Machines Inc., Dow Jones,
Apple Computer, and KPMG Peat Marwick. The first version of WAIS was
available in the public domain in 1991. By summer 1992, the project had evolved
into a separate company called—not surprisingly—WAIS Inc. This company
can be considered the first to commercialize technology related to content
retrieval over the Internet.
However, the WAIS system was not perfect—the user interface was relatively
difficult to use and the search capabilities were initially limited to text docu-
ments. Besides, it scored documents based on the absolute number of keyword
appearances rather than the density of their appearance. As a result, long docu-
ments were more likely than short documents to end up at the top of the list.
WAIS further lacked the capability for hierarchical organization of content
resources—a feature introduced by the Gopher system [RFC 1436].
Gopher was developed at the University of Minnesota in 1991 and named
after the school’s furry mascot. It let users retrieve data over the Internet with-
out using complicated commands and addresses. Gopher servers searched the
Internet using WAIS and arranged the results in hierarchical menus, using plain
language. As users selected menu items, they were lead to other menus, files,
or images, which might not even have resided on the local Gopher server.
References could move users to remote servers or fetch files from distant loca-
tions. Gopher significantly simplified information retrieval on the Internet. It
handled the details of actually getting requested information, without requiring
users to know how and from where to retrieve those resources. Initially deployed
only on the University of Minnesota campus, other institutions quickly discov-
ered Gopher’s versatility and set up their own Gopher servers. At one time, there
were a few thousand Gopher servers registered with the top-level server
“Gopher Central” at the University of Minnesota or its counterparts in other
countries.
Archie, WAIS, and Gopher emerged in the same era and coexisted for some
time. They all had their advantages and disadvantages, and occasionally, they
are still used today. Nevertheless, in the course of the 1990s, they all were
subsumed into yet another system—the World Wide Web (WWW).
1.1 The Early Days of Content Delivery over the Internet 3
Ch01.qxd 01/19/2005 12:49 PM Page 3