The most important measure of a search
engine is the quality of its search results.
While a complete user evaluation is
beyond the scope of this paper, our own
experience with Google has shown it to
produce better results than the major
commercial search engines for most
searches. As an example which illustrates
the use of PageRank, anchor text, and
proximity, Figure 4 shows Google's
results for a search on "bill clinton".
These results demonstrates some of
Google's features. The results are
clustered by server. This helps
considerably when sifting through result
sets. A number of results are from the
whitehouse.gov domain which is what
one may reasonably expect from such a
search. Currently, most major commercial
search engines do not return any results
from whitehouse.gov, much less the right
ones. Notice that there is no title for the
first result. This is because it was not
crawled. Instead, Google relied on anchor
text to determine this was a good answer
to the query. Similarly, the fifth result is
an email address which, of course, is not
crawlable. It is also a result of anchor text.
All of the results are reasonably high
quality pages and, at last check, none
were broken links. This is largely because
they all have high PageRank. The
PageRanks are the percentages in red
along with bar graphs. Finally, there are no results about a Bill other than Clinton or about a Clinton
other than Bill. This is because we place heavy importance on the proximity of word occurrences. Of
course a true test of the quality of a search engine would involve an extensive user study or results
analysis which we do not have room for here. Instead, we invite the reader to try Google for themselves
at http://google.stanford.edu.