Oakland Community College Logo - Link to College Home Page
OCC Libraries - Link to Libraries Home Page

Tutorials Welcome
A WORD OF CAUTION
A number of search engines feature “pay for placement.”  That means that the results that rank the highest―those that you see first―are there because those sites have paid to come up first.

Google, for example, has paid ads on the right side of the results screen, but it may also place "sponsored links" (more paid ads) on top of the other results of a search. In Google these are clearly labeled as “sponsored.” However, some of the more unscrupulous engines either omit any kind of warning or give warnings that are very difficult to understand. If you learn that a search engine does this, avoid it like the plague.

Library
Tutorial
Module 4: Effective Searching on the Web

Still More about Search Engines

Spiders crawling the web?  Please elaborate!

  1. The spider program reviews pages on the web, following links from one page to another, and then stores the pages and indexes them (classifies them by words contained and subjects involved).  But some pages are not linked to other pages, so not all pages get found by the spiders.  (See the pink box to the left for more information.)
     

  2. After a spider retrieves a page, another program in the search engine indexes it.  That is, the program examines a variety of aspects to determine the subject(s) of the page, including but not limited to:

    1. the position of words in the page (e.g., in the title, the URL, the first paragraph, etc.)

    2. the order of the words

    3. the number of occurrences of words

    4. links in the page

Based on these factors, a link to the page is put in the appropriate area of an index, or list of subjects.  Then when you search, you actually search the index, which calls up the appropriate links to pages on the Internet.

But more than number of occurrences of words and their positions is involved.  Pages are also ranked in various ways.  Google, for example, employs a particular Abacusalgorithm for ranking pages called "PageRank."  This algorithm generally counts the number of other pages that link to a particular Webpage.  If page A has 26 pages linking to it, and page B has 23, all other things being equal, page A's PageRank is higher.

Still another factor is involved in PageRank.  How choosy are the pages that link to page A and page B?  If the 23 pages that link to page B have few outgoing links--that is, they are very particular in linking to other pages--that raises page B's PageRank.  Conversely, if the pages linking to page A have scads of links to other pages also, they are treated as not very discriminating and page A's PageRank sinks.

Again, all other things being equal, the higher the PageRank of a Webpage, the higher on the list of results is Google's link to the page.

Now we're ready for another look at how Google answered the question about the blue sky.  And then we'll discuss techniques for getting the best results out of search engines.


Contents  |  <Previous  |  Next>  |  Glossary  |  Library Home