Hidden Universes of Information
3. Search Engine Features
There are hundreds of search engines, which vary in size, methodology and features. To evaluate them, you may want to run your own controlled experiment, submitting the same query term to each search engine and comparing the results. You should also recognize the difference between a subject directory (Yahoo) and a search engine (Lycos or Alta Vista). Subject directories are best used during the early stages of your search, when you do not yet know much about a topic. Search engines are most useful when you are searching for a very specific term or concept.
When using any particular search engine, you should take the time to familiarize yourself with its default settings and learn what the advanced options are. Most users simply type in a couple of key words and get thousands of query results. Next, a user may add several more words to the query string, thinking that this will narrow the search. What most users do not realize is that the default logical operator in most search engines is "or" (not "and"). Under "or", every added word matches additional pages, so adding more words to a query string can often dilute your search results.
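The effect of the default operator can be seen in a small sketch. This is not how any real search engine is implemented; the four "documents" and the `search` function are invented for illustration, and matching is done on whole words only.

```python
# Toy corpus: document id -> text (invented purely for illustration).
DOCS = {
    1: "gigabit testbed results",
    2: "gigabit ethernet pricing",
    3: "wireless testbed deployment",
    4: "gardening tips",
}

def search(query, mode="or"):
    """Return sorted ids of documents matching the query words.

    mode="or"  : a document matches if it contains ANY query word.
    mode="and" : a document matches only if it contains ALL query words.
    """
    words = query.lower().split()
    combine = any if mode == "or" else all
    return sorted(
        doc_id
        for doc_id, text in DOCS.items()
        if combine(word in text.lower().split() for word in words)
    )

one_word = search("gigabit")                       # documents 1 and 2
two_words_or = search("gigabit testbed")           # grows to 1, 2 and 3
two_words_and = search("gigabit testbed", "and")   # shrinks to document 1
```

With "or", the second word adds document 3, which does not mention "gigabit" at all. With "and", the same two words cut the results down to the one page that contains both.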
Search Engine Guides:
Alta Vista is one of the most feature-rich search engines. It has two different query modes: simple and advanced. Each mode has its own "help page", which contains a wealth of information. I encourage you to pull up the help screen for each query mode and print it, so that you can refer to it while composing a query string. You will see that Alta Vista supports a full set of logical operators and lets you control the prioritization of the query results.
Many of the more interesting search options in Alta Vista stem from the way Alta Vista builds its index. Alta Vista not only indexes the full text displayed on every web page it harvests, but it also indexes the HTML of each page. As a result, Alta Vista will let you search for terms in conjunction with various HTML attributes. Consider the following examples:
|Query|Returns web pages which:|
|gigabit|contain the word "gigabit"|
|gigabit testbed|contain the word "gigabit" or "testbed"|
|gigabit host.gov|contain the word "gigabit" and are hosted in US Government web sites|
|gigabit host.uk|contain the word "gigabit" and are hosted in United Kingdom web sites|
|link:*.whitehouse.gov|contain a hyperlink to the White House web site|
|link:*.whitehouse.gov -host.whitehouse.gov|contain a hyperlink to the White House web site, but are not themselves located at the White House web site|
These options are also helpful in searching a particular web site which does not offer its own site search capability. Just go to Alta Vista and search for "keyword host:*.website.com".
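If you run site-restricted searches often, the query string can be composed mechanically. The sketch below simply follows the "keyword host:*.website.com" form quoted above; the function name `site_query` is my own, and the exact operator syntax should be checked against the search engine's help page before you rely on it.

```python
from urllib.parse import quote_plus

def site_query(keywords, site):
    """Compose 'keyword host:*.site', the form quoted in the text above.

    keywords : list of search words
    site     : the web site to restrict the search to, e.g. "website.com"
    """
    return f"{' '.join(keywords)} host:*.{site}"

# Example: look for "gigabit" only on pages hosted at website.com.
query = site_query(["gigabit"], "website.com")

# Percent-encode the query so it can be placed safely inside a URL.
encoded = quote_plus(query)
```

The encoding step matters only if you paste the query into a URL yourself; when typing into the search form, use the plain `query` string.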
To see the kind of "poor" search terms that other users try, look at metaspy.
Dejanews is a searchable index of the full text of all messages in all the Usenet newsgroups. Once again, I encourage you to go beyond just the default query interface. Your first stop will be the Dejanews Power Search Form, which contains a first level of options for controlling the search. Note that each option name is a hyperlink to the power search help page, which guides you in setting the parameters for your search (take the time to look at it). One of the most useful options to set is the "usenet database" option. By default, Dejanews searches its "current database", which only covers the past month or so of messages. To go back over a year into the newsgroups, you need to select the "old database".
One of the most powerful features of Dejanews is the ability to define a Query Filter. With a query filter, you can perform your keyword search against a very detailed subset of the entire Dejanews database. As shown in the course handouts, you can construct a query which shows all messages from government employees posted into the "comp" newsgroups during a two-month window.
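To make the idea of a query filter concrete, here is a rough sketch of the same kind of restriction applied to a small in-memory list. It does not use Dejanews itself; the `Message` record, the sample messages, and the `query_filter` function are all hypothetical, and I am assuming that "government employees" can be approximated by author addresses ending in ".gov".

```python
from dataclasses import dataclass
from datetime import date

@dataclass
class Message:
    author: str      # poster's email address
    newsgroup: str   # e.g. "comp.protocols.tcp-ip"
    posted: date
    body: str

# Invented sample messages, for illustration only.
MESSAGES = [
    Message("jdoe@agency.gov", "comp.protocols.tcp-ip", date(1997, 3, 10), "gigabit testbed notes"),
    Message("jdoe@agency.gov", "rec.gardening", date(1997, 3, 12), "gigabit of tomato photos"),
    Message("sam@company.com", "comp.dcom.lans", date(1997, 3, 15), "gigabit ethernet question"),
    Message("kim@lab.gov", "comp.dcom.lans", date(1997, 8, 1), "gigabit switch review"),
]

def query_filter(messages, keyword, author_suffix, group_prefix, start, end):
    """Keep messages that contain the keyword AND pass every filter condition."""
    return [
        m for m in messages
        if keyword.lower() in m.body.lower()
        and m.author.lower().endswith(author_suffix)
        and m.newsgroup.startswith(group_prefix)
        and start <= m.posted <= end
    ]

# Keyword "gigabit", .gov authors, "comp" newsgroups, a two-month window.
hits = query_filter(MESSAGES, "gigabit", ".gov", "comp.",
                    date(1997, 3, 1), date(1997, 4, 30))
```

All four sample messages contain the keyword, but only the first one also satisfies the author, newsgroup and date conditions. That is the point of a filter: the keyword alone is too broad, and each condition trims the subset being searched.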
Another interesting feature of Dejanews is called the "Author Profile". When you are reading a particular posting via Dejanews, the "Author Profile" will search Dejanews for that author's email address and show you all the messages that person has posted into any newsgroup. This can be an invaluable tool to get a sense of what that person is all about. Organizations can also use Dejanews to see if their employees have been conforming to the organization's access policy with regard to the newsgroups. All you have to do is limit your search to authors from *@company.com.
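The two ideas in this paragraph, profiling one author and matching a wildcard such as *@company.com, can be sketched as follows. Again, this is only an illustration over invented data; `POSTS`, `author_profile` and `authors_matching` are hypothetical names, and the wildcard is interpreted here with Python's shell-style `fnmatch`, which may differ from how Dejanews interprets it.

```python
from collections import Counter
from fnmatch import fnmatch

# Invented (author, newsgroup) pairs, one per posting.
POSTS = [
    ("alice@company.com", "comp.lang.c"),
    ("alice@company.com", "comp.lang.c"),
    ("alice@company.com", "rec.music.misc"),
    ("bob@company.com", "misc.jobs.offered"),
    ("carol@elsewhere.org", "comp.lang.c"),
]

def author_profile(posts, author):
    """Count how many messages one author posted to each newsgroup."""
    return Counter(group for poster, group in posts if poster == author)

def authors_matching(posts, pattern):
    """List the distinct authors whose address matches a wildcard pattern."""
    return sorted({poster for poster, _ in posts
                   if fnmatch(poster.lower(), pattern.lower())})

profile = author_profile(POSTS, "alice@company.com")
company_authors = authors_matching(POSTS, "*@company.com")
```

Here `profile` shows where one person posts and how often, while `company_authors` lists everyone posting from the organization's addresses.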
If search engines are giving you thousands of "hits", it is time to start using the advanced features of the search engine. Take the time to learn how your favorite search engine works and how it interprets your search query.
Note for my Alumni: Your referrals are always appreciated