internet research search engines & subject directories

15
Internet Research Search Engines & Subject Directories

Upload: gyles-bailey

Post on 23-Dec-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Internet Research Search Engines & Subject Directories

Internet Research

Search Engines & Subject Directories

Page 2: Internet Research Search Engines & Subject Directories

Search engines

• Search engines are the means by which most people search the Web.

• Common examples are Google, Altavista, Direct Hit.

Page 3: Internet Research Search Engines & Subject Directories

Yet they don’t search the Internet

• Yet a search engine does not actually search the Web during your search.

• A search engine searches itself.

• It’s a three-step process.

Page 4: Internet Research Search Engines & Subject Directories

1) Bots index words

• Search engines continually send out hundreds of “robots” or “bots” (or “spiders” or “crawlers” )

• Bots visit web sites, read word by word, and then index those words.

Page 5: Internet Research Search Engines & Subject Directories

2) A database is created

• A huge database of Web sites thus is gathered and indexed by word.

• These databases can be huge, with millions of links.

Page 6: Internet Research Search Engines & Subject Directories

3) The Interface gives you access

• Using the keywords you give it, a search engine then searches its own current index.

Page 7: Internet Research Search Engines & Subject Directories

Interfaces are based on rankings

• Search engines return results based on a ranking system.

• Ranking is the order that files are listed when they are retrieved.

Page 8: Internet Research Search Engines & Subject Directories

The ranking system is secret• These systems are proprietary and

often “secret.” In general:• Altavista ranks web pages higher if

your search terms are found in the first few words of the page

• Google ranks by document “popularity” with other similar searches

• Direct Hit ranks by the length of time other users spent at the site

Page 9: Internet Research Search Engines & Subject Directories

Not even half the Web

• With all of this software and sophistication, even the best search engines cover only 40-50% of the Web.

• And they miss much else on the Internet.

Page 10: Internet Research Search Engines & Subject Directories

Bots hit and miss

Bots miss: XML pages, pdf files Dynamically created HTML pages Frames-based pages New pages or recent updated text Some say the Invisible Web is 500

times larger than Web

Page 11: Internet Research Search Engines & Subject Directories

Subject Directories

• A subject directory is also a database of web sites and references.

• But a subject directory is organized not by keywords but by category or subject.

Page 12: Internet Research Search Engines & Subject Directories

Yahoo!

• Yahoo! Is the most popular subject directory.

• www. about.com takes the idea a step further with subject guides for selected topics.

Page 13: Internet Research Search Engines & Subject Directories

Subjects are organized by people.

• Information is selected, organized and cataloged by a person, not software.

• You can usually be more assured that the search results will make sense.

Page 14: Internet Research Search Engines & Subject Directories

You get an index of sites.

• Subject directories will not often provide you with ranked web sites.

• Instead, you will get a broad index related to your topic, divided further by subheadings.

Page 15: Internet Research Search Engines & Subject Directories

Use for early searching.• Use a subject directory

early in your search process to learn about your subject.

• You will get fewer links of higher quality.

• When you get more specific questions, you should use a search engine.