1 deep web searching carl heine, ph.d. illinois mathematics and science academy
TRANSCRIPT
1
Deep Web SearchingDeep Web Searching
Carl Heine, Ph.D.Carl Heine, Ph.D.
Illinois Mathematics and Science AcademyIllinois Mathematics and Science Academy
2
Problem: Translating a question into a query
Problem: Choosing the best database
Problem: Finding better keywords
Problem: Recognizing information that’s relevant
Problem: Verifying the credibility of information
Illinois Technology Standards for Teachers: 2, 6, 7, 8
3
What digital natives don’t do well
What they tend to do instead
Turn a question into a queryRush ahead toward an answer, either grabbing the whole question “as is” or missing an important part of it
Select the ‘right’ database
Enter words or phrases into Google
Recognize information when they find it
Rush past important information and clues, continue to browse
Find better keywords Stick with their original words and browse
Verify the credibility of informationAccept what they find at face value, hoping somewhere in the information there is an answer
4
Problem: Selecting the right Database
This failure occurs before submitting the first query.
Task: Predict where expert information may be found.
No search engine performs a live Internet search when you submit a query.
That which is stored in one search engine’s database is invisible to another search engine (also called Enterprise Data)
5
Research Findings (IMSA second semester sophomores)
90% search with Google more than half of the time.
32% use Google exclusively.
Problem: Selecting the right Database
6
Modes of Searching
Table of Contents
Index
Flipping through the pages
Subject Directory
Search Engine
Browsing (surfing)
7
Subject
Directory
8
Search Engine
9
Browsing
10
Information that is stored in a different database than where you are searching now.
Deep Web, Invisible Web
Information that is “buried” deeper in the abstracts than results from another database.
11
Retrieval Speed e.g., Results 1 - 10 of about 111,000,000 (0.11 seconds)
HTTP Error 404
Evidence of Database Searching
12
Robotic programs known as Crawlers or Spiders…
…Retrieve information from web pages…
…That is compiled into a database…
…Called out by queries passed through a search engine…
…Displayed as abstracts (snippets)
Database Searching
13
Information retrieved (Google): Words in the <title> Characters in the URL Words in the body Links from this page to another page
Database Searching
Information available (Google): One or more Words in the <title> Characters in the URL One or more Words in the body (except stop words) Pages that link to this page
<html> <head> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> <meta name="keywords" content="Accessibility, Web accessibility, alternative browsers, alternative browsing, assistive technologies"> <meta name="description" content="Reference list of alternative browsers and assistive technologies for people with disabilities using the Web."> <title>Alternative Web Browsing</title> <link rel="stylesheet" type="text/css" href="http://www.w3.org/WAI/style/wairesource.css"> </head>
14
Database Searching
Information available (Google): One or more Words in the <title> Characters in the URL One or more Words in the body (except stop words) Pages that link to this page
15
Strategies for navigating the Deep Web
Making an educated prediction:
1. Where would an expert post the information?
2. Can Google google it?
16
Strategies for navigating the Deep Web
Where would an expert post the information? Who might know the answer to my question?
Where could I find that person?
17
Strategies for navigating the Deep Web
Where would an expert post the information?
An obvious choice? Keywords: <subject matter> + database Instead of database, try archive, repository, historical,
information, facts, almanac
18
Strategies for navigating the Deep WebWhat classical composer's music is heard
most often in movies?
An obvious choice?
Keywords: movie classical music database
OR information OR facts OR …?
19
Strategies for navigating the Deep Web
How many movies has John Williams, the classical guitarist, scored?
An obvious choice?
Keywords: ?? +database
OR information OR facts OR …?
20
Strategies for navigating the Deep Web
What airplane finished the Sport Qualifying race in second place at the Reno Air Races, September 11, 2002?
An obvious choice?
Keywords: ?? +database
OR information OR facts OR …?
21
Strategies for navigating the Deep Web
Can Google google it?
Types of information that Google is NOT LIKELY to access:
1. Information stored behind a SEARCH BOX
22
Strategies for navigating the Deep Web
Can Google google it?
Types of information that Google is NOT LIKELY to access:
2. Information stored behind a PASSWORD
Investors.com
23
Strategies for navigating the Deep Web
Can Google google it?
3. Other types of information that Google is NOT LIKELY to access:
24
Strategies for navigating the Deep Web
Page Not Found www.archive.org
25
How, When and Where can Deep Web How, When and Where can Deep Web searching skills be taught?searching skills be taught?
Performance skills require hands-on practice
Interactive tutorials (example: Can Google google it?) MicroModules (example: Invisible Web) Timed Search Challenges (example: Broadway shows)
When and Where can these skills be taught? In the context of research paper preparation Library and research orientation Computer lab and non-digital applications
26keywords: information fluency
On line Resources for Information FluencyOn line Resources for Information Fluency
27
Contact Us!Contact Us!URL http://21cif.imsa.edu
General info [email protected] Carl Heine [email protected] O’Connor [email protected]
Thank you for participating!