incorporating metadata into search user interfaces

70
Incorporating Metadata into Search User Interfaces Marti Hearst UC Berkeley UCB Digital Libraries Seminar Oct 10, 2000

Upload: storm

Post on 04-Feb-2016

55 views

Category:

Documents


0 download

DESCRIPTION

Incorporating Metadata into Search User Interfaces. Marti Hearst UC Berkeley. UCB Digital Libraries Seminar Oct 10, 2000. Web Search is Working!. Survey finds high user satisfaction Study by npd group. Web Search is Working!. Survey finds high user satisfaction - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Incorporating Metadata into Search User Interfaces

Incorporating Metadata into Search User Interfaces

Marti HearstUC Berkeley

UCB Digital Libraries Seminar Oct 10, 2000

Page 2: Incorporating Metadata into Search User Interfaces

From http://searchenginewatch.internet.com/reports/npd.html

Web Search is Working!Survey finds high user satisfaction

Study by npd group

Page 3: Incorporating Metadata into Search User Interfaces

From http://searchenginewatch.internet.com/reports/npd.html

Web Search is Working!Survey finds high user satisfaction

(a recent upswing – the decline was caused by an increase in # of pages indexed)

Page 4: Incorporating Metadata into Search User Interfaces

From http://searchenginewatch.internet.com/reports/npd.html

Web Search is Working!Why? Queries are still short!

Average query length currently ~2.4 words (Doug Cook, Inktomi)

Page 5: Incorporating Metadata into Search User Interfaces

My guess:

Web Search is Successful at Finding Good Starting Points (home pages)

Page 6: Incorporating Metadata into Search User Interfaces

Evidence

Web search engines are heavily using Link analysis Page popularity Interwoven categories

These all find dominant home pages

Page 7: Incorporating Metadata into Search User Interfaces
Page 8: Incorporating Metadata into Search User Interfaces
Page 9: Incorporating Metadata into Search User Interfaces

Consequences

Web search engines are providing source selection! A side note:

A digital library issue as well. DL’s make people do this step explicitly. People don’t generally like this!

What happens at the site? Follow hyperlinks or use site search

Page 10: Incorporating Metadata into Search User Interfaces

Following Hyperlinks

Works great when it is clear where to go next

Frustrating when the desired directions are undiscernable or unavailable

Page 11: Incorporating Metadata into Search User Interfaces

Site Search

This is not getting good reviews Large, disorganized results sets

Page 12: Incorporating Metadata into Search User Interfaces

An Analogy

text searchhypertext

Page 13: Incorporating Metadata into Search User Interfaces

Analogy

Hypertext: A fixed number of choices of where to go next; A glance at the map tells you where you are; But may not go where you want to go.

To get from Topeka to Santa Fe, may have to go through Frostbite Falls

Site Search: Can go anywhere; But may get stuck, disoriented, in a crevass!

Page 14: Incorporating Metadata into Search User Interfaces

Goal: An All-Tertrain Vehicle

The best of both techniques A vehicle that magically lays down track to

suggest choices of where you want to go next based on what you’ve done so far and what you are trying to do

The tracks follow the lay of the land and go everywhere, but cross over the crevasses

The tracks allow you to back up easily

Page 15: Incorporating Metadata into Search User Interfaces

How to make an all-tertrain vehicle?

Two ideas:Focus on the task.Use metadata explicitly.

Page 16: Incorporating Metadata into Search User Interfaces

The Importance of the Task Results from HCI suggest the importance

of taking the task into account.

Searching patent databases vs. Proving non-infringement Browsing newsgroups vs. Finding the denial-of-service hacker Getting all satellite news vs. Anticipating the competition

Page 17: Incorporating Metadata into Search User Interfaces

The Importance of the Task: Indirect Evidence

How does Web page download time effect usability? In one study, Spool found:

(56kbit modem) Amazon: 36 sec/page (avg) About.com: 8 sec/page (avg)

Users rated the sites: Fastest: Amazon Slowest: About.com

Why?

Page 18: Incorporating Metadata into Search User Interfaces

The Importance of the Task

Perceived speed Strong correlation between perceived speed

and whether the users felt they completed their task

Strong correlation between perceived speed and whether the users felt they always knew what to do next (scent).

Page 19: Incorporating Metadata into Search User Interfaces

Metadata

Page 20: Incorporating Metadata into Search User Interfaces

Metadata types

Time/Date Topic RoleGeoRegion

Page 21: Incorporating Metadata into Search User Interfaces

Content-based Metadata

Medical text Anatomy, Disease, Chemicals, Procedures…

Architectural images Location, Style, Materials, Period …

Recipes! Cuisine, Ingredients, Season, Calories …

Example: SOAR vs. epicurious

Page 22: Incorporating Metadata into Search User Interfaces

soar.berkeley.edu/recipes

Page 23: Incorporating Metadata into Search User Interfaces

soar.berkeley.edu/recipes

Page 24: Incorporating Metadata into Search User Interfaces

soar.berkeley.edu/recipes

Page 25: Incorporating Metadata into Search User Interfaces

soar.berkeley.edu/recipes

Page 26: Incorporating Metadata into Search User Interfaces

www.epicurious.com

Page 27: Incorporating Metadata into Search User Interfaces

www.epicurious.com

Page 28: Incorporating Metadata into Search User Interfaces

www.epicurious.com

Page 29: Incorporating Metadata into Search User Interfaces

www.epicurious.com

Page 30: Incorporating Metadata into Search User Interfaces

Epicurious Metadata Usage

Advantages Creates combinations of metadata on the fly Different metadata choices show the same information in

different ways Previews show how many recipes will result Easy to back up Supports several task types

``Help me find a summer pasta,'' (ingredient type with event type), ``How can I use an avocado in a salad?'' (ingredient type with dish type), ``How can I bake sea-bass'' (preparation type and ingredient type)

Page 31: Incorporating Metadata into Search User Interfaces

Epicurious Metadata UsageProblem: lacks integration with search

Page 32: Incorporating Metadata into Search User Interfaces

What about Yahoo?

Routes through the metadata are Predefined Unstable (due to symbolic links) Long (due to bad mixing of metadata) Example: Where is Berkeley?

College and University > Colleges and Universities >United States > U > University of California > Campuses > Berkeley

U.S. States > California > Cities >Berkeley > Education > College and University > Public > UC Berkeley

Page 33: Incorporating Metadata into Search User Interfaces

Yahoo using metadata well

Yahoo restaurant guide combines: Region Topic (restaurants) Related Information

Other attributes (cuisines) Other topics related in place and time (movies)

Page 34: Incorporating Metadata into Search User Interfaces

Green: restaurants & attributes

Red: related in place & time

Yellow: geographic region

Page 35: Incorporating Metadata into Search User Interfaces

Combining Information Types

Region State

City

A & E Film Theatre Music Restaurants

California Eclectic Indian French Assumed task: looking for

evening entertainment

Page 36: Incorporating Metadata into Search User Interfaces

Other Possible Combinations

Region + A&E City + Restaurant + Movies City + Weather City + Education: Schools Restaurants + Schools …

Page 37: Incorporating Metadata into Search User Interfaces

Bookstore preview combinations topic + related topics topic + publications by same author topic + books of same type but related topic

Page 38: Incorporating Metadata into Search User Interfaces

Problems with Metadata Usage

Standard approaches Paths are hand-edited, predefined Not well-integrated with search Not tailored to task as it develops Not personalized Not dynamic

Page 39: Incorporating Metadata into Search User Interfaces

A new project: FLAMENCOFLexible Access using MEtadata in Novel COmbinations Main ideas:

Make metadata an explicit part of the interface, but in a highly-usable manner

Preview and postview choices Determine views dynamically and (semi)

automatically, using a task-based model

Page 40: Incorporating Metadata into Search User Interfaces

Flamenco: Dynamic Previews

Medical example Allow user to select metadata in any order At each step, show different types of relevant

metadata, based on prior steps and personal history, include # of documents

Previews restricted to only those metadata types that might be helpful

Page 41: Incorporating Metadata into Search User Interfaces

Asthma > Steroids

1. A steroid-induced acute psychosis in a child with athsma.2. Management of steroid-dependent asthma with methotrexate.

Steroids•Pregnanes• Pregnadienes (5)• Prednisone (5)• Pregnenes• Budesonide (4)• Corticosterone (3)

Other Views• Admin & Dosage (50)• Drug Effects (20• Therapeutic Use (25)• Risk Factors (4)• More …

User Preferred• Musculoskeletal (4)•Drug Resistance (6)

•All Categories (99)

99 Documents: [Sort by author] [Sort by popularity] [Sort by Steroids] [Cluster] 1. Effect of short-course budesonide on the bone turnover of asthmatic children.2. Effect of prednisone on response to influenza virus vaccine in asthmatic children.…

Page 42: Incorporating Metadata into Search User Interfaces

Asthma > Steroids

1. A steroid-induced acute psychosis in a child with athsma.2. Management of steroid-dependent asthma with methotrexate.

Steroids•Pregnanes• Pregnadienes (5)• Prednisone (5)• Pregnenes• Budesonide (4)• Corticosterone (3)

Other Views• Admin & Dosage (50)• Drug Effects (20• Therapeutic Use (25)• Risk Factors (4)• More …

User Preferred• Musculoskeletal (4)•Drug Resistance (6)

•All Categories (99)

99 Documents: [Sort by author] [Sort by popularity] [Sort by Steroids] [Cluster] 1. Effect of short-course budesonide on the bone turnover of asthmatic children.2. Effect of prednisone on response to influenza virus vaccine in asthmatic children.…

Page 43: Incorporating Metadata into Search User Interfaces

Asthma > Steroids

1. A steroid-induced acute psychosis in a child with athsma.2. Management of steroid-dependent asthma with methotrexate.

Steroids•Pregnanes Pregnadienes (5) Prednisone (5)• Pregnenes Budesonide (4) Corticosterone (3)

Other Views• Admin & Dosage (50)• Drug Effects (20• Therapeutic Use (25)• Risk Factors (4)• More …

User Preferred• Musculoskeletal (4)•Drug Resistance (6)

•All Categories (99)

99 Documents: [Sort by author] [Sort by popularity] [Sort by Steroids] [Cluster] 1. Effect of short-course budesonide on the bone turnover of asthmatic children.2. Effect of prednisone on response to influenza virus vaccine in asthmatic children.…

Page 44: Incorporating Metadata into Search User Interfaces

Asthma > Steroids > Admin & Dosage

1. Dosage levels for asthmatic steroids: A survey.

Steroids•Pregnanes Pregnadienes (3) Prednisone (5)

Related Categories•Inhalators (40)•Emotional Effects (25)•Preferred Suppliers (30)

User Preferred• Musculoskeletal (0)•Drug Resistance (2)

•All Categories (50)

50 Documents: [Sort by author] [Sort by popularity] [Sort by Dosage] [Cluster]

1. Optimal dosage levels for prednisone in the treatment of childhood asthma.2. …

Page 45: Incorporating Metadata into Search User Interfaces

Other paths: back up and go forward

Asthma > Steroids > Budesonide > Huang

Asthma > Huang > Budesonide

Asthma > Steroids

Asthma > Steroids > Budesonide

Page 46: Incorporating Metadata into Search User Interfaces

Dynamic Metadata Previews

How different from Yahoo & Amazon? Dynamically determine what to show next

Yahoo’s combos are predefined Amazon’s are also predefined, and limited to taste

and general topic only A way to seamlessly integrate

Related topics User preferences (personalization) Context-sensitivity

Page 47: Incorporating Metadata into Search User Interfaces

Evaluation Methodology

Regression Test Select a set of tasks

Use these throughout the evaluation Start with a baseline system

Evaluate using the test tasks Add a feature

Evaluation again Compare to baseline Only retain those changes that improve results

Page 48: Incorporating Metadata into Search User Interfaces

Image Search

Content analysis is making strides Rich hand-assigned metadata is available But most search based on

Keyword matching (alltheweb/lycos multimedia) Image-component based querying (QBIC) Overall similarity to sample image (Blobworld) Combo of keyword and image component

Page 49: Incorporating Metadata into Search User Interfaces

Image Search: What is the task?

Illustrate my slides? “Find a crevasse” Keyword match works pretty well

Find inspiration for an architectural design? General similarity: maybe But more control might be better

Page 50: Incorporating Metadata into Search User Interfaces
Page 51: Incorporating Metadata into Search User Interfaces

Blo

bwor

ld

Page 52: Incorporating Metadata into Search User Interfaces
Page 53: Incorporating Metadata into Search User Interfaces

Slide by Ame Elliott

Architects’ Image Use Work practices

Observations from personal design experience, and surveys of designers

Common activities for image use Browsing most common at early stages of design Collage making, sketching, pinning up on walls

Cultural and social practice Designers learn how to do this in schools Ways of communicating with images varies with

organization

Page 54: Incorporating Metadata into Search User Interfaces

Slide by Ame Elliott

Current Problems Browsing Images On-Line Current on-line image collections offer few

advantages over paper collections Lose papers’ ease of manipulation Little gain in accessibility

Queries are textual and must be well-formed Not appropriate for the early phases of design when

image browsing is critical Image search engines don’t follow good UI design

in general Poor support for search starting points, collection

visualization

Page 55: Incorporating Metadata into Search User Interfaces

Slide by Ame Elliott

Creating Browsing Scenarios Rationale

Learned about search strategies and how architects look for images now

3 design scenarios presented to 2 architects “Add handicapped ramp to entrance of suburban

home” “Design addition to children’s home in a Victorian

mansion” “Design a maritime cultural center on the beach in

San Diego”

Page 56: Incorporating Metadata into Search User Interfaces

Slide by Ame Elliott

Results of Interviews

Were they believable professional problems? Yes

How would they browse for images to help with these tasks? Gathered lists of terms Learned some ideas about strategies High degree of consistency between interviewees

Page 57: Incorporating Metadata into Search User Interfaces

How different from medical example?

More open-ended Easier to scan many images quickly Tertrain metaphor not used here

Not narrowing down a large set Rather, always viewing more images A mechanism for “steering” through the

metadata

Page 58: Incorporating Metadata into Search User Interfaces

SPIRO:>40,000 art & architecture imagesDetailed metadata

Page 59: Incorporating Metadata into Search User Interfaces

SPIRO Query Form

Page 60: Incorporating Metadata into Search User Interfaces

SP

IRO

que

ry o

n S

ubje

ct: c

hurc

h

Page 61: Incorporating Metadata into Search User Interfaces

A Better Example

Greatbuildings.com Hyperlinks metadata together But a small collection

~1000 buildings ~4500 images total

Page 62: Incorporating Metadata into Search User Interfaces

www.greatbuildings.com

Page 63: Incorporating Metadata into Search User Interfaces

www.greatbuildings.com

Page 64: Incorporating Metadata into Search User Interfaces

www.greatbuildings.com

Page 65: Incorporating Metadata into Search User Interfaces

www.greatbuildings.com

Page 66: Incorporating Metadata into Search User Interfaces

www.greatbuildings.com

Page 67: Incorporating Metadata into Search User Interfaces

www.greatbuildings.com

Page 68: Incorporating Metadata into Search User Interfaces

www.greatbuildings.com

Page 69: Incorporating Metadata into Search User Interfaces

The Approach Create an architecture that allows

experimentation with different approaches Add functionality in a stepwise fashion Architecture task:

Emphasize images over text Use greatbuildings-style interface as a reasonable

baseline for comparison Find out how much choice is too much Find out whether explicit metadata is better than

implicit more-like-this

Page 70: Incorporating Metadata into Search User Interfaces

Summary

Standard search is too flexible Hyperlinks too restrictive Flamenco:

Task-centric search interfaces Integrate metadata with search Dynamic previews Easily retrace steps Systematically determine what works for real users