NCSU Libraries Endeca and faceted browsing: Giving the user a useful catalog Scott Warren NCSU Libraries South Carolina Library Association Annual Meeting June 7, 2007

Outline

1. Problem and Context

2. Online searching, shopping, and examples

3. Demo

4. Faceted Navigation

5. Implementation Challenges

6. Facet Usage Statistics

7. Reflections

The Context

Online Catalogs

"Most integrated library systems, as they are currently configured and used, should be removed from public view."

- Roy Tennant, CDL

What is the problem?

• Existing catalogs are hard to use:

– known item searching works pretty well, but …

• users often do keyword searching and get large result sets returned in system sort order (last in, etc.)

• catalogs are unforgiving on spelling errors, stemming

• Authority searching completely mystifying

Catalog metadata is buried

• Subject headings are not leveraged in searching
  – they should be browsed or linked from, not searched

• Data from the item record is not leveraged
  – should be able to filter by item type, location, circulation status, popularity

Word of the Day for Saturday, May 5, 2007

• moil \MOYL\, intransitive verb:

• 1. To work with painful effort; to labor; to toil; to drudge.
• 2. To churn or swirl about continuously.
• 3. Toil; hard work; drudgery.
• 4. Confusion; turmoil.

What’s the big picture?

• Improve the quality of the library catalog user experience.

• Exploit our existing metadata infrastructure (make MARC work harder).

• Build a more flexible catalog tool that can be integrated with discovery tools of the future.

What is Endeca?

• Software company based in Cambridge, MA

• Search/information access technology provider for a number of major e-commerce websites

• Developers of the Endeca Information Access Platform

Why Endeca?

• Customized relevance ranking of results

• Better subject access by leveraging available metadata through facets

• Improved response time

• Enhanced natural language searching through spell correction, etc.

• Browse

A question

• “How is the new generation of library catalog being developed?”

• informed and enhanced by search technologies developed outside of the library

• based on how our users know how to search, not on how we want them to search

• What does search look like for our users?

Examples

Faceted Navigation on the Web

[Screenshot annotated with the labels "Facet" and "Value"]

Faceted Navigation in Libraries

Demonstration

Faceted Navigation

What is Faceted Navigation?

• Search and browse in a single interface

• Facets can vary in scope
  – What is the item about?
  – What kind of item is it?
  – Where is it?

• Enables users to narrow results

• Macroscopic behavior of results set

• Clues to being on the right path

Origins of Facets

• 1930s Ranganathan

• Colon Classification

Cartesian Coordinates

Coordinate System

[Diagram: three-dimensional coordinate system]

Axes: Library (x), LCSH (y), Format (z)

(x, y, z) = (Library, LCSH, Format)

Axis values shown: Library: Branch 1, Branch 2; LCSH: History, Art; Format: Book, DVD

Example points: (Branch 1, History, Book), (Branch 2, History, DVD)

• Multiple records could be associated with each coordinate point.

• Each point is associated with at least one record.
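
The coordinate picture can be made concrete with a minimal Python sketch in which each record is filed under its (Library, LCSH, Format) point. The sample titles and the `index_by_point` helper are invented for illustration and do not come from the NCSU catalog.

```python
from collections import defaultdict

# Hypothetical sample records; titles and values are made up for illustration.
RECORDS = [
    {"title": "A History of Raleigh", "library": "Branch 1", "lcsh": "History", "format": "Book"},
    {"title": "Civil War Documentary", "library": "Branch 2", "lcsh": "History", "format": "DVD"},
    {"title": "Textile Mills of the South", "library": "Branch 1", "lcsh": "History", "format": "Book"},
]


def index_by_point(records):
    """Group record titles under their (library, lcsh, format) coordinate."""
    points = defaultdict(list)
    for rec in records:
        point = (rec["library"], rec["lcsh"], rec["format"])
        points[point].append(rec["title"])
    return dict(points)


points = index_by_point(RECORDS)
for point, titles in points.items():
    print(point, "->", titles)
```

Because the index is built from the records themselves, every point that exists holds at least one record, and a point such as (Branch 1, History, Book) can hold several.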

Another way to think about it

• 11-dimensional lattice space (one dimension per facet)

• All points associated with at least one item/record

• Records can be associated with > 1 point

• Keyword search selects subset of points with word(s) in record

• Facets shown are those dimensions corresponding to the points in that set (nonzero values).

• Choosing a facet value is equivalent to slicing through the multidimensional lattice on a plane along that facet value and reducing the lattice’s dimension by 1.

• Choose enough facets and you will get down to a few items (never a null set)
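
The behavior described above can be sketched in a few lines of Python. This is an illustrative model only, not how the Endeca engine works internally; the sample records, the `FACETS` list, and the function names are all assumptions.

```python
from collections import Counter

# Hypothetical records. A record may carry several values for one facet,
# which is how a single record can sit on more than one lattice point.
RECORDS = [
    {"title": "Weaving history", "Format": ["Book"], "Library": ["Textiles"],
     "Subject: Topic": ["Weaving", "History"]},
    {"title": "History of dyes", "Format": ["Book"], "Library": ["D.H. Hill"],
     "Subject: Topic": ["Dyes and dyeing", "History"]},
    {"title": "Weaving on film", "Format": ["DVD"], "Library": ["Textiles"],
     "Subject: Topic": ["Weaving"]},
]
FACETS = ["Format", "Library", "Subject: Topic"]


def keyword_search(records, word):
    """Select the records whose title contains the word (case-insensitive)."""
    word = word.lower()
    return [r for r in records if word in r["title"].lower()]


def facet_counts(records):
    """Count records per facet value; values with zero hits never appear."""
    return {f: Counter(v for r in records for v in r[f]) for f in FACETS}


def refine(records, facet, value):
    """Slice the current result set along one facet value."""
    return [r for r in records if value in r[facet]]


hits = keyword_search(RECORDS, "weaving")
counts = facet_counts(hits)
narrowed = refine(hits, "Format", "DVD")
print(len(hits), dict(counts["Format"]), [r["title"] for r in narrowed])
```

Since only facet values with a nonzero count are offered, any value the user picks is guaranteed to match at least one record, which is why refinement never produces a null set.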

Implementation

Implementation Challenges

• Facet selection

• Interface design

• Data issues

Endeca at NCSU

• Endeca used to improve the discovery portion of the library catalog

• Endeca software indexes 1.6 million MARC records exported nightly from Sirsi Unicorn ILS

• Backend functions of ILS remain intact

Facets Implemented at NCSU

• Availability
• Author
• Library
• Format
• Language
• Browse: New
• LC Classification
• Subject: Topic
• Subject: Genre
• Subject: Region
• Subject: Era

Facet Selection

Interface Design

• Iterative approach using wireframes

• Eight major revisions in a four-month period

• Still lots of room for improvement

Technical Overview

• Endeca co-exists with SirsiDynix Unicorn ILS and Web2 online catalog

• Endeca handles keyword search

• Web2 handles authority search and detail page display

• Endeca indexes MARC records exported nightly from Unicorn

• Endeca = discovery portion of the ILS

Technical Overview

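[Diagram: data flow]

Raw MARC data -> NCSU exports and reformats -> Flat text files -> Data Foundry (parse text files) -> Indices -> MDEX Engine -> NCSU Web Application

Two connections are labeled HTTP; the diagram also carries the label "Information Access Platform".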
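
The "export and reformat" step can be illustrated with a small Python sketch that flattens a MARC-like record into text lines and parses them back. The flat-file layout, the `FIELD_NAMES` mapping, and the dictionary stand-in for a MARC record are assumptions made for illustration; the slides do not describe the actual format handed to the Data Foundry.

```python
# Hypothetical, simplified stand-in for an exported MARC record:
# MARC tag -> list of field values (real records also have indicators
# and subfields, which are ignored here).
marc_like = {
    "001": ["rec0001"],                              # control number
    "245": ["Textile fibers and dyes"],              # title statement
    "650": ["Textile fibers", "Dyes and dyeing"],    # topical subject headings
}

# Assumed mapping from MARC tags to flat-file field names (illustrative only).
FIELD_NAMES = {"001": "id", "245": "title", "650": "subject_topic"}


def flatten(record):
    """Return one 'name=value' line per field value, in tag order."""
    lines = []
    for tag in sorted(record):
        name = FIELD_NAMES.get(tag)
        if name is None:
            continue  # skip tags that are not mapped to a flat-file field
        for value in record[tag]:
            lines.append(f"{name}={value}")
    return lines


def parse(lines):
    """Read 'name=value' lines back into a field -> values mapping."""
    parsed = {}
    for line in lines:
        name, _, value = line.partition("=")
        parsed.setdefault(name, []).append(value)
    return parsed


flat = flatten(marc_like)
print("\n".join(flat))
print(parse(flat))
```

Repeated fields such as the subject headings stay as separate lines, so each heading can later become its own facet value.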

[The same diagram is repeated on two further slides, one highlighting the portion labeled "Offline - Nightly" and one highlighting the portion labeled "Always Online".]

Implementation Team

• Seven-member team
  – 5 IT staff
  – 1 cataloging librarian
  – 1 reference librarian

• Timeline
  – License / negotiation: Spring 2005
  – Software acquisition: Summer 2005
  – Implementation: Aug 2005 to Jan 2006

Data Issues

• ILS data with MARC-8 encoding => Text data with UTF-8 encoding

• Data consistency between ILS and Endeca catalog indexes (updates!)

• Data issues revealed by exposing metadata (ex: subject headings) in facets
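
Exposing subject headings as facet values makes small inconsistencies visible, because each variant shows up as its own value. A minimal Python sketch of how such variants might be detected follows; the normalization rules and the sample headings are assumptions, not the checks NCSU actually ran.

```python
from collections import defaultdict


def normalize(heading):
    """Collapse case, extra whitespace, and trailing periods."""
    return " ".join(heading.strip().rstrip(".").split()).lower()


def find_variants(headings):
    """Return normalized headings that appear in more than one raw form."""
    groups = defaultdict(set)
    for heading in headings:
        groups[normalize(heading)].add(heading)
    return {key: sorted(forms) for key, forms in groups.items() if len(forms) > 1}


# Hypothetical headings as they might appear in a facet list.
headings = ["Textile industry", "Textile industry.", "textile  industry", "Dyes and dyeing"]
variants = find_variants(headings)
print(variants)
```

Each group in the result is a candidate for cleanup in the source records, so that the facet shows one value instead of several near-duplicates.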

Outcomes

Added search tools

• Automatic spell correction

• “Did you mean…” suggestions

• Automatic stemming

• Bookmark-ability
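
To give a feel for a "Did you mean…" suggestion, here is a small Python sketch built on the standard library's `difflib`. It is a stand-in technique chosen for illustration; the slides do not describe how Endeca computes its suggestions, and the vocabulary and cutoff below are assumptions.

```python
import difflib

# Hypothetical vocabulary; a real system would draw terms from its index.
VOCABULARY = ["weaving", "knitting", "embroidery", "polymers", "textiles"]


def did_you_mean(term, vocabulary=VOCABULARY, cutoff=0.75):
    """Suggest the closest known term, or None if the term is known or nothing is close."""
    term = term.lower()
    if term in vocabulary:
        return None  # already a known term, no suggestion needed
    matches = difflib.get_close_matches(term, vocabulary, n=1, cutoff=cutoff)
    return matches[0] if matches else None


for query in ["embrodery", "weaving", "xyz"]:
    print(query, "->", did_you_mean(query))
```

The cutoff controls how aggressive the suggestions are: a lower value offers more guesses at the cost of more irrelevant ones.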

True browse

• Regain ability to browse catalog without entering any search terms

Search and Navigation

• Search: 67%
• Navigation: 8%
• Search -> Navigation: 25%

(July 06 – Jan 07)

Requests by Search Type

• Search: 67%
• Includes Navigation: 33% (Navigation 8% + Search -> Navigation 25%)

(July 06 – Jan 07)

Navigation by Dimensions

• Subject: Topic: 26%
• LC Classification: 21%
• Format: 10%
• New: 10%
• Library: 10%
• Subject: Genre: 6%
• Author: 6%
• Subject: Region: 4%
• Language: 3%
• Subject: Era: 2%
• Availability: 2%

(July 06 – Jan 07)

Navigation by Dimension (most used)

[Bar chart: requests per dimension, July 06 – Jan 07; x-axis "Requests", 0 to 140,000. Dimensions from least to most used: Availability, Subject: Era, Language, Subject: Region, Author, Subject: Genre, Library, New, Format, LC Classification, Subject: Topic.]

Navigation by Dimension (order of UI presentation)

Dimension | Requests
Author | 32,650
Language | 16,009
Subject: Era | 12,257
Subject: Region | 22,818
Library | 54,476
Format | 57,667
Subject: Genre | 34,096
Subject: Topic | 145,589
LC Classification | 120,644
Availability | 9,286

(July 06 – Jan 07; chart x-axis "Requests", 0 to 160,000)
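
As a cross-check, the per-dimension request counts can be converted into percentage shares with a few lines of Python. Two assumptions are made here: the counts are paired with the dimension labels in the order in which they appear in the chart, and the count for "New" (56,286) is taken from the single NEW value reported in the top-values table, since this chart omits that dimension.

```python
# Request counts per dimension, July 06 - Jan 07 (pairing with labels is
# inferred from chart order; "New" comes from the top-values table).
REQUESTS = {
    "Subject: Topic": 145_589,
    "LC Classification": 120_644,
    "Format": 57_667,
    "New": 56_286,
    "Library": 54_476,
    "Subject: Genre": 34_096,
    "Author": 32_650,
    "Subject: Region": 22_818,
    "Language": 16_009,
    "Subject: Era": 12_257,
    "Availability": 9_286,
}


def shares(requests):
    """Return each dimension's share of all navigation requests, in whole percent."""
    total = sum(requests.values())
    return {dim: round(100 * n / total) for dim, n in requests.items()}


for dim, pct in shares(REQUESTS).items():
    print(f"{dim}: {pct}%")
```

Under these assumptions the rounded shares agree with the percentages in the "Navigation by Dimensions" pie chart, which supports the pairing of counts and labels.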

Dimension | Value | Requests
New | NEW | 56,286
Format | Book | 16,188
LC Classification | Q - Science | 12,462
Library | Textiles | 11,160
Library | D.H. Hill | 11,060
Availability | Available | 9,276
Library | Online Resources | 8,164
LC Classification | T - Technology | 8,052
Subject: Topic | History | 7,915
Format | Online | 7,858
LC Classification | P - Language and literature | 7,005
LC Classification | H - Social Sciences | 6,953
Language | English | 6,854
Subject: Region | United States | 6,298
Format | Journal, Magazine, or Serial | 4,621

Usability testing

• 10 undergraduate students
  – 5 with new Endeca-based interface
  – 5 with old catalog interface
  – Identical searching tasks

• Data collected
  – Task difficulty/failure
  – Task duration

Task Difficulty: Old Catalog

• Easy: 43%
• Medium: 12%
• Hard: 22%
• Failed: 23%

Task Difficulty: New Catalog

• Easy: 59%
• Medium: 12%
• Hard: 7%
• Failed: 22%

[Bar chart: Average Task Duration, Old vs New Catalog, for Tasks 1 through 10; time axis from 00:00 to 03:36 (minutes:seconds).]

• For students, relevance ranking is key.
  – July 06 – Jan 07: ~19% continued to page 2

• Faceted navigation is intuitive, even for students who don’t use it.

• Beware of library jargon
  – “keyword anywhere”, “keyword in subject”

• User behavior is influenced by previous experience.

Reflections

• Faceted navigation enables new ways to discover resources

• Library collections often contain rich descriptive metadata… exploit this!

• We have much to learn about how to optimize these interfaces for the user

• Great for collection analysis

Analyzing collections

[Pie chart: Textiles books divided by LCSH. Legend: carpets; embroidery; knitting; dressmaking; data processing; textile printing; chemistry; nonwoven fabrics; fashion; costume; patterns; plastics; hand weaving; cotton; clothing and dress; yarn; weaving; textile machinery; quality control; textile fibers, synthetic; cotton manufacture; management; clothing trade; textile fibers; testing; dyes and dyeing; history; polymers; textile fabrics; textile industry.]

Conclusions

Features Not Supported

• Work level aggregations / roll-up

• Customization / personalization

• Folksonomies / user contributed content

• Recommender functionality

• Shopping cart functionality

QuickSearch

Future directions

• Experiment with FRBR search/display through partnership with OCLC.

• Integrate catalog with other tools through web services:
  – OpenSearch, RSS

• Enrich catalog through external web services:
  – book jackets, reviews, etc.
  – Amazon/OCLC

• Build modular shopping cart functionality.

• Use Endeca to index local collections.

Big Issues

• Benchmarking
  – Just how much better is it? For whom? When is it not better?

• Natural Language
  – Revolutionary War problem

• Experimenting
  – What is the optimal interface?
  – Power Search?

Big Wins

• Relevance ranking

• Speed / performance

• Locally managed presentation interface

• Persistent parameter-based entry points

• Proving it could be done
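
A persistent, parameter-based entry point means the whole search state lives in the URL, so a result set can be bookmarked or linked to from elsewhere. A minimal Python sketch of the idea follows; the host name, the `q` and `facet` parameter names, and the `|` separator are hypothetical and are not the parameters the NCSU catalog actually uses.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit

BASE_URL = "https://catalog.example.edu/search"  # placeholder host


def build_entry_point(keyword=None, facets=()):
    """Encode an optional keyword and (facet, value) pairs into a URL."""
    params = []
    if keyword:
        params.append(("q", keyword))
    for facet, value in facets:
        params.append(("facet", f"{facet}|{value}"))
    return f"{BASE_URL}?{urlencode(params)}" if params else BASE_URL


def read_entry_point(url):
    """Recover the keyword and facet selections from such a URL."""
    keyword, facets = None, []
    for key, value in parse_qsl(urlsplit(url).query):
        if key == "q":
            keyword = value
        elif key == "facet":
            facet, _, facet_value = value.partition("|")
            facets.append((facet, facet_value))
    return keyword, facets


url = build_entry_point("weaving", [("Format", "Book"), ("Subject: Topic", "History")])
print(url)
print(read_entry_point(url))

# A browse-only entry point: facet selections with no search term.
browse_url = build_entry_point(facets=[("New", "NEW")])
print(read_entry_point(browse_url))
```

Because nothing depends on server-side session state, the same URL reproduces the same view later, and a link with facets but no keyword serves as a pure browse entry point.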