metadata quality assurance : the university of north texas libraries experience daniel gelaw alemneh...

24
Metadata Quality Assurance: The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital Libraries (TCDL) May 27-28, 2009

Upload: ava-bolton

Post on 27-Mar-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Metadata Quality Assurance: The University of North Texas Libraries

Experience

Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on

Digital Libraries (TCDL) May 27-28, 2009

Page 2: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Information Retrieval.

Match

DocumentDocument Representation Query

Information Need

Bates, M. J. (1989). The design of browsing and berrypicking techniques for the online search interface. Online Review, 13(5), 407-424.

Page 3: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Trends

Information creation, organization, retrieval, use, and preservation is becoming more complex

User as creator, annotator, indexer, searcher, and eventual user of his/her content

Visualization of the information space instead of a ranked list of search results

Page 4: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Total Sites Across All Domains August 1995-April 2009

232000000

162400000

92800000

0

HostnamesActive

Page 5: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Digital Projects• UNT Digital Collections

• Portal to Texas History• 100 + Collaborators

• Congressional Research Service Archives

• Other Statewide and National Projects

Page 6: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Factors Influencing Metadata Quality

Local Requirements:

• Objects• Granularity • Functionality

Collaborative Requirements:

• Diversity of Users• Interoperability• Digital Rights Issues

Page 7: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Ambiguities

Poor recall

Poor precision

Inconsistency of search results

Poor Metadata Quality

Page 8: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Common Errors

The data is:

• Incorrect

• Missing

• Ambiguous

Page 9: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Metadata Quality Assurance Mechanisms & Tools at UNT

Pre-Ingest Post-Ingest

Training

Creation Tools

Proofing & Editing Tools

Analysis Tools

Page 10: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Training

• Face-to-Face Instruction

• Metadata Schema & Documentation

• Internal Project Wikis

• Staff Support

Page 11: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Metadata Creation Template

Page 12: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital
Page 13: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Template Reader

Page 14: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

jEdit Text

Editor

Page 15: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

UNT Metadata Analysis Tools:

Post-ingest

Page 16: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Enhanced by Highlighter – On/Off

Enhanced by Qualifier – Use/Ignore

Page 17: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Null Value Analysis Tools

Page 18: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Controlled Vocabularies (UNTL-BS)

Page 19: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Better Metadata

More Functionality

Page 20: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Better Metadata

More Functionality

Page 21: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Summary

• Determine level of quality required

• Determine nature of gap and how to close it

• Machine verses human error handling

• Compromise

• Prioritize

• Test the workflow

Page 22: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

UNTLMetadata

UNTLMetadata

Generation

User

User

SystemPrecisionRecall

BrowsingSearching

Data Entry

Evaluation

Understanding

Change

Measure Quality and Usefulness of UNT Metadata

Page 23: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

References & Web Sites Consulted

• Bates, M. J. (1989). The design of browsing and berrypicking techniques for the online search interface. Online Review, 13(5), 407-424.

• Netcraft (2009). April 2009 Web Server Survey. Retrieved May 19th, 2009 from http://news.netcraft.com/archives/web_server_survey.html

• OCLC (2007). Sharing, privacy and Trust in our Networked World. Retrieved May 19th, 2009 from http://www.oclc.org/reports/pdfs/sharing.pdf

• TechSmith, Co. (2008). “UX 2.0: Any User, Any Time, Any Channel.” Retrieved May 19th, 2009 from: http://download.techsmith.com/morae/docs/UserExperience2_0.pdf

• UNT Libraries Metadata Initiative page. Retrieved May 19th, 2009 from: http://www.library.unt.edu/digitalprojects/metadata

Page 24: Metadata Quality Assurance : The University of North Texas Libraries Experience Daniel Gelaw Alemneh & Hannah Tarver 3rd annual Texas Conference on Digital

Questions?