susan wynne gugm 16 may 2013

38
Susan Wynne GUGM 16 May 2013 Streamlining Quality Control of Shelf-Ready Records with checkmarc

Upload: kobe

Post on 23-Feb-2016

54 views

Category:

Documents


0 download

DESCRIPTION

Streamlining Quality Control of Shelf-Ready Records with checkmarc. Susan Wynne GUGM 16 May 2013. checkmarc quick facts. Developer: Chad Nelson Inspired by University of Florida’s CatQC program Programming language: Python Runs on a MARC file - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Susan Wynne GUGM  16 May 2013

Susan WynneGUGM 16 May 2013

Streamlining Quality Control of Shelf-Ready Records with checkmarc

Page 2: Susan Wynne GUGM  16 May 2013

checkmarc quick facts• Developer: Chad Nelson• Inspired by University of Florida’s

CatQC program• Programming language: Python• Runs on a MARC file• Current version has predefined

“checks” of fixed and variable field data that may signal errors/problems

Page 3: Susan Wynne GUGM  16 May 2013

Audience poll• Who’s using shelf-ready services?–What’s your approach to quality control?

• Who’s considering shelf-ready services?–What are your concerns about setting up

shelf-ready services or workflows?

Page 4: Susan Wynne GUGM  16 May 2013

About Georgia StateFall 2012 Enrollment

~32,000

University Library Holdings

~1.5 million volumes

Photo credits: Georgia State University Media Relations

Page 5: Susan Wynne GUGM  16 May 2013
Page 6: Susan Wynne GUGM  16 May 2013
Page 7: Susan Wynne GUGM  16 May 2013

Current cataloging staffParaprofessional Catalogers• Copy cataloging• Original cataloging• Authority work/database

maintenance• Physical processing• And more….

2

MLIS Cataloger• Original cataloging• Training• Documentation• Metadata consultant• And more…

1

Page 8: Susan Wynne GUGM  16 May 2013

Some cataloging statsFY13 through March 31

Average Titles/Month* 996Average Items/Month* 1009Average # of Shelf-Ready Records/Month

833

% Titles Received Shelf-Ready**

83.6%

% Items Received Shelf-Ready**

82.5%*Newly cataloged print monographs only**Some shelf-ready materials need attention in-house

Page 9: Susan Wynne GUGM  16 May 2013

Outsourcing helps us to…• Avoid a backlog of purchased

materials• Maintain a cleaner database

(especially access points)• Catalog more unique or complex

resources, especially…– Special Collections materials– Scores, DVDs, other special formats

• Upgrade minimal master records in WorldCat

• Have more time to learn RDA

Page 10: Susan Wynne GUGM  16 May 2013

Shelf-ready workflow (before)

• Began in January 2010• YBP Approval plan and slip orders only– Books– No special formats– Any encoding level or cataloging source

• “Exceptions” identified by vendor or during unpacking cataloging staff

• Non-exceptions stacks– Cataloging staff hold vendor slips for manual

checking

Page 11: Susan Wynne GUGM  16 May 2013

“Shelf-ready exceptions”• No spine label• Volume numbers readily apparent• Accompanying material• Call numbers beginning with Z

Page 12: Susan Wynne GUGM  16 May 2013

Quality control (before)• Exceptions workflow• Vendor slips• Manual

Page 13: Susan Wynne GUGM  16 May 2013
Page 14: Susan Wynne GUGM  16 May 2013

Common errors found manually• empty/incomplete 300

• partial call numbers• 2 050 fields• mismatch between bib & MFHD call

number• serials cataloged as monographs

Page 15: Susan Wynne GUGM  16 May 2013

Common errors found manually

• 490 1st indicator 0• discrepancies between dates in 26X,

call no., fixed field• CIP records• volume sets not identified

Page 16: Susan Wynne GUGM  16 May 2013

What to do?• Abandon quality control?– Still finding errors that affect access

• Maintain status quo?– Too much time spent for the number of

errors found– Defeats the purpose of outsourcing

Page 17: Susan Wynne GUGM  16 May 2013

What to do?• Use WorldCat Cataloging Partners

reports for quality control?– Identify encoding levels, record source,

missing labels

Page 18: Susan Wynne GUGM  16 May 2013
Page 19: Susan Wynne GUGM  16 May 2013

What to do?• Find a way to automate?– University of Florida’s CatQC program– Another institution’s Perl scripts for

cross-matching bib data– Terry Reese’s MarcEdit

Develop a new program?

Page 20: Susan Wynne GUGM  16 May 2013

checkmarc criteria• Type of record not “a”– Scores, cartographic material, etc.

• BLvl not “m”– Serial records

• Encoding levels 2, 3, 5, 7, 8, E, J, K, or M – Alerts us to minimal, prepub, and less-

than-full• 040 ‡b other than “eng”– Non-English language of cataloging

Page 21: Susan Wynne GUGM  16 May 2013

checkmarc criteria• Presence of ‡n or ‡p in 245– Clue to a part of a multi-volume work

• Numerals in 245 ‡a or ‡b – Clue to possible serial, but leads to

some false positives• Keywords in 245: “annual” or

“biennial”– Clue to possible serial

Page 22: Susan Wynne GUGM  16 May 2013

checkmarc criteria• 300 ‡a begins with p. or v. – Clue to incomplete 300 and some multi-volume

records, but leads to some false positives• Presence of 300 ‡e – Accompanying material

• 490 with 1st indicator 0– Untraced series

• Records lacking at least one 6XX with 2nd indicator 0– No LC subject headings present

Page 23: Susan Wynne GUGM  16 May 2013

Testing# Records

% of File

% of checkmarc Results

Flagged by checkmarc 52 39.4% 100%

Already fixed (probably SREs)

24 18.2% 46.2%

Required manual attention

14 10.6% 26.9%

False positives 8 6.1% 15.4%

Flagged as Elvl 8 but OK

6 4.5% 11.5%

Page 24: Susan Wynne GUGM  16 May 2013

Errors missed by checkmarcError # Notes050 with two ‡a

1 Not in current programmingOften a signal of call no. mismatch

Multiple 050 fields

1 Flagged by checkmarc for another criteriaOften a signal of call no. mismatch

490 0 1 Found by checkmarc, but title didn’t display in report (bug reported)

Typo in publisher name

1

Diacritic errors in contents note

1

Record with multiple issues

1 Misplaced ‡c in 245Inadequate subject headings

Page 25: Susan Wynne GUGM  16 May 2013

Demo• Run a MARC file• Display and discuss report– False positives– Other items to consider skipping

• Correct errors manually

Page 26: Susan Wynne GUGM  16 May 2013
Page 27: Susan Wynne GUGM  16 May 2013
Page 28: Susan Wynne GUGM  16 May 2013

Shelf-ready workflow (after)• Began in January 2010• YBP Approval plan and slip orders

only• “Exceptions” identified by vendor or

during unpacking cataloging staff

• Non-exceptions stacks– Cataloging staff hold vendor slips for

manual checking only to identify the MARC file

Page 29: Susan Wynne GUGM  16 May 2013

Quality control (after)• Exceptions workflow• Cataloging staff run checkmarc

on the MARC file – Scan reports– Check selected records in Voyager– Fix errors in Voyager as needed–~1 hour on a typical file (not

including time to retrieve any items from stacks)

Page 30: Susan Wynne GUGM  16 May 2013

What checkmarc can’t find/do

• Mismatches between the call no. in MFHD and the spine label

• Information not matching the item in hand• Typos, misspellings, etc.• Incorrectly formed subject headings• Errors in application of cataloging rules• Missing fields or information other than the

criteria described earlier• Fix the problems directly in the Voyager

database

Page 31: Susan Wynne GUGM  16 May 2013

Desired enhancements• Change 300 ‡a begins with p or v to

300 ‡a begins with p or contains v or volume?– E.g, 300 ‡a 2 v.

• Presence of multiple ‡a in one 050 field

• Presence of multiple 050 fields– And both 050 + 090 in the same record

Page 32: Susan Wynne GUGM  16 May 2013
Page 33: Susan Wynne GUGM  16 May 2013

Current explorations/future directions

• Start running checkmarc 1.0 on e-book approval files– E-book approvals began January 2013• Part of “mainstreaming” e-workflows

• Consider possible changes to criteria for RDA data

• Customizable version under development– Select your own criteria to look for– No timetable for availability

Page 34: Susan Wynne GUGM  16 May 2013

How can I use checkmarc?• GSU has no plans to host on our servers

for public use• Options…– http://serene-scrubland-6954.herokuapp.com/• Disclaimer

– Set up on your machine– Have your system administrator deploy it on

a local server– Find another hosting option for public use?– Let’s talk!

Page 35: Susan Wynne GUGM  16 May 2013

Selected readingJay, M., Simpson, B., & Smith, D. (2009). CatQC and Shelf-Ready Material: Speeding Collections to Users While Preserving Data Quality. Information Technology & Libraries, 28(1), 41-48.

Vinh-The, L. (2005). Quality Control Issues in Outsourcing Cataloging in United States and Canadian Academic Libraries.Cataloging & Classification Quarterly, 40(1), 101-122.

Walker, M., & Kulczak, D. (2007). Shelf-ready books using PromptCat and YBP: Issues to consider (An analysis of errors at the University of Arkansas). Library Collections, Acquisitions, & Technical Services, 31(2), 61-84.

Page 36: Susan Wynne GUGM  16 May 2013

Acknowledgements• Chad Nelson, former GSU programmer

– creator of checkmarc• Hannah Grant & Christine Cooney, cataloging staff

– collaborators on checkmarc criteria– Christine is the primary current user of checkmarc

• Cliff Landis, Web Services Librarian• Eric Willoughby, Web Programmer• Bill Walsh, Dept. Head, Technical Services• Michael Jay, Betsy Simpson, & Doug Smith,

Creators of University of Florida’s CatQC program• Lai-Ying Hsiung told me about CatQC and shared

some info on shelf-ready workflows

Page 37: Susan Wynne GUGM  16 May 2013

Questions/Discussion

Page 38: Susan Wynne GUGM  16 May 2013

Contact me

Susan WynneCataloging & Metadata

[email protected]

404-413-2729http://gsu.academia.edu/SusanWynne