ebook cataloging: trouble, even in batch
DESCRIPTION
Kathryn Lybarger SLA Kentucky Chapter Program and Business Meeting November 2, 2012. Ebook cataloging: trouble, even in batch. MARC. A data format used to encode and share bibliographic data Developed in the 1960’s, still quite popular. Cataloging. Vendors often provide MARC records. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/1.jpg)
Ebook cataloging: trouble, even in batch
Kathryn LybargerSLA Kentucky ChapterProgram and Business MeetingNovember 2, 2012
![Page 2: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/2.jpg)
MARC
A data format used to encode and share bibliographic data
Developed in the 1960’s, still quite popular
![Page 3: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/3.jpg)
Cataloging
Catalog
Library of Congress
OCLCor
SkyRiverOriginal
Cataloging
![Page 4: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/4.jpg)
Vendors often provide MARC records
![Page 5: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/5.jpg)
Batch loading
Vendor MARC Catalog
![Page 6: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/6.jpg)
All done?
![Page 7: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/7.jpg)
Not quite…
![Page 8: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/8.jpg)
Records may be icky…
Title: CESMM3 price database 2009, edited by Franklin + Andrews
100 1_ Franklin.245 10 CESMM3 price database 2009 ‡h [electronic resource] / ‡c edited by Franklin and Andrews.500 __ Ebook.516 __ Document.538 __ PDF: Adobe PDF700 1_ Andrews.856 40 …
![Page 9: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/9.jpg)
…but worse, non-functional!
Data may be unhelpful, or misleading
Links may not work
This may change over time
![Page 10: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/10.jpg)
A crazy mixed-up record(with 112 holdings)
From one book: Title Author Series Subject headings
From another book: Notes ISBN Link to e-book
![Page 11: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/11.jpg)
URLs from other vendors
Provider-neutral records may have URLs from multiple vendors
An OCLC search for records with URLs from eblib, ebrary, ebscohost AND
myilibrary returned over 25,000.
Even if they are labeled, your patrons don’t know which vendor you’re using
![Page 12: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/12.jpg)
URLs that point nowhere
![Page 13: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/13.jpg)
URLs that point somewhere new!
![Page 14: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/14.jpg)
DOI troubles
![Page 15: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/15.jpg)
Books may not be available yet(or ever)
![Page 16: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/16.jpg)
“Slippage”
Some ebooks on a frontlist may never appear on the site
Individual ebooks may just disappear
![Page 17: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/17.jpg)
Lists may be available…
But not forthcoming.
You may have to periodically dig several levels deep on the website to get them:
![Page 18: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/18.jpg)
Platform change
![Page 19: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/19.jpg)
Solutions?
Use provider-neutral records when you can
Edit MARC records to conform with local standards
Verify access to all titles (periodically)
Report problems when you find them
![Page 20: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/20.jpg)
Vendors may do some editing
But how do you predict what you will need?
![Page 21: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/21.jpg)
MarcEdit
Developed by Terry Reese at Oregon State
MARC editing in a friendly yet powerful text editor
Z39.50 client
(Binary editor!)
![Page 22: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/22.jpg)
Version control Maintain previous versions of files efficiently
No need for fileFeb12-FINAL6.mrk.bak Undo to any previous version
Mercurial (Hg): Free, lightweight, cross-platform Easy to set up and remove repositories
Command line, GUI (TortoiseHG, SourceTree)
![Page 23: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/23.jpg)
Automation
MarcEdit Macros Visual Basic, Visual Basic.NET
.mrk format is text, so you can process with your favorite programming language
Don’t have a favorite language (yet)?
![Page 24: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/24.jpg)
#catcode #libcodeyear
From CodeAcademy.com:
![Page 25: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/25.jpg)
Text processing tools
Cygwin (unix) tools: grep, vim, vimdiff, sort, wc (and the list goes on)
grep ^=856 ebooks.mrk
=856 40$u http://dx.doi.org/10.1007/978-1-4419-9934-4=856 40$u http://dx.doi.org/10.1007/978-1-4302-3513-2=856 40$u http://public.eblib.com/EBLPublic/PublicVie...=856 40$u http://dx.doi.org/10.1007/978-0-85729-661-0=856 40$u http://dx.doi.org/10.1007/978-3-8349-6217-1
![Page 26: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/26.jpg)
My automation (bash, PHP, mysql) new_ebsco.sh
Profile for each vendor answers: What lines should I add/delete? What does a valid URL look like? How can I tell if the ebook is live?
(Check logs for problems)
pull.sh <filename>
![Page 27: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/27.jpg)
Generic link checkers may not be effective Ebook errors can be valid web pages, and
errors don’t mean you should give up!
HTTP/1.1 200 OK Full text ebook Web site form to buy the book
HTTP/1.1 404 Not Found No such page on server Broken DOI (that you should report)
![Page 28: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/28.jpg)
Effective link checking (my method) Database holds a list of links to be
checked
Script checks each according to site profile (pausing 10 seconds between each link): Is it a PDF? Does it contain the phrase “This is not
part of your subscription”? Can you click through to fulltext
chapters?
![Page 29: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/29.jpg)
Communicate
Dead links lurk in catalogs everywhere, and will until people know about them!
If you spot one locally, let your catalogers know.
(Report zombies!)
![Page 30: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/30.jpg)
Any questions?
![Page 31: Ebook cataloging: trouble, even in batch](https://reader035.vdocuments.site/reader035/viewer/2022062814/568166e9550346895ddb2b4a/html5/thumbnails/31.jpg)
Links
MarcEdit http://people.oregonstate.edu/~reeset/marcedit/html/index.php
Mercurialhttp://mercurial.selenic.com/
Code Academyhttp://www.codeacademy.com
Cygwin http://www.cygwin.com