hathitrust: building the universal collection john wilkin 18 may 2009

18
HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

Upload: mason-hoffman

Post on 27-Mar-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

HathiTrust: Building the Universal Collection

John Wilkin

18 May 2009

Page 2: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Presentation structure

• Quick background on where we are• A few pieces of what’s in the hopper• Development work underway• New collaborative structures

• Explore HathiTrust as a vehicle for collaboration in the realm of collections

Page 3: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Mission and Goals

• to contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge – materials converted from print– improve access …to meet the needs of the co-owning

institutions– reliable and accessible electronic representations– coordinate shared storage strategies– “public good” … free-riders.– simultaneously …centralized …open

Page 4: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

current members

Page 5: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Governance Model

• Executive Committee• Strategic Advisory Board• Coordinated input from groups of members– Hathi/CIC Steering Committee– UC library directors

Page 6: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Executive Committee• Paul Courant, University Librarian and Dean of Libraries, University

of Michigan• Laine Farley, Executive Director, California Digital Library• Paula Kaufman, University Librarian and Dean of Libraries,

University of Illinois at Champaign-Urbana• John King, Vice Provost for Academic Information, University of

Michigan• Brian Schottlaender, University Librarian, University of California,

San Diego Libraries• Patricia Steele, Dean of Libraries, Indiana University• Brad Wheeler, Chief Information Officer, Indiana University• John Wilkin, Executive Director of HathiTrust and Associate

University Library, Library Information Technology, University of Michigan

Page 7: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Strategic Advisory Board– Ed Van Gemert (Chair), Director of Libraries, University of

Wisconsin-Madison– John Butler, Associate University Librarian for Information

Technology, University of Minnesota– Patricia Cruse, Director, Preservation, California Digital Library– Robin Dale, Associate University Librarian for Collections and

Library Information Systems, University of California, Santa Cruz– R. Bruce Miller, University Librarian, University of California,

Merced– Sarah Pritchard, University Librarian, Northwestern University– Paul Soderdahl, Director, Library Information Technology,

University of Iowa – John Wilkin, Executive Director, HathiTrust (ex officio)

Page 8: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Preservation: OAIS Reference Model

GRINInternal Data Loading

GRINInternal Data Loading

Google[OCA]

In-house Conversion

Google[OCA]

In-house Conversion

MARC record extensions (Aleph)

Rights DB

MARC record extensions (Aleph)

Rights DB

Page TurnerHathiTrust API

OAIGeoIP DB

CNRI Handles[Solr]

Page TurnerHathiTrust API

OAIGeoIP DB

CNRI Handles[Solr]

METS/PREMIS objectTIFF G4/JPEG2000

OCRMD5 checksums

METS/PREMIS objectTIFF G4/JPEG2000

OCRMD5 checksums

METS objectPNGOCRPDF

METS objectPNGOCRPDFIsilon

Site ReplicationTSM

MD5 checksum validation

IsilonSite Replication

TSMMD5 checksum validation

GROOVE(JHOVE)GROOVE(JHOVE)

Page 9: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

growth trajectory

Page 10: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

accomplishments to date

1. 25 partners2. successful ingest and millions of vols online3. mirroring and backup4. rich access5. collection builder6. Catalog beta and WCL working group

Page 11: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

What next?• Data API and other strategies for increased

openness• Internet Archive/OCA ingest followed by misc.

non-Google ingest• Full text search over entire repository• Extending out services through Shib• Creating research corpus• Deeper collaborative strategies

Page 12: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Where next with collaboration?

• Begin sharing actual development, cf. ingest of Internet Archive content– Specifications– Validation routines?– Packaging?

• Collaboratively develop a collaborative framework– SAB and working group charges

Page 13: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Working groups?• Security• Collection management• Non-Consumptive Research• Digital preservation• Discovery (bibliographic and full text)• Externally-facing repository APIs• Bibliographic metadata management• Rights Management

Page 14: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Universal collection

• What is a collection?• Bibliographic identity• Certification (and for specific or purposes)– Object as content– Object as artifact

Page 15: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Toward the Cloud Library

• Shared Print repository or repositories with all the best attributes (service, treatment, management)

• Shared digital repository with all the best attributes (compliance with TRAC, accessible in every sense, a foundation for services)

• … and even some coordination between the two• … and even (particularly for in-copyright works where

we don’t have permissions) a viewable copy in GBS

Page 16: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Expectations and plans?

• How would we define our requirements for satisfaction with each?

• What would the business model be?• How would we build our local collections in

light of the presence of something like this?• What would we do on the “core” or shared

collections?

Page 17: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Next steps for libraries

• Case study library: NYU Library• ReCap storage facility in Princeton, NJ• HathiTrust digital repository• CLIR as broker and RLG Research as agent• Futures that depend on looking beyond the

local to the shared, from the shared as “you” to the shared as “we”

Page 18: HathiTrust: Building the Universal Collection John Wilkin 18 May 2009

www.hathitrust.org

Thank you!

• http://www.hathitrust.org/• RSS feed for updates• [email protected]