

Page 1

Large-scale collaborative digitisation
19th Century Pamphlets Online
Mar-2007 – Feb-2009

Grant Young
gy219@cam.ac.uk

Project Manager, 19th Century Pamphlets Online, University of Southampton & RLUK

Digitisation & Digital Preservation Specialist, Cambridge University Library

Page 2

Overview

Pamphlets – what’s interesting about pamphlets?

Project – what’s interesting about the project?

Resource – what’s on offer for users?

Lessons – what lessons are there for digitisers?

Page 3

1. What’s interesting about pamphlets?

Page 4

1. What’s interesting about pamphlets?

Key means of getting message out

Informative and opinionated

Debates over time

Collected and kept

Complement other forms of publication

Underutilised scholarly resource

Page 5

1. What’s interesting about pamphlets?

More than just printed text!

Page 6

2. What’s interesting about the project?

Large partnership involved

Substantial & significant content

Builds on previous work

Business model for sustainability & preservation

Resource discovery model

Page 7

Large partnership (12)

JISC – major funder

RLUK – sponsor & funder

University of Southampton / BOPCRIS unit – lead, digitisation

JSTOR – resource discovery, delivery and preservation

Mimas – resource discovery

RLUK Libraries – pamphlet contributors

• Bristol
• Durham
• Liverpool
• LSE
• Manchester
• Newcastle
• UCL

Page 8

Significant content

Libraries    Collections
Durham       Earls Grey – Family collection
Liverpool    Earls of Derby – Family collection
UCL          Joseph Hume (1777-1855) – Personal collection
Newcastle    Joseph Cowen (1829-1900) – Personal collection
Manchester   Foreign Office & Colonial Office collections – Government collections of overseas pamphlets
             Selections from 19th Century collection – Strong on slavery and local issues
Bristol      Selections from 19th Century collection – Strong on political parties
LSE          Selections from 19th Century collection – Strong on pressure groups

Page 9

Substantial content

23,000+ pamphlets

1 million+ pages

3 million+ files

£1.1 million budget (£780K from JISC)
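To give a feel for the scale, the headline figures can be turned into rough per-unit ratios. The sketch below is illustrative only. It assumes the "780K" is in pounds, treats every "+" figure as if it were exact, and divides the whole budget by the page count even though the budget presumably covered far more than scanning.

```python
# Headline figures from the slide. All are lower bounds ("+") in the
# original, so the ratios below are rough indications, not project results.
PAMPHLETS = 23_000
PAGES = 1_000_000
FILES = 3_000_000
BUDGET_GBP = 1_100_000
JISC_GBP = 780_000  # assumption: the slide's "780K" is in pounds


def per_unit_figures():
    """Return rough per-unit ratios derived from the headline figures."""
    return {
        "pages_per_pamphlet": PAGES / PAMPHLETS,
        "files_per_page": FILES / PAGES,
        "budget_per_page_gbp": BUDGET_GBP / PAGES,
        "budget_per_pamphlet_gbp": BUDGET_GBP / PAMPHLETS,
        "jisc_share": JISC_GBP / BUDGET_GBP,
    }


if __name__ == "__main__":
    for name, value in per_unit_figures().items():
        print(f"{name}: {value:.2f}")
```

On these figures a pamphlet averages a little over 40 pages, and the whole budget works out at roughly £1.10 per page.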

Page 10

Substantial content

Per page:

Image

OCR text (plain & co-ordinated)

Page 11

Substantial content

Per pamphlet:

XML metadata: MODS, MIX and PREMIS in a METS wrapper

Folder of image and OCR files
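As an illustration of what "MODS, MIX and PREMIS in a METS wrapper" can look like, here is a minimal sketch built with Python's standard library. It is not the project's actual profile. The section IDs, the file names, and the choice of MIX 2.0 and PREMIS 2 namespaces are assumptions, and the MIX and PREMIS payloads are empty placeholders, so the output is a skeleton rather than schema-valid metadata.

```python
import xml.etree.ElementTree as ET

# Namespace URIs. The MIX and PREMIS versions are assumptions; the slides
# do not say which versions the project settled on.
NS = {
    "mets": "http://www.loc.gov/METS/",
    "mods": "http://www.loc.gov/mods/v3",
    "mix": "http://www.loc.gov/mix/v20",
    "premis": "info:lc/xmlns/premis-v2",
    "xlink": "http://www.w3.org/1999/xlink",
}
for prefix, uri in NS.items():
    ET.register_namespace(prefix, uri)


def q(prefix, tag):
    """Build a namespace-qualified tag name for ElementTree."""
    return f"{{{NS[prefix]}}}{tag}"


def wrap(parent, section_tag, section_id, mdtype, payload):
    """Embed `payload` in a METS metadata section via mdWrap/xmlData."""
    section = ET.SubElement(parent, q("mets", section_tag), ID=section_id)
    md_wrap = ET.SubElement(section, q("mets", "mdWrap"), MDTYPE=mdtype)
    ET.SubElement(md_wrap, q("mets", "xmlData")).append(payload)
    return section


def build_mets(title, page_files):
    """Return a skeletal METS document for one pamphlet."""
    mets = ET.Element(q("mets", "mets"))

    # Descriptive metadata: a MODS record holding only the title.
    mods = ET.Element(q("mods", "mods"))
    title_info = ET.SubElement(mods, q("mods", "titleInfo"))
    ET.SubElement(title_info, q("mods", "title")).text = title
    wrap(mets, "dmdSec", "DMD1", "MODS", mods)

    # Administrative metadata: placeholder MIX (image) and PREMIS payloads.
    amd = ET.SubElement(mets, q("mets", "amdSec"))
    wrap(amd, "techMD", "TECH1", "NISOIMG", ET.Element(q("mix", "mix")))
    wrap(amd, "digiprovMD", "PROV1", "PREMIS", ET.Element(q("premis", "event")))

    # File inventory plus a structural map with one div per page.
    file_sec = ET.SubElement(mets, q("mets", "fileSec"))
    file_grp = ET.SubElement(file_sec, q("mets", "fileGrp"), USE="page images")
    struct_map = ET.SubElement(mets, q("mets", "structMap"), TYPE="physical")
    pamphlet = ET.SubElement(
        struct_map, q("mets", "div"), TYPE="pamphlet", DMDID="DMD1"
    )
    for n, href in enumerate(page_files, start=1):
        file_el = ET.SubElement(file_grp, q("mets", "file"), ID=f"IMG{n}")
        ET.SubElement(
            file_el,
            q("mets", "FLocat"),
            {"LOCTYPE": "URL", q("xlink", "href"): href},
        )
        page = ET.SubElement(pamphlet, q("mets", "div"), TYPE="page", ORDER=str(n))
        ET.SubElement(page, q("mets", "fptr"), FILEID=f"IMG{n}")
    return mets


if __name__ == "__main__":
    # Hypothetical title and file names, for illustration only.
    doc = build_mets("Example pamphlet", ["0001.tif", "0002.tif"])
    print(ET.tostring(doc, encoding="unicode"))
```

The same descriptive, technical and provenance content could also be linked by reference instead of embedded, which is one reason METS mark-up can legitimately differ between projects.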

Page 12

Builds on previous work

Metadata – RSLP/CURL 19th Century Pamphlets Cataloguing Project (1999-2002, £800K)

Digitisation infrastructure – BOPCRIS digitisation unit

Delivery & preservation infrastructure – JSTOR

Relationships – RLUK membership

Page 13

Interesting business model

Partners license all content to RLUK

RLUK-JSTOR agreement for 25 years

• JSTOR provides free archiving & delivery for UK in exchange for commercialisation elsewhere

Only exclusive for 5 years. After this…

• Libraries could deliver digital copies of their own pamphlets via open access

• RLUK could enter into further agreements over use of the content

Page 14

Interesting resource discovery model

[Diagram of discovery routes into the Pamphlet Collection. Labels on the slide:]

Pamphlet Collection

Pamphlet level (bibliographic)

Full text search

JSTOR

Mimas

JSTOR’s search interface

Links from other JSTOR content

Google Scholar Search

Regular Google Search

Copac Academic & National Library Catalogue

Catalogues of libraries holding pamphlets

19th Century Pamphlets Web Guide

Many other services, resources & collections

CrossRef, OAI…

Page 15

3. What’s on offer for users?

From early February: c. 7,000 pamphlets in initial release from JSTOR

From early March: www.pamphlets.ac.uk - online guide to pamphlets for researchers and educators

20 March - Formal launch at conference in Liverpool (free academic event)

Page 16

3. What’s on offer for users?

Page 17

4. What lessons are there for digitisers?

The headlines:

Projects don’t go to plan – things go wrong and opportunities arise

Projects depend on people as well as technology – good communication and trust are vital

Page 18

4. What lessons are there for digitisers?

…about digitising pamphlets:

Scholars view pamphlets differently (intellectual content vs archival objects; individual items vs collections)

Libraries treat pamphlets differently (definition, location, binding, handling)

Page 19

4. What lessons are there for digitisers?

… about the workflow:

Sampling & piloting are helpful but not foolproof

Time & motion is important – every second counts when undertaking large-scale digitisation
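A quick calculation shows why every second counts at this scale. The sketch below uses the deck's "1 million+ pages" figure. The 7.5-hour working day is an assumption for illustration, not a project figure.

```python
PAGES = 1_000_000  # "1 million+ pages", taken from the project's headline figures


def hours_for(seconds_per_page, pages=PAGES):
    """Total operator hours if each page takes `seconds_per_page` seconds."""
    return seconds_per_page * pages / 3600


def working_days_for(seconds_per_page, pages=PAGES, hours_per_day=7.5):
    """Same total expressed in working days (hours_per_day is an assumption)."""
    return hours_for(seconds_per_page, pages) / hours_per_day


if __name__ == "__main__":
    for seconds in (1, 5, 10):
        print(
            f"{seconds:>2} s/page: {hours_for(seconds):8.1f} hours, "
            f"{working_days_for(seconds):6.1f} working days"
        )
```

On these assumptions, one extra second of handling per page adds about 278 operator hours, or roughly 37 working days, across the collection.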

Page 20

4. What lessons are there for digitisers?

… about IPR:

Important to accept some element of risk with copyright (<1% vs >25%)

Licensing arrangements can be extremely complex and protracted (9 separate agreements required)

Page 21

4. What lessons are there for digitisers?

… about the use of standards:

Not always clear (e.g. different ways to mark-up with METS)

Not always stable (MIX and PREMIS were updated during course of project)

Page 22

4. What lessons are there for digitisers?

… about working collaboratively:

Can pose challenges & require work (differing priorities, cultures, timezones)

Can provide opportunities & flexibility (pool of skills/experience to draw on, ‘extra-curricular’ activities)

Page 23

Any questions or comments?

Email: gy219@cam.ac.uk

Visit: http://www.rluk.ac.uk/node/71