creating an international environment for research in library materials adolf knoll national library...
TRANSCRIPT
Creating an International Creating an International Environment for Research in Environment for Research in
Library MaterialsLibrary MaterialsAdolf KnollAdolf Knoll
National Library of the Czech National Library of the Czech RepublicRepublic
[email protected]@nkp.cz
Europe – National LibrariesEurope – National Libraries
0
10000000
20000000
30000000
40000000
50000000
AT BA CZ DK EE FI FR DE HU IS IT LV NL NO PL PT RU-P RU-M SP SK SL ES SE CH UK VA
Where are we?Where are we?
76000000
78000000
80000000
82000000
84000000
86000000
88000000
90000000
92000000
94000000
European NL Altogether National Library of Korea
Europe – without Spain and FranceEurope – without Spain and France
0
500000
1000000
1500000
2000000
2500000
3000000
3500000
AT CZ DK FI IS IT NL NO RU-M SP UK
Digitized manuscriptsDigitized manuscripts
0
100000
200000
300000
400000
500000
600000
BE BA HR CZ DK FI FR HU IS IT LT NL NO PL PT MK RU-P
RU-M
SP SK SL ES SE UK VA
ManuscriptoriumManuscriptorium
Launched on-line in autumn 2003Launched on-line in autumn 2003 Shared catalogue and digital libraryShared catalogue and digital library More than 700,000 pagesMore than 700,000 pages
• ImagesImages• Structured full textsStructured full texts
Manuscripts, old printed books, Manuscripts, old printed books, historical mapshistorical maps
In average: 30 parallel usersIn average: 30 parallel users
Metadata schema todayMetadata schema todayMaster+Master+
http://digit.nkp.cz/MMSB/1.1/msnkaip.xsd
Partial standardsPartial standards
Identification and moreIdentification and more• MASTER.dtdMASTER.dtd
Technical descriptionTechnical description• NISO Z39.87NISO Z39.87• DIG35DIG35• (ICC + colour targets)(ICC + colour targets)
Page levelPage level• Former Memoria standardFormer Memoria standard
Metadata schema tomorrowMetadata schema tomorrowKDD – Complex Digital DocumentKDD – Complex Digital Document
Application of METS – remapping of major Application of METS – remapping of major Master+ sections into KDD has been doneMaster+ sections into KDD has been done
KDD Heading
Description metadataIdentification records
Digital copies
Full texts
Audio recordings
Related works in XML
Multimedia docs
Referencedexternal docs
Complex Digital Document
Compatibility and connectivityCompatibility and connectivity
Transformation from MARC formats Transformation from MARC formats on the identification levelon the identification level
Creation of profiles for Creation of profiles for communicationcommunication• MARC for Z39.50 – used only for the MARC for Z39.50 – used only for the
Czech National Portal (Uniform Czech National Portal (Uniform Information Gateway)Information Gateway)
• MARC, MODS, various DC versions for MARC, MODS, various DC versions for concrete harvesters (used for TEL, CERL-concrete harvesters (used for TEL, CERL-MSS, and M-CAST portal)MSS, and M-CAST portal)
KrameriusKramerius
W3C schemasW3C schemas• Digitized periodicalsDigitized periodicals• Digitized monographsDigitized monographs
Built in production tools:Built in production tools:• NL specific (XMetal-based)NL specific (XMetal-based)• Commercial (Sirius complex digigization Commercial (Sirius complex digigization
solution – Elsyst Engineering)solution – Elsyst Engineering) Built in the Kramerius DL application that Built in the Kramerius DL application that
is going to be linked to the Union is going to be linked to the Union CatalogueCatalogue
DataData
Full text structured following our TEI-Full text structured following our TEI-based DTDbased DTD
Images:Images:• 3W recommended formats for 3W recommended formats for
Manuscriptorium current documents Manuscriptorium current documents (JPEG and GIF/PNG)(JPEG and GIF/PNG)
• MrSID + Lizardtech Express server for MrSID + Lizardtech Express server for scanned mapsscanned maps
• DjVu for newspapers and modern DjVu for newspapers and modern monographs (Kramerius)monographs (Kramerius)
Internationalized ManuscriptoriumInternationalized Manuscriptorium
The main problem lies in the area of The main problem lies in the area of psychology:psychology:
Why should I/we be taken aboard by the Why should I/we be taken aboard by the Manuscriptorium?Manuscriptorium?
Rather typical situation:Rather typical situation:• Availability of scanned outputAvailability of scanned output• Often not associated with a catalogue recordOften not associated with a catalogue record• Even if so, no complex document (XML) format Even if so, no complex document (XML) format
usedused• No or not good presentation toolsNo or not good presentation tools
We offer the toolsWe offer the tools
The most difficult problem in such The most difficult problem in such situation lies in the area ofsituation lies in the area of• Structuring metadataStructuring metadata• Linking them correctly to the sets of Linking them correctly to the sets of
imagesimages
Centralmetadatadatabase
ImageBank
IBCZ Main
ImageBank
IB-ESx
ImageBankIB-LTx
ImageBank
IB-SK1
ImageBankIB-PLx
M-TOOLM-TOOL
Freely downloadable in CZ and ENFreely downloadable in CZ and EN SupportingSupporting
• Basic identificationBasic identification• Structuring of pages Structuring of pages • Renaming of image files and geberazion of Renaming of image files and geberazion of
correct links to the images stored at any correct links to the images stored at any chosen URLchosen URL
• ValidationValidation• Output: XML file of the whole document that is Output: XML file of the whole document that is
Manuscriptorium compatibleManuscriptorium compatible International workshop 9 June 2006International workshop 9 June 2006