| ingest levels and persistent identification | october 2014 1 ingest levels and persistent...
Post on 04-Jan-2016
221 Views
Preview:
TRANSCRIPT
| Ingest Levels and Persistent Identification | October 20141
Ingest Levels and Persistent Identification Services for R & D and heritage organisations
from the German National Library
Reinhard Altenhöner and Katarina Haage
German National Library
Content overview
- Some background: Online publications – how do they find their way into the repository?
- Ingest Levels as a leading principle in German National Library‘s dp-workflow
- Persistent Identification – a basic service in the APARSEN community and how do we find what we are looking for?
- AREDO – Archiving and sustainable preservation of digital objects – a cooperation service
- DNB contribution in R&D
| Ingest Levels and Persistent Identification | October 20142
Background: Preservation at the German National Library
– Legal deposit for German online publications since 2006
– Archival system for digital preservation developed in German project kopal (2004 – 2007)- Based on DIAS by IBM, which is multi-client capable and therefore
aimed to cooperation- Support for file format migration: Object versioning, query
interface for technical metadata- Ingest and access by own developed open source library (koLibRI)
– Infrastructure for online publications: Local repository, DIAS (externally hosted), catalogue (biographical metadata, user access)
| Ingest Levels and Persistent Identification | October 20143
Online publications and their way into the repository – a new ingest workflow
– Need for flexible ingest process for variety of object types and file formats with integrated dp
– Three delivery interfaces for publishers: OAI-PMH, web form with upload, hotfolder (FTP)
– Cooperation service for other institutions (AREDO) - Traceable ingest workflow- Predictable preservation risks- Quality checks
| Ingest Levels and Persistent Identification | October 20144
Ingest workflow
| Ingest Levels and Persistent Identification | October 20145
TP
Import serviceHotfolder manager(OAI-PMH, web form)
SIP launcherRepository
SIP
didigo
Bibl. metadata
Integrity checks
Ticket system
DIAS
Ingest indicators
- File integrity (FI): Correct checksum
- Identifiability (ID): File format could be recognized
- No restrictions (NR): No technical protection measurements
- Generation of technical metadata (MD): Tool or module is available for the characterization of the recognized file format
- Validity (V): File format is valid
| Ingest Levels and Persistent Identification | October 20146
Ingest levels and error evaluation
| Ingest Levels and Persistent Identification | October 20147
FI ID NR MD V
Level 0 X O O O O
Level 1 X X O O O
Level 2 X X X O O
Level 3 X X X X O
Level 4 X X X X X
| Ingest Levels and Persistent Identification | October 20148
Persistent Identification – how do we find what we are looking for?- Uniform Resource Name (URN) is a persistent identifier
(PI); an unique identifier for digital objects, enabling online resources to be permanently identified and reliably cited
- APARSEN’s WP22 (CITABILITY AND IDENTIFICATION) development of an interoperability framework for PI systems
- All APARSEN deliverables are provided with an URN and can be found in the DNB catalogue: https://portal.dnb.de/opac.htm?query=%22aparsen%22&method=simpleSearch
| Ingest Levels and Persistent Identification | October 20149
The DNB and its URN Service – The German National Library assigns and administrates
URNs from the namespace "urn:nbn:de" and offers an URN resolving service for Germany, Austria and Switzerland
– The URN service is a free service of the German National Library
See also:
– Policy for issuing URNs in the urn:nbn:de namespace (DNB)
– Catalogue of criteria for assessing the trustworthiness of PI systems (nestor)
10
Long-term preservation cooperation service in the DNB:
- Basic offers- Optional services- Formal structure
| Ingest Levels and Persistent Identification | October 2014
AREDO
ARchivierung und dauerhafte Erhaltung Digitaler Objekte(Archiving and sustainable preservation of digital objects)
| Ingest Levels and Persistent Identification | October 201411
AREDO
| Ingest Levels and Persistent Identification | October 201412
Basic offers - Provision of a mass input and output interface
- Integrity check of all digital objects
- Check for duplicates in partner's preservation repository
- Quality checking of all digital objects
- Generation of technical metadata
- Reporting
- Provision of permanent access and search functions
- Backup storage of all archived objects (triple) / Archiving at multiple
locations
- Rights management
- Preservation planning: ongoing risk analysis and risk information
Statistic tool: Ingest
| Ingest Levels and Persistent Identification | October 201413
14
Optional services
- Ingest:- Risk handling based on jointly agreed quality rules for
digital objects- Processing of descriptive metadata supplied in Dublin Core
Metadata Element Set, Version 1.1- Processing of additional information
- Curation:- Regular and comprehensive status reports on all archived
digital objects- Agreement on suitable migration, or other safeguarding,
measures
| Ingest Levels and Persistent Identification | October 2014
German National Library
Libraries
Museums Educational Institutes
Archives
15 | Ingest Levels and Persistent Identification | October 2014
Adaptation to partner's requirementsAREDO allows individualised long-term preservation workflows to be set up between the partners and the German National Library which meet the specific needs of the institution in question:
Service provider
External Provider (DIAS)
Service catalogue / Underpinning Contract
Service catalogue
SLA SLA SLA SLA SLA SLA
S e r v i c e t a k e r
Service model
16 | Ingest Levels and Persistent Identification | October 2014
Formal structure
- Cooperation contract between both organisations
- The deliverer makes sure the submitted objects have a certain standard (see Ingest Level)
- The cooperation provider makes sure the stored objects are accessible over a long-term and certificated for a long-term preservation process ( nestor certification)
- DNB as Not-for-Profit institution; no other costs than the actual costs
For more details go to: http://www.dnb.de/aredo
or contact: Karlheinz Schmitt, k.schmitt@dnb.de
| Ingest Levels and Persistent Identification | October 201417
| Ingest Levels and Persistent Identification | October 201418
DNB and R&D: action and principles 1
- Participation in projects with strong relation to R&D-topics: - KEEP, SHAMAN, KOPAL, DP4lib, …
- Evaluation and preparing-the-scene-projects- ODE, LUKII, 4C
– Umbrella projects & services- Nestor, Parse.insight, APARSEN
| Ingest Levels and Persistent Identification | October 201419
Deliver requirements and transmit results from r&d into operation- New project: „Using emulation for the provision of multimedia
objects” – based on KEEP
Sharing knowledge, OSS-tools, regular exchange
- More than before the Horizon: Research data
- Topic for DNB as a memory organisation, if the data „is part“ of a publication and openly published in the web
- Advanced need for cooperate with data producers and data centres to make use of generated data VCoE as a platform?!
DNB and R&D: action and principles 2
| Ingest Levels and Persistent Identification | October 201420
Questions and remarks are welcome now or later:
Reinhard Altenhöner, r.altenhoener@dnb.de
and
Katarina Haage, k.haage@dnb.de
German National Library Information Infrastructure and PreservationAdickesallee 1D-60322 Frankfurtwww.dnb.de
top related