the happi toolkit: from model to implementation

21
The HAPPI Toolkit: from model to implementation Digital Preservation in Earth-Science Missions” Workshop Rome, 26 th November 2014 - INGV [email protected] virtualgix R&D LAB

Upload: luigi-briguglio

Post on 14-Jul-2015

177 views

Category:

Software


0 download

TRANSCRIPT

The HAPPI Toolkit: from model to implementation

“Digital Preservation in Earth-Science Missions” Workshop Rome, 26th November 2014 - INGV

[email protected]

virtualgix R&D LAB

SCIDIP-ES Workshop @ INGV

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

Outline

• What HAPPI is

• What HAPPI does

• How HAPPI works

• HAPPI Technology

• How to integrate HAPPI

• Current Status

• Next Steps

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

What HAPPI is

• HAPPI is one of the 12 software solutions composing the SCIDIP-ES e-Infrastructure

• HAPPI stands for Handling Authenticity, Provenance and Persistent Identifiers

Packaging

Notification

HAPPI

Finding Aid

Registry

Gap Ident.

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

What HAPPI is

• HAPPI Toolkit is a software component that manages part of preservation metadata defined in ISO:14721:2012, i.e. OAIS Preservation Description Information (PDI)

• HAPPI Toolkit is a software component that manages evidences for the transformations may occur on digital objects during their life cycle.

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

OAIS:PDI

Provenance Reference

Context

Fixity

Rights

EH

What HAPPI does

• HAPPI supports archive managers for documenting transformations of digital objects

• Documentation of transformation is based on the model of EvidenceRecord and EvidenceHistory, as well as OAIS, OPM and PREMIS

Archive Manager

Store HAPPI

Register Intellectual Entity

Capture Evidence Record

Import/Export Evidence History

Search & Browse Intellectual Entity, Evidence Records

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

How HAPPI works

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

Report • info • Fixity • SignificantProperties

How HAPPI works

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

Agent • ID+info • Type

Representation • ID+info • Format • Type

Transformation • ID+Info • Software • Type

How HAPPI works

Register the Intellectual Entity • title • creation date • reference • annotation

Gather information into Evidence Records

• transformation • who controls the transformation • result of transformation • report with annotation and specific properties

Step 1

Step 2

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

How HAPPI works

Browse the History of Data Timeline mode

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

How HAPPI works

Browse the History of Data Graph mode

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

How HAPPI works • Archive Manager can add specific significant properties, for later

supporting authenticity assessment

• Reference is applied to Intellectual Entities and evidence items (i.e. Agent, Transformation, Representation) • Organisation – who assigns the reference • Type – type of reference (e.g. URI, DOI, Protocol, …) • Value – value of reference

• Type of Transformations

• AGGREGATION • CAPTURE • CHANGEOFCUSTODY • EXTRACTION • INGESTION • MIGRATION

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

GraphDB

HAPPI-LOGIC

HAPPI Technology

Neo4j

HAPPI-SERVER

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

IntellectualEntityManager EvidenceHistoryManager

+ addIntellectualEntity(ie) + addEvidenceRecord(er, eh)

+ getIntellectualEntity(label) + getEvidenceRecord(label, eh)

+ getAllIntellectualEntities() + getAllEvidenceRecords(eh)

+ getIntellectualEntitiesBy(from, to, keyword) + getEvidenceRecordHistory(label, eh)

+ getLastEvidenceRecords(eh)

+ importEvidenceHistory(eh, gxmlFile)

+ exportEvidenceHistory(eh, gxmlFile)

HAPPI-LOGIC-1.5.0

How to integrate HAPPI

IEManager

EHManager

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

IntellectualEntity EvidenceHistory/Record

How to integrate HAPPI

// obtain the IntellectualEntityManager IntellectualEntityManager iemanager = ManagerFactory.getInstance().getIntellectualEntityManager(); // create the reference Reference sampleRef = new Reference("Picktochart", "URI","https://magic.piktochart.com/output/3098625-untitled-report"); /** * create the intellectual entity, that is composed by * reference, label, title, annotation and date of creation. */ IntellectualEntity ie1 = new IntellectualEntity(sampleRef, null, "HAPPI Infographics", "SCIDIP-ES HAPPI Infographics", new Date()); // add the intellectual entity through iemanager iemanager.addIntellectualEntity(ie1);

http://sourceforge.net/p/digitalpreserve/code/HEAD/tree/SCIDIP-ES/

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

How to integrate HAPPI

// obtain the EvidenceHistoryManager EvidenceHistoryManager ehmanager = ManagerFactory.getInstance().getEvidenceHistoryManager(); // get the evidence history of the intellectual entity EvidenceHistory eh1 = ie1.getEvidenceHistory(); /** create the first evidence record with sample data, by * using the buildRecord utility method. */ EvidenceRecord er1 = new EvidenceRecord(); er1 = buildSampleRecord("Luigi Briguglio", "capture","origin", "er1", null); // add the evidence record to its history eh1.addEvidenceRecord(er1); ehmanager.addEvidenceRecord(er1, eh1); // adding a second record to the history to the first one EvidenceRecord er2 = new EvidenceRecord(); er2 = buildSampleRecord("Luigi Briguglio", "ingestion","submitted", "er2", er1); eh1.addEvidenceRecord(er2); ehmanager.addEvidenceRecord(er2, eh1);

http://sourceforge.net/p/digitalpreserve/code/HEAD/tree/SCIDIP-ES/

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

Current Status

• HAPPI 1.5.0 instances are running for validation in

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

Current Status • A Modular Infrastructure for the Management of Authenticity and

Persistent Identifiers in Long Term Digital Preservation Repositories in Int. J. of Knowledge and Learning - forthcoming, http://www.inderscience.com/info/ingeneral/forthcoming.php?jcode=ijkl

• Thesis - Analisi Progettazione e Sviluppo di un Prototipo per la Gestione della Provenienza nel Processo di Conservazione Digitale, Tor Vergata Univ., October 2013

• “Modelling Data Value in Digital Preservation”, iPRES2013 Conference Proceedings, September 2013, http://purl.pt/24107/1/iPres2013_PDF/Modelling%20Data%20Value%20in%20Digital%20Preservation.pdf

• Preserving Authenticity Evidence to Assess Provenance and Integrity of Digital Resources in ECLAP 2013 Conference Proceedings, LNCS issue no. 7990, April 2013 - http://link.springer.com/chapter/10.1007%2F978-3-642-40050-6_7 L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

Next Steps

• Extend the predefined types of transformations (external transformation, e.g. extraction and aggregation)

• Add integrity check at provenance graph

• Add management of distributed repositories

• Improve performances

• Extension of the model for supporting «Context Information» and «Rights Management» (OAIS:PDI)

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

time

Next Steps: Extending to Context and Rights

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop

• add relationships to other digital objects, in order to document its context

• document rights and their evolution

OAIS:PDI

Provenance Reference

Context

Fixity

Rights

EH

Thanks for your kind attention

www.scidip-es.eu R&D LAB

L. Briguglio - The HAPPI Toolkit - Rome 26 November 2014 - SCIDIP-ES Workshop