premis in archivematica · premis in archivematica peter van garderen artefactual systems inc....

29
PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011

Upload: others

Post on 20-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

PREMIS in Archivematica

PETER VAN GARDERENArtefactual Systems Inc.

American Library Association New Orleans - June 24, 2011

Page 2: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Peter Van GarderenPresident / Systems Archivist

Evelyn McLellanSystems Archivist

David JuhaszSoftware Engineer

Austin TraskSystems Engineer

Jesús García CrespoSoftware Engineer

Joseph PerrySoftware Engineer

open-source sofware for archives and librariesdigital preservation consulting services

http://artefactual.com

Jessica BusheySystems Archivist

MJ SuhonosSystems Librarian / Software Engineer

Page 3: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

archivematica.org

Page 4: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Who is using Archivematica?

● Artefactual Systems clients● City of Vancouver Archives● International Monetary Fund Archives● Rockefeller Archives Center● University of British Columbia Library● Simon Fraser University Archives

Page 5: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Who is using Archivematica?

● Archivematica community● 20-30 pilot testers● Documentation contributions● Media preservation plan testing● Hydra fork: ‘Rubymatica’● Education: University of Toronto, University College

London, SAA● 10+ workshops in past 12 months

Page 6: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 7: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 8: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 9: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Data Management

Preservation Planning

Archival Storage

Ingest

Administration

SIP

MANAGEMENT

AIP Access DIP

PRODUCER

CONSUMER

Open Archival Information System

Submission Agreement

DesignatedCommunity

Page 10: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 11: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

2011 Development Priorities: 0.8 beta → production

● Interoperable AIP structure

● AIP Indexing (Lucene/ElasticSearch)

● SWORD REST APIs

● Persistent URIs (dns/uuid)

● Format registry integration (Open Planets Foundation)

● ContentDM, Dspace, XTF, TRIM & ICA-AtoM integration (Fedora?)

● Email + attachments preservation plan

● Transfer: SIP preparation (data visualization, keyword filtering)

Page 12: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Free Beer!

Page 13: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

“They’ll never take our freedom”

© 1995 Paramount Pictures & 20th Century FoxSee fair use rationale: http://en.wikipedia.org/wiki/File:Brave_mel.jpg

Page 14: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 15: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Foundation orSteering Committee

Governance

Coordination

Funding

Promotion

Users

Lead institutions Funding DevelopmentAll users Bug reports Enhancement requests Code patches Documentation Promotion

Open Source Software

Code

Knowledge

Community

Service Providers

Development

Technical Support

Hosting

Training

Promotion

CodeTime

MoneyKnowledge

CodeTimeMoneyKnowledge

TimeMoney

Knowledge

The open-source eco-system

Page 16: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

What is PREMIS good for?

The PREMIS data dictionary defines what a preservation repository needs to know.

The primary uses of PREMIS are for repository design, repository evaluation, andexchange of archived information packages among preservation repositories. --Caplan, Understanding PREMIS (2009)

Page 17: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

What is PREMIS good for?

Authenticity: establish identity and integrity

Keep the records secure Maintain the chain of custody Document all activities Describe the records

Page 18: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

PREMIS in Archivematica

Semantic unit values managed as SQL data while going through Archivematica ingest

Output as XML into AIP upon ingest completion

Page 19: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

PREMIS in Archivematica: AIP

Bagit package (.zip, .tar optional): /data /logs /metadata /objects mets.xml

Whole PREMIS record in METS digiprovMD: <mets><amdSec><digiprovMD><mdWrap> <XMLdata><premis><object><event><agent><rights>

<objectCharacteristicsExtension><FITS>

Page 20: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Objects

Identifier

Category

Composition level

Size

Fixity

Format

Characteristics

Relationships

Page 21: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 22: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 23: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Events

Ingestion

Message digest calculation (fixity)

Quarantine

Unpacking

Virus check

Format identification

Format validation

Normalization

Page 24: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 25: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 26: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

Agents

Who or what is doing all these things to the digital objects? Organizations

Individuals

Software

Page 27: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President
Page 28: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

PREMIS in Archivematica: Next Steps

Rights metadata: sync with security classification, FOIPA, accessioning, licensing vocabularies. Flexible structure for defining rights: <rightsExtension> <rightsGranted><act><restriction><termOfGrant>

EAC for Agents?

Indexing AIP metadata: use PREMIS entities as domain/document model

Page 29: PREMIS in Archivematica · PREMIS in Archivematica PETER VAN GARDEREN Artefactual Systems Inc. American Library Association New Orleans - June 24, 2011. Peter Van Garderen President

The original content in this presentation is copyright Artefactual Systems Inc. 2011. You may freely re­use this content under the terms of the Creative Commons Attribution­Non­Commercial­Share Alike 3.0 license. 

AttributionTitle:         PREMIS in Archivematica: ALA 2011 New OrleansCreator:    Peter Van Garderen, Artefactual Systems Inc.Date:         June 25, 2011