a framework for the preservation of a docker container/docker... · docker container • uses of...

15
www.oerc.ox.ac.uk Iain Emsley, David de Roure 12th International Digital Curation Conference (IDCC17), 20–23 February 2017 A Framework for the Preservation of a Docker Container

Upload: others

Post on 20-Aug-2020

59 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

www.oerc.ox.ac.uk

Iain Emsley, David de Roure 12th International Digital Curation Conference (IDCC17), 20–23 February 2017

A Framework for the Preservation of a Docker Container

Page 2: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

•  Uses of Containers •  Issues with existing approaches

•  A Framework •  Integration Prototype overview •  Providing a Provenance of Container Build •  Capturing the Dockerfile details •  Capturing Container Build Environment and Processes •  Capturing the Workflow •  Integrity •  Conclusion

Page 3: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Current uses of containers

•  Being used as repositories for pipelines –  Bio-sciences especially and Radio Astronomy

•  Dockerfiles as a recipe •  Repeatable building of the Container •  Tagging containers in repository for versioning

© Loiol, CC BY ND NC 2.0, https://www.flickr.com/photos/lioil/1198857098

Page 4: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Issues with existing approaches

•  Isn’t a container software itself?

•  Linking this to a pipeline

•  Repeatability and reproducibility issues

© S_Volenszki, CC BY NC 2.0, https://www.flickr.com/photos/s_volenszki/2218589271

© Wojtek Ogrodowczyk, CC BY NC 2.0, https://www.flickr.com/photos/sharnik/4619816186

Page 5: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

A Framework

Software Composition •  Description of the components in the container

Provenance and Ownership •  Processes used to build the container. •  The maintainer of the Dockerfile

Container Environment •  The environment used to construct the container

Workflow •  The workflow with the configuration details for the pipeline

Page 6: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

SKA Integration Prototype

Github

Jenkins

Build

Cluster

Container

Store

Developers

Page 7: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Providing a Provenance of Container Build

Template

Dockerfile pulled from Github

Build template

Providing a base build

Tagging the Container Providing a namespace and version

Storing it

Making it available

•  Not all software can be archived for all time •  Linking artefacts and owners •  Capturing the environment and processes

Page 8: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Capturing the Dockerfile details

Page 9: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Capturing Container Build Environment and Processes

Page 10: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Capturing the Workflow

Page 11: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Integrity

• Build System

Completeness and accurate processes used?

• Software Processes Conformance to project standards

Page 12: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Conclusions

• Capturing the Dockerfile, Environment and Workflow Processes

Developing the Dockerfile as recipe

• Potential challenges in retrieving parameters

•  Integrity

Build Process Software Process

• Software Citation

Page 13: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Thanks for listening

Iain Emsley Oxford e-Research Centre [email protected]

@iainemsley

Page 14: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

Acknowledgements

Benjamin Mort, Fred Dulwich, and Wes Armour, Oxford e-Research Centre

Page 15: A Framework for the Preservation of a Docker Container/docker... · Docker Container • Uses of Containers • Issues with ... Workflow • The workflow with the configuration details

References

Emsley, I and De Roure, D, A framework for the preservation of a Docker container, International Digital Curation Conference, https://ora.ox.ac.uk/objects/uuid:f567f27a-4efb-431e-abcb-07b6e8c03ce2 Emsley, I, SDP Memo 29: Integration Prototype Build and Deployment Infrastructure Design