"Data in Context" IG sessions @ RDA 3rd Plenary

Data in Context. Co-chairs: Brigitte Jörg, Keith Jeffery. RDA 3rd Plenary, March 26-28, 2014, Dublin.

Upload: brigitte-joerg

Post on 09-Jul-2015

DESCRIPTION

Slides for session 1 + 2 of Data in Context Interest Group Meeting at 3rd RDA Plenary in Dublin.

TRANSCRIPT

Page 1: "Data in Context" IG sessions @  RDA 3rd Plenary

Data in Context

Co-chairs: Brigitte Jörg, Keith Jeffery

RDA 3rd Plenary, March 26-28, 2014, Dublin

Page 2: "Data in Context" IG sessions @  RDA 3rd Plenary

Brief History

• 1st Plenary, Gothenburg: preparing a WG Proposal/Case Statement „Contextual Metadata“

• A lot of interest
• Revision of Initial Use Cases
• Use Cases as specific as possible
• Alignment with other WGs / Activities
• Four revised use cases:
– Researcher: Find data ..
– Manager: Indicate to funder
– Provenance: Allow to take segments from streamed data workflows
– Interoperability: Exchange of contextual metadata

• Rename Group to „Data in Context“

Page 3: "Data in Context" IG sessions @  RDA 3rd Plenary

Data in Context IG Approach

• Lifecycle Approach
– Linear Sequence of Elements
– Cyclic Repetition of Elements

• Investigate Lifecycle Models
– DCC: Conceptualize; Create; Access; Use; Appraise; Select; Dispose; etc.
– DDI: Discovery & Planning; Initial Data Collection; etc.
– Research Lifecycle (Jisc): Research Process: Simulate; Experiment; Manage Data; Analyse; etc.
– etc. ??

• Investigate contextually or subcontextually-aware standardization work
– OAIS; CASRAI; CERIF; VIVO; PROV; PREMIS; MARC; CKAN; DCAT; ISO; W3C; OMG; Research Objects, etc.

• Investigate / Prioritize Reusable Requirements

• Deliverables:
– M6: Overview of contextually-aware standardization work
– M12: Priority List of Requirements

• Goal:
– Set up of a Working Group
– Implementation of Standardized Profiles

• Long-term Goal:
– Automated Transformation Between Standards

Page 4: "Data in Context" IG sessions @  RDA 3rd Plenary

Collaboration / Exchange

• RDA Foundation and Terminology
• RDA Metadata Standards Directory WG
• RDA PID Information Types WG
• ICSU Open Metadata Catalogue and Knowledge Networks WG
• RDA/WDS Workflows for Publishing Data IG
• RDA Data Description Registry Interoperability
• RDA Semantic Interoperability Activity
• RDA Metadata Interest Group
• Various W3C groups (LOD, SW....)

Page 5: "Data in Context" IG sessions @  RDA 3rd Plenary

Requirements / Needs

• Stakeholders

• Data Producers

• Data Consumers

• Standardized Open Vocabularies

• Standardized Formal Data Profiles

• Standardized Formal Semantics

Template

First steps taken with developing a Template

Apply

Page 6: "Data in Context" IG sessions @  RDA 3rd Plenary

DCC – The Curation Lifecycle

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

http://www.dcc.ac.uk/digital-curation/what-digital-curation

Page 7: "Data in Context" IG sessions @  RDA 3rd Plenary

DDI Lifecycle

http://www.ddialliance.org/Specification/DDI-CV/

DDI Controlled Vocabularies: Analysis Unit; Character Set; Commonality Type; Lifecycle Event Type; Response Unit; Software Package; Summary Statistic Type; Time Method

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

Page 8: "Data in Context" IG sessions @  RDA 3rd Plenary

Data Assets Framework

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

http://www.data-audit.eu/

Page 9: "Data in Context" IG sessions @  RDA 3rd Plenary

Research Lifecycle

DDI Controlled Vocabularies: Analysis Unit; Character Set; Commonality Type; Lifecycle Event Type; Response Unit; Software Package; Summary Statistic Type; Time Method

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

http://www.jisc.ac.uk/whatwedo/campaigns/res3/jischelp.aspx

Page 10: "Data in Context" IG sessions @  RDA 3rd Plenary

RDA Practical Policy WG

Policy Categories

Collection-based Policies

Integrity

Data Lifecycle Management

Data Staging

Federation

Description

Publication

Compliance

Data Management Plans

Access Control

Preservation

Provenance

Replication

Regulatory

Management

Administrative

Assessment

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

Src: Slide Extract Rainer Stotzka, Reagan Moore provided for „Data in Context“ session, RDA 3rd Plenary

Page 11: "Data in Context" IG sessions @  RDA 3rd Plenary

Data Lifecycle

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

DATA

Collaboration & Visualisation

Dissemination & Sharing

Archiving & Preserving

Analysis & Data Mining

Acquisition & Modeling

Src: Keynote Tony Hey at RDA 3rd Plenary

Page 12: "Data in Context" IG sessions @  RDA 3rd Plenary

Experimental Context, Publishing and Research Objects

• Proposal: Scientist submits application for beamtime
• Approval: Facility committee approves application
• Scheduling: Facility registers, trains, and schedules scientist's visit
• Experiment/Investigation: Scientist visits facility, runs experiment
• Data storage: Raw data filtered and stored
• Data analysis: Tools for processing made available
• Record Publication: Subsequent publication registered with facility

Investigation as a first class object

Src: Slide extract Brian Matthews, STFC provided for „Data in Context“ session, RDA 3rd Plenary

Page 13: "Data in Context" IG sessions @  RDA 3rd Plenary

Liberalised Meta-Data Is a Network

Citation

Coverage (Temporal, Spatial, Topic)

Use, Caveats, Lineage, Methods, and Licenses

Publisher

People

Institutions

RDI Outputs / Online Resources

Projects

Initiatives

Networks

Funders

Relationships are contributed by (1) meta-data mining (2) information from websites conforming to schema (3) social-media-type sites and VREs (4) existing network contributions (5) scraping existing websites (6) ontologies and vocabularies (…)

Src: Slide Extract Wim Hugo, ICSU WDS provided for „Data in Context“ session, RDA 3rd Plenary
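
The "meta-data as a network" idea on this slide can be illustrated with a minimal sketch. It assumes entities are typed nodes (using the node types named on the slide) and that every relationship records which source contributed it. The class names, relation names, and example identifiers below are hypothetical and are not taken from the slides or from any WDS system.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Entity:
    kind: str  # node type from the slide, e.g. "People", "Funders"
    name: str  # hypothetical identifier, for illustration only


class MetadataNetwork:
    """Toy graph: each edge keeps the source that contributed it."""

    def __init__(self) -> None:
        self._edges = defaultdict(list)

    def relate(self, subject, relation, target, contributed_by):
        # Store the relationship together with its contributing source.
        self._edges[subject].append((relation, target, contributed_by))

    def related(self, subject, contributed_by=None):
        # Return (relation, target) pairs, optionally filtered by source.
        return [
            (relation, target)
            for relation, target, source in self._edges[subject]
            if contributed_by is None or source == contributed_by
        ]


network = MetadataNetwork()
output = Entity("RDI Outputs", "example-dataset")
person = Entity("People", "example-researcher")
funder = Entity("Funders", "example-funder")

# Relation names are assumptions; sources reuse wording from the slide.
network.relate(output, "created_by", person, "meta-data mining")
network.relate(output, "funded_by", funder, "ontologies and vocabularies")

print(network.related(output))
print(network.related(output, contributed_by="meta-data mining"))
```

Keeping the contributing source on each edge lets a consumer filter relationships by how they were obtained, for example trusting mined metadata differently from scraped websites.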

Page 14: "Data in Context" IG sessions @  RDA 3rd Plenary

Etc.

• Data Curation Profiles (Purdue University)

• ODP Model (ISO Reference Model for Open Distributed Processing)

Page 15: "Data in Context" IG sessions @  RDA 3rd Plenary

Standards

Jeffery et al. 2013 http://resources.metapress.com/pdf-preview.axd?code=vl5422n2u7112669&size=largest

e.g.
• OAIS
• CASRAI
• CERIF
• VIVO
• PROV
• PREMIS
• MARC
• CKAN
• DCAT
• ISO
• W3C
• OMG
• ODP
• etc.

Page 16: "Data in Context" IG sessions @  RDA 3rd Plenary

Emerging e-Infrastructure

Discovery

Contextual

Discovery

Jeffery et al. 2013 http://resources.metapress.com/pdf-preview.axd?code=vl5422n2u7112669&size=largest

Page 17: "Data in Context" IG sessions @  RDA 3rd Plenary

Agenda

Session 1: Thursday, March 27 - 15:30 - 17:00

• Introduction and Overview from Co-Chairs
• Contributions from RDA Members
– Data Publishing Workflows, DCC Data Profiles (Angus Whyte)
– Data Description Registry Interoperability (Amir Aryani)
– Long-tail Data IG, Data Publishing IG (Jochen Schirrwagen)
– WDS Knowledge Network activity (Wim Hugo)
– Experimental Context, Publishing and Research Objects (Brian Matthews)
– Reference Model Proposal (Yin Chen)

• Discussion

Note taking: Alessia Bardi, RDA Early Career Researchers Programme recipient.

Page 18: "Data in Context" IG sessions @  RDA 3rd Plenary

Agenda

Session 2: Friday, March 28 - 11:00 – 12:30

• Recap and Overview from Co-Chairs
• Contributions from RDA Members
– Semantic Interoperability (Gery Berg-Cross)
– Metadata WGs (Keith Jeffery, Rebecca Koskela)
– Practical Policy Sessions (Slides Reagan Moore)

• Discussion

Note taking: Alessia Bardi, RDA Early Career Researchers Programme recipient.

Page 19: "Data in Context" IG sessions @  RDA 3rd Plenary

Rough Work Plan

• M6: Overview of contextually-aware standardization work

• M12: Priority List of Requirements

From there, set up an RDA Working Group

Requirements-driven

Implementation of Standards WG Plan