policy-based data managementdatafed.org/dev/wp-content/uploads/2012/04/sc11-datanet.pdf · 2015....
TRANSCRIPT
![Page 1: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/1.jpg)
Policy-Based Data Management
Reagan W. Moore Arcot Rajasekar
University of North Carolina at Chapel Hill [email protected]
![Page 2: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/2.jpg)
Topics • Policy-based data management • Integrated Rule-Oriented Data System (v3.0) • Federation
– Soft Links – Realizable objects – Policy-encoded objects – Virtualization of collection life cycle
• DataNet Federation Consortium • LifeTime Library
![Page 3: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/3.jpg)
Policy-based Data Sharing Client
Provider iRODS
controlled workflows
Provider iRODS
controlled workflows
Storage Storage
Shared Collection
Consensus on Policies and Procedures controls the shared data
![Page 4: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/4.jpg)
4
Policy-‐based Data Environments • Purpose -‐ reason a collec5on is assembled • Proper)es -‐ a6ributes needed to ensure the purpose • Policies -‐ controls for enforcing desired proper'es, • mapped to computer ac'onable rules • Procedures -‐ func5ons that implement the policies • mapped to computer ac'onable workflows • Persistent state informa)on -‐ results of applying the procedures • mapped to system metadata • Assessment criteria -‐ valida5on that state informa'on conforms to
the desired purpose • mapped to periodically executed policies
![Page 5: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/5.jpg)
Data Virtualiza5on
Storage System
Storage Protocol !
Access Interface !
Policy Enforcement Points !
Standard Micro-services !
Map from the actions requested by the client to multiple policy enforcement points. Map from policy to standard micro-services. Map from micro-services to standard Posix I/O operations. Map standard I/O operations to the protocol supported by the storage system
Standard I/O Operations !
Dat
a G
rid
![Page 6: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/6.jpg)
iRODS Distributed Data Management
![Page 7: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/7.jpg)
7
Policy Enforcement Points
• Currently have 71 loca5ons within iRODS framework where policies are checked. – Each ac5on may involve mul5ple policy enforcements points
• Policy enforcement points – Pre-‐ac5on policy (selec5on of storage loca5on) – Policy execu5on (file dele5on control) – Post-‐ac5on policy (derived data products)
![Page 8: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/8.jpg)
Policy Enforcement Points (71) ACTION acCreateUser acDeleteUser acGetUserbyDN acTrashPolicy acAclPolicy acSetCreateConditions acDataDeletePolicy acRenameLocalZone acSetRescSchemeForCreate acRescQuotaPolicy acSetMultiReplPerResc acSetNumThreads acVacuum acSetResourceList acSetCopyNumber acVerifyChecksum acCreateUserZoneCollections acDeleteUserZoneCollections acPurgeFiles acRegisterData acGetIcatResults acSetPublicUserPolicy acCreateDefaultCollections acDeleteDefaultCollections
POST-ACTION POLICY acPostProcForCreateUser acPostProcForDeleteUser acPostProcForModifyUser acPostProcForModifyUserGroup acPostProcForDelete acPostProcForCollCreate acPostProcForRmColl acPostProcForModifyAVUMetadata acPostProcForModifyCollMeta acPostProcForModifyDataObjMeta acPostProcForModifyAccessControl acPostProcForOpen acPostProcForObjRename acPostProcForCreateResource acPostProcForDeleteResource acPostProcForModifyResource acPostProcForModifyResourceGroup acPostProcForCreateToken acPostProcForDeleteToken acPostProcForFilePathReg acPostProcForGenQuery acPostProcForPut acPostProcForCopy acPostProcForCreate
PRE-ACTION POLICY acPreProcForCreateUser acPreProcForDeleteUser acPreProcForModifyUser acPreProcForModifyUserGroup acChkHostAccessControl acPreProcForCollCreate acPreProcForRmColl acPreProcForModifyAVUMetadata acPreProcForModifyCollMeta acPreProcForModifyDataObjMeta acPreProcForModifyAccessControl acPreprocForDataObjOpen acPreProcForObjRename acPreProcForCreateResource acPreProcForDeleteResource acPreProcForModifyResource acPreProcForModifyResourceGroup acPreProcForCreateToken acPreProcForDeleteToken acNoChkFilePathPerm acPreProcForGenQuery acSetReServerNumProc acSetVaultPathPolicy
![Page 9: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/9.jpg)
iput ../src/irm.c checks 10 policy hooks srbbrick14:10900:ApplyRule#116:: acChkHostAccessControl srbbrick14:10900:GotRule#117:: acChkHostAccessControl srbbrick14:10900:ApplyRule#118:: acSetPublicUserPolicy srbbrick14:10900:GotRule#119:: acSetPublicUserPolicy srbbrick14:10900:ApplyRule#120:: acAclPolicy srbbrick14:10900:GotRule#121:: acAclPolicy srbbrick14:10900:ApplyRule#122:: acSetRescSchemeForCreate srbbrick14:10900:GotRule#123:: acSetRescSchemeForCreate srbbrick14:10900:execMicroSrvc#124:: msiSetDefaultResc(demoResc,null) srbbrick14:10900:ApplyRule#125:: acRescQuotaPolicy srbbrick14:10900:GotRule#126:: acRescQuotaPolicy srbbrick14:10900:execMicroSrvc#127:: msiSetRescQuotaPolicy(off) srbbrick14:10900:ApplyRule#128:: acSetVaultPathPolicy srbbrick14:10900:GotRule#129:: acSetVaultPathPolicy srbbrick14:10900:execMicroSrvc#130:: msiSetGraftPathScheme(no,1) srbbrick14:10900:ApplyRule#131:: acPreProcForModifyDataObjMeta srbbrick14:10900:GotRule#132:: acPreProcForModifyDataObjMeta srbbrick14:10900:ApplyRule#133:: acPostProcForModifyDataObjMeta srbbrick14:10900:GotRule#134:: acPostProcForModifyDataObjMeta srbbrick14:10900:ApplyRule#135:: acPostProcForCreate srbbrick14:10900:GotRule#136:: acPostProcForCreate srbbrick14:10900:ApplyRule#137:: acPostProcForPut srbbrick14:10900:GotRule#138:: acPostProcForPut srbbrick14:10900:GotRule#139:: acPostProcForPut srbbrick14:10900:GotRule#140:: acPostProcForPut
![Page 10: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/10.jpg)
10
User w/Client Can Search, Access, Add and
Manage Data & Metadata
Access distributed data with Web-‐based Browser or iRODS GUI or Command Line clients.
iRODS Data Server
Disk, Tape, etc.
iRODS Metadata Catalog
Track informa;on
iRODS Middleware
iRODS Rule Engine
Track Policies
Overview of iRODS Architecture
![Page 11: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/11.jpg)
iRODS Extensible Infrastructure • Clients – specific to discipline and life cycle state • Policies – specific to discipline • Procedures – specific to discipline • Remaining infrastructure is generic
– Network transport Parallel I/O – Authentication / Authorization Single Sign-on – Distributed storage access Protocol mediation – Remote execution Deferred/periodic – Metadata management Catalog – Message passing Debugging/progress – Rule engine Policy control
![Page 12: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/12.jpg)
Generic Capabilities • Replication • Registration of files into the data grid • Synchronization of remote directory • Managed file transport (iDrop) • Automated metadata extraction • Queries on metadata, tags • Server-side workflows (loop over result sets) • Parallel I/O streams & RBUDP transport
![Page 13: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/13.jpg)
Policies • Reten5on, disposi5on, distribu5on, arrangement • Authen5city, provenance, descrip5on • Integrity, replica5on, synchroniza5on • Dele5on, trash cans, versioning • Archiving, staging, caching • Authen5ca5on, authoriza5on, redac5on • Access, approval, IRB, audit trails, report genera5on • Assessment criteria, valida5on • Derived data product genera5on, format parsing • Federa5on of independent data grids
![Page 14: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/14.jpg)
Highly Controlled Environment • All accesses are authenticated
– GSI / Kerberos / Challenge-response / Shibboleth
• All operations are authorized – ACLs on files, storage – User groups, storage groups
• All policies evaluate a constraint – Constraints based on persistent state
information and session information
![Page 15: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/15.jpg)
iRODS Version 3.0 • September 30 production 3.0 release • Features
– New rule engine • Strong parameter typing • Optimized performance (thousands of rules) • Expanded rule programming language
– Rule versioning – Distributed rule base management – Soft links to external resources – New transport management for large files – Improved Java interface – Windows native port (C++ compilation) – Synchronization interface (iDrop)
![Page 16: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/16.jpg)
Applications • Data grids – PB-size distributed collections
– Astronomy – NOAO, CyberSKA, LSST – High Energy Physics – BaBar, KEK – Earth Systems – NASA (MODIS data set) – Australian Research Collaboration Service – Plant biology – iPlant Collaborative
• Institutional repositories – Carolina Digital Repository
• Libraries – Texas Digital Libraries – Seismology - Southern California Earthquake Center
• Archives – Ocean Observatories Initiative
![Page 17: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/17.jpg)
Expectations
• Data collection sizes will increase – Now petabytes, soon exabytes
• 1 PB/year = 32 MB/sec • 1 PB/day = 11.6 GB/sec
– Need to do data analyses at the storage system • Integration of data manipulation with
storage controllers – Analyses do not go over the network
![Page 18: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/18.jpg)
Applications
• Digital libraries – Continuous indexing of contents
• Scientific data collections – Extraction of features from data sets – Creation of derived data products
• Archives – Transformative migrations – Validation of assessment criteria
![Page 19: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/19.jpg)
Federation
• Federate multiple storage systems – Build data grid to create shared collection
• Federate multiple data grids – Establish trust between data grids
• Authenticate access based on user home data grid • Authorize operation based on data home data grid
– Cross-register users between data grid
• Federate multiple data management systems – Register remote data through a soft link
![Page 20: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/20.jpg)
Federation through Shared Collection
Client
Institution iRODS
controlled workflows
Institution iRODS
controlled workflows
Storage Storage
Shared Collection
Shared name spaces for collaborative environment
![Page 21: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/21.jpg)
Shared Collection
• Shared name spaces – User name space (single sign-on) – Resource name space (logical names) – Collection name space (collection hierarchy) – File name space (logical names)
• Shared policies and procedures – Micro-service name space (versions) – Rule name space (versioned rules)
![Page 22: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/22.jpg)
Data Grid Federation
Client
Data Grid iRODS
controlled workflows
Data Grid iRODS
controlled workflows
Storage
Shared Collection
Consensus on Policies and Procedures controls the shared data across federation
Storage Storage Storage
![Page 23: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/23.jpg)
23
SoQ Link
Remote Resource
Data Object
iRODS Data Grid
Register Access
Cache Resource
Steps 1) Register access information 2) When access, instantiate a copy in the data grid 3) If copy is deleted, go back to the original site
![Page 24: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/24.jpg)
Soft Links (iRODS version 3.0)
• Use ireg command to register access to data within a remote repository
• Can access data in web pages, ftp, z39.50, srb data grid, irods data grid
• Requires definition of micro-service object: – resc_type mso – data_type mso
• Requires creation of mso resource and mso resource group for data caching
![Page 25: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/25.jpg)
Soft Link (iRODS 3.0)
ireg -D mso -R msoResc14 -G msoRescGroup "//http://irods.org/pubs/iRODS_Fact_Sheet-0907c.pdf" /tempZone/home/rods/webfile Remote object is registered as webfile in the directory /tempZone/home/rods Can retrieve a copy of the web page iget webfile
![Page 26: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/26.jpg)
Soft Link Micro-service myTestRule { #Input parameters are: # inRequestPath - the string sent to the remote URL # inFileMode - the cache file creation mode # inFileFlags - the access modes for the cache file # inCacheFileName - the full path of the cache file on the local system #No output parameters #Output is the name of the file that was created msiobjget_http(*Request, *Mode, *Flags, *Path); writeLine("stdout","Retrieved file *Path from *Request"); } INPUT *Request ="http://irods.org/pubs/iRODS_Fact_Sheet-0907c.pdf", *Mode = "w", *Flags = "O_RDWR", *Path = "/home/reagan/Vaulttest/webfile" OUTPUT ruleExecOut
![Page 27: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/27.jpg)
Realizable Objects • Soft links are implemented through micro-service
objects. – A micro-service is defined that manages the required
access protocol • Realized Objects
– Register into iRODS the workflow needed to create a derived data product
– Accessing the link causes the derived data to be generated and registered as a replica
– Implemented using compound resource concept – Elevates provenance information to first class object
![Page 28: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/28.jpg)
Active Objects Provenance as a First Class Object
Object A workflow depends on
Object B workflow depends on
Object C workflow depends on
Object D
Change object D System automatically re-computes objects C, B, A when object A is accessed
![Page 29: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/29.jpg)
Policy-Encoded Objects • Micro-services exist for loading policies • The workflow associated with a realizable
object can load the policies under which the object may be used – Enforce redaction – Transform to required format – Verify distribution – Assert audit trails on use – Object specific policies
![Page 30: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/30.jpg)
Virtualization of Collection Life Cycle
• Define policies that implement desired management objectives
• As user community broadens, develop new consensus on require management objectives
• Define the new policies needed to meet the new objectives
• Migrate collection onto new policies • Community specific policies
![Page 31: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/31.jpg)
Community-Based Collection Life Cycle
Project Collection
Private
Local Policy
Data Grid
Shared
Distribution
Policy
Digital Library
Published
Description
Policy
Data Processing
Pipeline
Analyzed
Service Policy
Reference Collection
Preserved
Representation
Policy
Federation
Sustained
Re-purposing Policy
Stages correspond to addition of new policies for a broader community Virtualize the collection life cycle through policy evolution
Each life cycle stage re-purposes the original collection
![Page 32: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/32.jpg)
LIfeTime Library • Student personal digital library
– Manage course material – Photograph collections – Video collections – Reference collection (soft links to information)
• Choose favorite access mechanism – iDrop (synchronize local directory, share data) – iDrop cloud browser (add tags, metadata) – Unix tools (execute personal rules)
![Page 33: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/33.jpg)
LifeTime Library
Storage System
Storage Protocol !
Access Interface !
Policy Enforcement Points !
Standard Micro-services !
Multiple clients Policies to automate replication turn on versioning set audit trails enforce strict ACLs replicate metadata Micro-services to create thumbnails extract metadata assign organization
Standard I/O Operations !
Dat
a G
rid
![Page 34: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/34.jpg)
DataNet Federation Consortium • Implement na5onal data grid
– Federate exis5ng discipline-‐specific data management systems to enable na5onal research collabora5ons
• Enable collabora5ve research on shared data collec5ons – Manage collec5on life cycle as the user community
broadens
• Integrate “live” research data into educa5on ini5a5ves – Enable student research par5cipa5on through control
policies
Project Shared Collection
Processing Pipeline Digital Library
Reference Collection Federation
Collection Life Cycle Cyber-infrastructure Partners: Univ. of North Carolina, Chapel Hill Univ. of California, San Diego Arizona State University Drexel University Duke University University of Arizona University of South Carolina
Science and Engineering Initiatives: Ocean Observatories Initiative the iPlant Collaborative CUAHSI CIBER-U Odum Social Science Institute Temporal Dynamics of Learning Center
National Science Foundation Cooperative Agreement: OCI-0940841 Policy-based
data management
![Page 35: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/35.jpg)
Open Source SoQware
• Community driven soQware development – Focus on features required by user communi5es – Focus on bug-‐free soQware – Focus on highly reliable soQware – Focus on highly extensible soQware – Approximately 3-‐4 soQware releases per year
• Distributed under a BSD license – Interna5onal collabora5ons on soQware development – IN2P3 (France), SHAMAN (UK), ARCS (Australia), Academia Sinica (Taiwan)
![Page 36: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/36.jpg)
Capability Summary
• Clients - 50 • Policy enforcement points - 71 • Micro-services - 256 • Persistent state information - 209 • Posix I/O protocol operations - 22 • Types of data objects - 3
– data, database, soft links – adding realizable & policy encoded objects
![Page 37: Policy-Based Data Managementdatafed.org/dev/wp-content/uploads/2012/04/SC11-Datanet.pdf · 2015. 11. 16. · Topics • Policy-based data management • Integrated Rule-Oriented Data](https://reader036.vdocuments.site/reader036/viewer/2022071302/60a66c6e6a69d05c92378f71/html5/thumbnails/37.jpg)
iRODS -‐ Open Source SoQware
Reagan W. Moore [email protected]
h6p://irods.diceresearch.org
NSF OCI-0940841 “DataNet Federation Consortium” NSF OCI-1032732 "SDCI Data Improvement: Improvement and Sustainability of iRODS Data Grid Software for Multi-Disciplinary Community Driven Application" NSF OCI-0848296 “NARA Transcontinental Persistent Archives Prototype” NSF SDCI-0721400 “Data Grids for Community Driven Applications”