the role of registries within a spatial data infrastructure simon coxrob atkinson research...

26
The role of registries within a spatial data infrastructure Simon Cox Rob Atkinson Research Scientist Spatial Architect 16 April 2008

Upload: chastity-hart

Post on 15-Jan-2016

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

The role of registries within a spatial data infrastructure

Simon Cox Rob Atkinson

Research Scientist Spatial Architect

16 April 2008

Page 2: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Outline

• Spatial Data Infrastructure ~ Cyberinfrastructure

• Brief comment on state of SDI deployment• Analysis examples

• Metadata

• Concept identifiers

• Expanded role for registries

Page 3: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

SDIs

Page 4: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Spatial Data Infrastructure ideal

• Goal: automated workflow/service-chain composition on-demand

matching clients to services

• Matching services and clients requires components to be described to a high level of detail

• service type• content that it exposes

• Schema• vocabularies

• queries that it supports• response formats• quality of service• …

“Service classification axes”

Page 5: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

SDI reality

• Some service instances• OGC-WFS, WMS, WCS, OpenDAP

• Dataset metadata directories• Clearinghouses, GEON, ESIP, ASDD, Go-Geo …

• Is this enough to achieve the goal? • No

• Is the number of registered resources growing? • Not enough

• Are the right resources being registered?• No

• Why not?• Governance patterns not resolved• Metadata is insufficient, but creating it is too hard• Semantic interoperability requires community agreements

• See Markup/Standards-based methodology paper

Page 6: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Metadata

Page 7: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Metadata capture

• Everyone agrees that metadata is a good idea • But researchers are reluctant to provide it

Why?

• Not integrated with workflow• No perceived reward

• Researchers don’t themselves rely on metadata-based discovery systems

• Tedious to create• …

Page 8: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008
Page 9: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008
Page 10: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008
Page 11: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008
Page 12: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008
Page 13: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Is it the metadata models?

• Standards are complex• … but they are also highly normalized

Page 14: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

It’s the implementation!

• Records are usually de-normalized

• A better way

• Records refer to externally governed elements

 <gmd:contact>      <gmd:CI_ResponsibleParty>         <gmd:organisationName>            <gco:CharacterString>Geoscience Australia (GA)</gco:CharacterString>         </gmd:organisationName>         <gmd:positionName>            <gco:CharacterString>Director, Sales and Distribution, CIMA</gco:CharacterString>         </gmd:positionName>         <gmd:contactInfo>            <gmd:CI_Contact>               <gmd:phone>                  <gmd:CI_Telephone>                     <gmd:voice>                        <gco:CharacterString>+61 2 6249 9966</gco:CharacterString>                     </gmd:voice>                     <gmd:facsimile>                        <gco:CharacterString>+61 2 6249 9960</gco:CharacterString>                     </gmd:facsimile>                  </gmd:CI_Telephone>               </gmd:phone>               <gmd:address>                  <gmd:CI_Address>                     <gmd:deliveryPoint>                        <gco:CharacterString>GPO Box 378 </gco:CharacterString>                     </gmd:deliveryPoint>                     <gmd:city>                        <gco:CharacterString>Canberra</gco:CharacterString>                     </gmd:city>                     <gmd:administrativeArea>                        <gco:CharacterString>ACT</gco:CharacterString>                     </gmd:administrativeArea>                     <gmd:postalCode>                        <gco:CharacterString>2601</gco:CharacterString>                     </gmd:postalCode>                     <gmd:country>                        <gco:CharacterString>Australia</gco:CharacterString>                     </gmd:country>                     <gmd:electronicMailAddress>                        <gco:CharacterString>[email protected]</gco:CharacterString>                     </gmd:electronicMailAddress>                  </gmd:CI_Address>               </gmd:address>            </gmd:CI_Contact>         </gmd:contactInfo>         <gmd:role>            <gmd:CI_RoleCode codeList="http://www.isotc211.org/2005/resources/Codelist/gmxCodelists.xml#CI_RoleCode"                             codeListValue="custodian">custodian</gmd:CI_RoleCode>         </gmd:role>      </gmd:CI_ResponsibleParty>   </gmd:contact>

 <gmd:contact xlink:href=“urn:cgi:party:CGI:GA”/>

• Each record reproduces every elementEach repository assumes governance of all the elements

Page 15: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Normalized records distributed governance

• Metadata records should primarily consist of a set of references - Use keyboard only for title/label & abstract/description!

- Drop-down lists for everything else- List == (online) register

Separate registers for key classes, e.g.• Responsible party

• Access conditions

• Feature types …

• These registers are under independent governance• Access Federation data

• standard licenses

• published community schemas

Infrastructure

Page 16: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Identifiers

Page 17: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

CGI persistent identifiers

• IUGS Commission for Geoscience Information

• GeoSciML Testbed III• Interoperable WFS from 10 geological surveys

• USGS, GSC, BGS, BGR, GA, GSV, SGU, APAT, GSJ, AzGS

• More logos here

• Interoperability levels:• Schematic/model – common XML Schema

• GeoScML v2.0 - see other paper in this conference

• Semantic – common vocabularies

Page 18: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

GeoSciML Example

<MappedFeature>    …     <observationMethod> <CGI_TermValue>

            <value codeSpace="urn:cgi:classifierScheme:GA:1MillionGeology_ObservationMethods“ >GSNSW785</value>

        </CGI_TermValue></observationMethod>    <positionalAccuracy> <CGI_NumericValue>

            <principalValue uom="urn:ogc:def:uom:UCUM:m">500</principalValue>        </CGI_NumericValue> </positionalAccuracy>    <samplingFrame

xlink:href="urn:cgi:classsifier:GA:SurfaceGeologyOfEasternAustralia_1MillionScale"/>     <specification>

        <LithologicUnit >            <gml:description>Mafic volcaniclastic sandstone, siltstone, shale, chert; minor limestone, conglomerate</gml:description>             <gml:name codeSpace="urn:cgi:classifierScheme:GA:StratigraphicLexicon:Unitname“

>Kabadah Formation</gml:name>             <gml:name codeSpace="urn:cgi:classifierScheme:GA:StratigraphicLexicon:Map_symbol“

>Ojck</gml:name>             <gml:name codeSpace="urn:ietf:rfc:2141">urn:cgi:feature:GA:Stratno:29570</gml:name>

            <observationMethod> <CGI_TermValue>                    <value codeSpace="urn:cgi:classifierScheme:GA:ObservationMethods“

>published description</value>                </CGI_TermValue> </observationMethod>

            <purpose>typicalNorm</purpose>             <preferredAge> <GeologicEvent>

<eventAge> <CGI_TermValue>                            <value codeSpace="urn:cgi:classifierScheme:ICS:StratChart:2004“

>urn:cgi:classifier:ICS:StratChart:2004:Ordovician</value>                        </CGI_TermValue> </eventAge>

 <eventProcess> <CGI_TermValue>                            <value codeSpace="urn:cgi:classifierScheme:GA:Process">unspecified</value>                        </CGI_TermValue> </eventProcess>

 </GeologicEvent> </preferredAge> …

Most property values are references to registersCommon values interoperability

Page 19: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Concept identifiers

• Concepts are denoted by language-neutral identifiers

• Identifiers must be universal and persistent• urn:ogc:def:crs:EPSG:6.14:4326

• urn:cgi:classifier:ICS:StratChart:2008:ediacaran

• urn:cgi:classifierscheme:ICS:StratChart:2008

• urn:cgi:schema:CGI:GeoSciML:2.0

• urn:cgi:featuretype:CGI:GeoSciML:2.0:GeologicUnit

• urn:cgi:feature:USGS_NGDM:Id56jn23

Page 20: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

“Controlled vocabularies”

• Concepts exist in context• urn:ogc:def:crs:EPSG:6.14:4326

• urn:cgi:classifier:ICS:StratChart:2008:ediacaran

• urn:cgi:classifierscheme:ICS:StratChart:2008

• urn:cgi:schema:CGI:GeoSciML:2.0

• urn:cgi:featuretype:CGI:GeoSciML:2.0:GeologicUnit

• urn:cgi:feature:USGS_NGDM:Id56jn23

• Term from versioned vocabulary owned by an organization• Feature type defined in a schema owned by an organization

Page 21: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Governance

• The governance arrangements require separate registers of: • Classifiers, classifier-schemes

• Resource classes – def, classifier, schema, featuretype, feature,

• Concept owners – EPSG, ICS, CGI, USGS_NGDM

• and are reflected in/enforced by the structure of the persistent identifier

• urn:ogc:def:crs:EPSG:6.14:4326

• urn:cgi:classifier:ICS:StratChart:2008:ediacaran

• urn:cgi:classifierscheme:ICS:StratChart:2008

• urn:cgi:schema:CGI:GeoSciML:2.0

• urn:cgi:featuretype:CGI:GeoSciML:2.0:GeologicUnit

• urn:cgi:feature:USGS_NGDM:Id56jn23

Page 22: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Structured vs. opaque identifiers?

• URN vs DOI?

• Use structured identifiers for strongly governed concepts, system resources

• Slow rate of change, many references,

• Identifiers must be stable

• Resolution often not needed

• Useful if they are memorable

• Use opaque identifiers for weakly governed data resources• Frequent update, few references

• Data & identifiers may be transient

• Should be easily resolvable

• Don’t need to be memorable

Page 23: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Summary

Page 24: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

Key points

• Many controlled vocabs + other lists required for an infrastructure

• Each is typically under independent governance

Almost all “lists” (and ontologies) should be managed as “registers”

• Semantic web (AI for C21?) hopes to do this automatically?

• Agreements (standards) are possible in the context of coherent technical communities

• To enable an infrastructure, we need a lot of registers • These must

• Use persistent identifiers for both registers and contents

• Be resolvable

• Have transparent governance arrangements

Page 25: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Insert presentation title, do not remove CSIRO from start of footerCSIRO EGU2008-A-05920 Cox SOA

ISO 19135 Register Organization model

Page 26: The role of registries within a spatial data infrastructure Simon CoxRob Atkinson Research ScientistSpatial Architect 16 April 2008

Contact UsPhone: 1300 363 400 or +61 3 9545 2176

Email: [email protected] Web: www.csiro.au

Thank you

Exploration & MiningSimon CoxResearch Scientist

Phone: 08 6436 8639Email: [email protected]: www.seegrid.csiro.au

Land & WaterRob AtkinsonSpatial Architect

Phone: 02 4226 4339Email: [email protected]