ontology evaluation, metrics, and metadata in ncbo bioportal

16
Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal Natasha Noy Stanford University

Upload: oma

Post on 15-Jan-2016

26 views

Category:

Documents


4 download

DESCRIPTION

Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal. Natasha Noy Stanford University. What Are We Trying to Do?. You’ve built an ontology, how do you let the world know? You need an ontology, where do you go to get it? How do you know whether an ontology is any good?. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Natasha NoyStanford University

Page 2: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

What Are We Trying to Do?

• You’ve built an ontology, how do you let the world know?

• You need an ontology, where do you go to get it?

• How do you know whether an ontology is any good?

Page 3: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

BioPortal: A Community-Based Ontology Repository

http://bioportal.bioontology.org

Page 4: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

BioPortal Today

• 195 ontologies in biomedical domain (as of 4/28/2010)

• Ontologies in six different formats• ~ 1.7 million classes• ~ 2 million mappings

Page 5: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Evaluating Ontologies in BioPortal

• Ontology metrics– Computed automatically

• User reviews– Submitted by users: star ratings and free-text

reviews along different dimensions• Other metadata– “Stamp of approval” from a specific community

Page 6: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Ontology Metrics• You can find the details here:

http://www.bioontology.org/wiki/index.php/Ontology_Metrics• Statistics:

– Number of classes– Number of instances– Number of properties– Maximum depth of the class hierarchy– Maximum number of siblings– Average number of siblings

• Metrics assessing conformance to best practices:– Classes with a single subclass– Classes with a large number of subclasses (more than 20)– Classes with no definition

• Auditing information:– Contributors and authors of individual classes

Page 7: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

The Metrics Page in BioPortalYou can view metrics for any ontology

Page 8: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Peer Review of Ontologies

• The people who know the answer to these question are– (maybe) ontology authors– other users of the ontology

• Our solution: Allow users to provide ratings for ontologies

Which ontology is appropriate for my task?

Page 9: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Reviewers Provide

• General review and rating• Usage information– Which applications have successfully used the

ontology?– What problems were encountered?

• Coverage– Does it cover the domain properly?– Are there major gaps?– Are some parts developed better than others?

Page 10: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Projects in BioPortal

• Users can describe their ontology-based projects in BioPortal

• Projects are linked to ontologies they are using• Ontology reviews can be done in the context of

projects

Page 11: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Dimensions of a Review

• Degree of formality• Documentation and support• Usability• Domain coverage• Correctness• Quality of content

Page 12: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Organizational Stamps of Approval

• BioPortal ontologies belong to different groups:– OBO Foundry members and candidates– WHO Family of International Classifications– UMLS– caBIG– We can add other groups….

• Each ontology can be in 0, 1, or more groups• We should (but currently don’t) link to the

criteria for inclusion in each group

Page 13: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Maintaining Metadata throughOntology Versioning

• We attach reviews to a specific version of an ontology• We expose the reviews for all subsequent versions

– We plan to add an indicator “This review was submitted for an earlier version of this ontology”

Potentially, any part of the description can change

Page 14: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Representing Metrics and Reviews

• Ontology Metadata Vocabulary (OMV):– OMV is a metadata schema that captures salient

information about an ontology– OMV is implemented as an OWL ontology

• BioPortal Metadata ontology extends OMV to add NCBO-specific metadata and evaluation metrics

Page 15: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

BioPortal Metadata Ontology

Page 16: Ontology Evaluation, Metrics, and Metadata in NCBO BioPortal

Discussion

• We can extend the BioPortal Metadata ontology to represent– Ontology profiles– Additional metrics– Sets of metrics and criteria for a specific organization

• All information in BioPortal is accessible through REST services– Other applications can use the metadata– For example, there can be an external application that compares

different profiles• All metadata is specific to an ontology version, but it is

accessible through a version-independent ontology id