the challenge of biodiversity: plot, organism, and taxonomic databases robert k. peet

Post on 20-Mar-2016

29 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

The challenge of biodiversity: Plot, organism, and taxonomic databases Robert K. Peet University of North Carolina The National Plots Database Committee John Harris NCEAS. A case study: VegBank - The ESA Vegetation Plot Archive. Project organized and directed by: - PowerPoint PPT Presentation

TRANSCRIPT

The challenge of biodiversity:Plot, organism, and taxonomic databases

Robert K. PeetUniversity of North Carolina

The National Plots Database Committee

John HarrisNCEAS

A case study: VegBank - The ESA Vegetation Plot Archive

Project supported by:National Center for Ecological Analysis & SynthesisU.S. National Science FoundationUSGS-BRD Gap Analysis ProgramABI / The Nature Conservancy

Project organized and directed by:Robert K. Peet, University of North CarolinaMarilyn Walker, USDA Forest Service & U. AlaskaDennis Grossman, The Nature Conservancy / ABIMichael Jennings, USGS-BRD & UCSB

Observation/CollectionEvent

Object or specimen

Taxon

Locality

Biodiversity data structure

Taxonomic databases

Plot/Inventory databases

Specimen databases

Web-interface

Veg Classification Database

VegBank

Proposal

Raw Plot Data

Vegetation/Biodiversity

Information flow in the US National Vegetation Classification

TaxonomicDatabase

Proposal

Taxonomic database challengeThe problem:

Integration of data potentially representing different times, places, investigators and taxonomic standards

The traditional solution: A standard list of kinds of organisms.

There exist numerous compilations of organism names.

For example:

• Species 2000http://www.sp2000.org/default.html(Composed of 18 participant databases)

• All Species http://www.all-species.org

• ITIS http://www.itis.usda.gov/(The US government standard list, plus Canada & Mexico)

• Index to organism nameshttp://www.biosis.org.uk/triton/indexfm.htm

Taxon-specific standard lists are available. Representative examples for higher plants include:North America / US

USDA Plants http://plants.usda.gov/ITIS http://www.itis.usda.gov/ NatureServe http://www.natureserve.org

WorldIPNI International Plant Names Checklist

http://www.ipni.org/IOPI Global Plant Checklist

http://www.bgbm.fu-berlin.de/IOPI/GPC/

Most standardized taxon lists fail to allow effective integration of datasets.

The reasons include:

• The user cannot reconstruct the database as viewed at an arbitrary time in the past,

• Taxonomic concepts are not defined (just lists),

• Multiple party perspectives on taxonomic concepts and names cannot be supported or reconciled.

Current standards• Biological organisms are named following international rules of nomenclature.

• Database standards are being developed by TDWG, GBIF, IOPI, etc.

• Metadata standards have been developed. For example, the Darwin Core is a profile describing the minimum set of standards for search and retrieval of natural history collections and observation databases. (http://tsadev.speciesanalyst.net/DarwinCore/)

Carya ovata(Miller)K. Koch

Carya carolinae-sept.(Ashe) Engler & Graebner

Carya ovata(Miller)K. Koch

sec. Gleason 1952 sec. Radford et al. 1968

Three concepts of shagbark hickorySplitting one species into two illustrates the ambiguity often associated with scientific names. If you encounter the name “Carya ovata (Miller) K. Koch” in a database, you cannot be sure which of two meanings applies.

R. plumosa

R. plumosa

R plumosav. intermedia

R. plumosav. plumosa

R. intermedia

R. plumosav. interrupta R. pineticola

R. plumosa R. sp. 1

R. plumosav. plumosa

R. plumosav. pineticola

Multiple concepts of Rhynchospora plumosa s.l.

Elliot 1816

Gray 1834

Kral 1998

Peet 2002?

1

2

3

Chapman1860

Name ReferenceAssertion

An assertion represents a unique combination of a name and a reference

“Assertion” is equivalent to “Potential taxon” & “taxonomic concept”

NamesCarya ovata Carya carolinae-septentrionalisCarya ovata v. australis

Assertions(One shagbark)C. ovata sec Gleason ’52C. ovata (sl) sec FNA ‘97

(Southern shagbark)C. carolinae-s. sec Radford ‘68C. ovata v. australis sec FNA ‘97

(Northern shagbark)C. ovata sec Radford ‘68C. ovata (v. ovata) sec FNA ‘97

ReferencesGleason 1952 Britton & BrownRadford et al. 1968 Flora CarolinasStone 1997 Flora North America

Six shagbark hickory assertionsPossible taxonomic synonyms are listed together

Name AssertionUsage

A usage represents a unique combination of an assertion and a name.

Usages can be used to track nomenclatural synonyms

1. Carya ovata2. C. carolinae3. C. ovata var. australis

A. ovata sec. GleasonB. ovata sl sec. FNAC. carolinae sec. RadfordD. ovata australis sec. FNAE. ovata sec. RadfordF. ovata ovata sec. FNA

1-F OK2-D OK3-D Syn

Names AssertionsITIS Usage

ITIS likely views the linkage of the assertion “Carya ovata var. australis sec. FNA 1997” with the name “Carya ovata var. australis” as a nomenclatural synonym.

Name AssertionUsage

A usage (name assignment) and assertion (taxon concept) can be

combined in a single model

Reference

Party Perspective

The Party Perspective on an Assertion includes:

•Status – Standard, Nonstandard, Undetermined

• Correlation with other assertions – Equal, Greater, Lesser, Overlap,

Undetermined.

•Lineage – Predecessor and Successor assertions.

•Start & Stop dates.

ITISFNA CommitteeNatureServeUSDA Plants

Carya ovata sec Gleason 1952Carya ovata (sl) sec FNA 1997 Carya ovata sec Radford 1968Carya carolinae sec Radford 1968Carya ovata (ovata) sec FNA 1997Carya ovata australis sec FNA 1997

Party Assertion

Party Assertion Status Start Name

ITIS ovata – G52 NS 1996ITIS ovata – R68 St 1996 ovataITIS carolinae – R68 St 1996 carolinaeITIS carolinae – R68 NS 2000ITIS ovata aust – FNA St 2000 carolinaeITIS ovata – R68 NS 2000ITIS ovata ovata – FNA St 2000 ovata

Status

VegBank taxonomic data model

Concept-based taxonomy is coming! • All organisms/specimens in databases should be identified by linkage to an assertion = name and reference!

• Various standards are being developed by FGDC, TDWG, IOPI, GBIF, etc.

• Most major databases are working toward inclusion of assertions (e.g. ITIS, IOPI, HDMS).

• Until standard assertion lists are available, databases that track organisms should include couplets containing both a scientific name and a reference.

(Inter)National Taxonomic Database?

• Concept-based• Party-neutral• Synonymy and lineage tracking• Perfectly archived

An upgrade for ITIS & Species 2000?

Specimen/object/occurrence databases

Information on specimens/objects/occurrence-observations should be tracked by reference to

• Place (place or collection)

• Unique identifier (accession number)

• Time

A museum is a place

Annotation should be by assertion (concept)!

Database systems for tracking specimens

The following are a few of the many available• BioLink http://www.ento.csiro.au/biolink/index.html

• Specify http://usobi.org/specify/default.htm

• Biota http://viceroy.eeb.uconn.edu/Biota

• Taxis http://taxis.virtualave.net/

TDWG maintains links to multiple software systemshttp://www.bgbm.fu-berlin.de/TDWG/acc/Software.htm

Plots Database SystemsSeveral plot database systems are available. Among the best know and widely used are:

• TurboVeg (over 1,000,000 plots stored) http://www.alterra.nl/onderzoek/producten/websites/turboveg/

• Plots (NatureServe NPS Mapping Project)

A vegetation plot archive?

There is currently no standard repository for plot data.

A repository is needed for:

• Plot storage

• Plot access and identification

• Plot documentation in literature/databases

This would be equivalent to GenBank for vegetation.

Project

Plot PlotObservation

Taxon Observation

Taxon Interpretation

PlotInterpretation

Core elements of VegBank

Interface tools•Desktop client for data preparation and local use.

•Flexible data inport, including XML.

•Tools for linking taxonomic and community concepts.

•Standard query, flexible query, SQL query.

•Flexible data export, including XML.

•Local data refresh

•Easy web access to central archive

Conclusions for database designers1. Records of organisms should always contain

(or point to) couplets consisting of a scientific name and a reference where the name was used.

2. Design for future annotation of organism and community concepts.

3. Track specimens/objects by unique identifier with metadata including location, annotation & time.

4. Design for reobservation. Separate permanent from transient attributes.

5. Archival databases should support time-specific views.

Infrastructural needs1. A national or international database of taxon concepts

with support for at least one (ITIS?) party perspective.

2. Software tools for development and documentation of taxonomic concepts (including irregular concepts) and party perspectives.

3. Completion and long-term support for a national or international archive for vegetation plot data (VegBank) and similar community observations.

4. A national or international database of community concepts with support for at least the FGDC party perspective.

top related