akn and node data architecture

16
AKN and Node Data Architecture Draft 7 June 8, 2009

Upload: betha

Post on 28-Jan-2016

59 views

Category:

Documents


0 download

DESCRIPTION

AKN and Node Data Architecture. Draft 7 June 8, 2009. The Avian Knowledge Network. USER. PARTNER. Covariates. Taxonomies. Observations. Geography. Metadata. Events. PROVIDER. AKN Node Architecture. Data Services (DiGIR). Node. Node Data Warehouses. Data Access. Data - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: AKN and Node Data Architecture

AKN and Node Data Architecture

Draft 7June 8, 2009

Page 2: AKN and Node Data Architecture

The Avian Knowledge Network

Page 3: AKN and Node Data Architecture

USER

Data Entry

Data Entry

Data Processing

Data Processing

Data Visualization

Data Visualization

Data Access

Data Access

Data Analysis

Data Analysis

Data Services (DiGIR)

Data Services (DiGIR)Node Data

Warehouses

Covariates

Metadata

Data PublishingData Publishing

Events Geography

Observations Taxonomies

Node Data Store (NodeDb)

Access C

on

trol

Access C

on

trol

= dataflow

= process or application

= data store

AKN Node Architecture

PARTNER

PROVIDER

Node

Page 4: AKN and Node Data Architecture

Node Architecture and NodeDbN

od

eD

b

CountsCounts HabitatHabitatBandingBanding

Data components

ProjectSpecificProjectSpecific

EcologySpecificEcologySpecific

Lists, counts, surveys banding records,

nest data, etc…

Avian Ecology Observations

segregated by source(A, B, C..)

Published Interfaces, Filters & TransformationsPublished Interfaces, Filters & Transformations

Data RetrievalData Retrieval Data AnalysisData Analysis

SamplingUnits

SamplingUnits TaxonomyTaxonomyProtocolsProtocols Project

MetadataProject

Metadata DictionariesDictionaries

NodeDbInfrasstructure UsersUsers Roles /

GroupsRoles /GroupsRightsRights LogsLogs Other

PlumbingOther

Plumbing

ListsLists BehaviorBehaviorNestsNests

ProjectSpecificProjectSpecific

EcologySpecificEcologySpecific

BandingBanding CountsCountsNestsNests

ProjectSpecificProjectSpecific

EcologySpecificEcologySpecificA B C

warehouse1 warehouse3warehouse2Node Data

Warehouses warehouse4

Data PublishingData Publishing

. . .

Data EntryData Entry Bulk Data ProcessBulk Data Process

Node Web SiteNode Web Site Node ApplicationsNode Applications

USER PARTNER

Node

Node

USERPARTNERPROVIDER

Page 5: AKN and Node Data Architecture

How NodeDb Codifies Observations

Sampling EventsSampling EventsWhen?When?

How?How?

What recorded?What recorded? Sampling ObservationsSampling Observations

Sampling UnitsSampling UnitsWhere?Where?

Who?Who?

has m

an

y

Project /Protocol/ResearcherProject /Protocol/Researcher

Why?Why?

has m

an

yh

as m

an

y

defined in

Page 6: AKN and Node Data Architecture

NodeDb Major Data Components

TaxonomyTaxonomy

Sampling EventsSampling Events

Sampling EventObservations

Sampling EventObservations

ProtocolsProtocols

ResearchersResearchers

ProjectsProjects Sampling UnitsSampling Units

participates in

collects

is located at contains

consisting of

definesmethodologyfor

uses

Page 7: AKN and Node Data Architecture

NodeDb: Projects Focus

ProjectsProjectsProject

ProjectId

ProjectName (AK1)ProjectDescription ProjectStatusCd ProjectStartYr ProjectEndYr RightsRestrictions CopyRightDt CopyRightExpirationDt OriginalProjectId (IE1)ProjectDatabase

ProjectProtocol

ProjectId (FK,FK)ProtocolId (FK,IE1)

has many

ProjectResearcher

ProjectId (FK)ResearcherId (FK,IE1)

ProjectLeaderInd ProjectResearcherStatusCd

has many

Protocol

ProtocolId

ProtocolName ProtocolDescriptionURL ProtocolTypeCd

defines research for

Researcher

ResearcherId

ResearcherStatusCd DoNotContactInd UnknownResearcherInd PRBOdbAdminInd ResearcherFirstName (IE1)ResearcherLastName ResearcherInitials EmailAddress (variousResearcherContactInfo)

participates in

SamplingUnitProject

SamplingUnitID (FK)ProjectID (FK,IE1)

has many

SamplingUnit

SamplingUnitID

FullName (IE1,AK1)ShortName (IE1,AK1)PointGeometry CoordinatesCollection ParentSamplingUnitID Description EstablishingProjectID (IE1,AK1)PublicInd

is site of

Page 8: AKN and Node Data Architecture

NodeDb: Researchers Focus

ResearchersResearchers

Researcher

ResearcherId

ResearcherStatusCd DoNotContactInd UnknownResearcherInd PRBOdbAdminInd ResearcherFirstName (IE1)ResearcherLastName ResearcherInitials EmailAddress (variousResearcherContactInfo)

ProjectResearcher

ProjectId (FK)ResearcherId (FK,IE1)

ProjectLeaderInd ProjectResearcherStatusCd

participates in

SamplingEventObservation

ProjectId SamplingUnitId SamplingEventTypeCd ProtocolId SamplingEventCounter ObservationCounter

ResearcherId (FK)(variousObservationDetails)

observes

Users

UserName (FK)

Authorization OpenId GroupNames UserState SkinId Comment

authenticated as

GroupPermissions

GroupName (FK)Application Permission

PermissionType

Groups

GroupName (FK)

Description

has rights to

has

Page 9: AKN and Node Data Architecture

NodeDb: Sampling Units Focus

Sampling UnitsSampling Units

SamplingUnit

SamplingUnitID

FullName (IE1,AK1)ShortName (IE1,AK1)PointGeometry CoordinatesCollection ParentSamplingUnitID Description EstablishingProjectID (IE1,AK1)PublicInd

SamplingUnitType

SamplingUnitTypeID

Name ParentRequired Description SamplingUnitTypeGeometryID (IE1)

SamplingUnitParentSamplingUnitType

SamplingUnitTypeID (FK)ParentSamplingUnitTypeID

SamplingUnitSamplingUnitType

SamplingUnitID (FK)SamplingUnitTypeID (FK,IE1)

has many

categorizes encompasses

encompasses

SamplingEvent

ProjectId SamplingUnitId (FK)SamplingEventTypeCd ProtocolId (FK)SamplingEventCounter

is site of

Page 10: AKN and Node Data Architecture

Nest Monitoring

Banding/Tagging

Bird Counts

Study Area

Point CountTransect

Banding NetCollection

Nest Plot

Point CountPoint Net

A

P

A

PL

TrapCollection

Trap PA

AA

Nest SubplotA

OtherNest P

Territory A

Nest Box

Burrow

P

P

Crevice P

NestP

Other BirdObservationPoint P

Index /Census Plot

SeabirdObservationBlind

At-seaTransect

P

P

LA

Area SearchPlot

A

Sea WatchObservationPoint

Shark WatchObservationPoint P

P

Sample Collection

Other Banding LocationAP

Seabird Diet Collection P

Blood Collection P

Egg Collection P

Specimens P

Predatory FishDiet Collection Point

P

Zooplankton Trawl L

Water Sample P

Weather Station P

Remote Sensing

Hydroacoustic Data L

Biologging

TDR PL

GLS PL

Satellite PLLeg

en

d= required to be part of

= optionally part of

PL

A

= point

= line

= area

NodeDb: Sampling Unit Type Hierarchy

Page 11: AKN and Node Data Architecture

NodeDb: Protocols Focus

ProtocolsProtocols

Protocol

ProtocolId

ProtocolName ProtocolDescriptionURL ProtocolTypeCd

ProjectProtocol

ProjectId (FK,FK)ProtocolId (FK,IE1)

defines research for

SamplingEvent

ProjectId SamplingUnitId (FK)SamplingEventTypeCd ProtocolId (FK)SamplingEventCounter

describes research design for

Project

ProjectId

has

ProtocolSamplingEventType

ProtocolId (FK,FK)SamplingEventTypeCd (FK,IE1)

is a

is constrained to

Page 12: AKN and Node Data Architecture

NodeDb: Sampling Events Focus

Sampling EventsSampling Events

SamplingEvent

ProjectId SamplingUnitId (FK)SamplingEventTypeCd ProtocolId (FK)SamplingEventCounter

SamplingEvent_AreaSearch

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

SamplingEvent_Banding

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

SamplingEvent_Count

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

SamplingEvent_Encounter

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

SamplingEvent_Nest

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

SamplingEvent_Behavior

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

SamplingEvent_DeployedNet

ProjectId (FK,IE4)SamplingUnitId (FK,IE1,IE4)SamplingEventTypeCd (FK,IE2,IE4)ProtocolId (FK,IE3,IE4)SamplingEventCounter (FK,IE4)DeployedNetCounter

DeploymentDt DeploymentStartTm DeploymentEndTm NetNumber TotalNumberNets NetHours DeploymentRemarks

SamplingEvent_Habitat

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

PrimaryHabitatCd PercentPrimaryHabitat SecondaryHabitatCd SurveyUnitAspect SurveyUnitSlope SurveyUnitAspectMeasurementCd WaterInd SnagsGT10cmCount SnagsLT10cmCount LogCount RiparianZoneWidth RiparianZoneLength ChannelWidth AdjacentLandUseCd

SamplingEvent_Diet

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

SamplingEvent_Weather

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK)ProtocolId (FK)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay DiurnalLowTemperature DiurnalHighTemperature SamplingEventRemarks

SamplingEventTypeCd

Page 13: AKN and Node Data Architecture

NodeDb: Sampling Observation Focus – Point Counts

Sampling EventsSampling Events

Sampling EventObservations

Sampling EventObservations

SamplingEvent_Count

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

SamplingEventObservation_Count

ProjectId (FK,FK)SamplingUnitId (FK,IE5)SamplingEventTypeCd (FK)ProtocolId (FK,FK,FK,FK,FK,IE1,IE3,IE2)SamplingEventCounter (FK)ObservationCounter

BirdCount DetectionCueId (FK,IE1)DistanceBinId (FK,IE2)TimeBinId (FK,IE3)BreedingStatusCd SingingInd CompassBearing ClusterInd ObservationTm ResearcherId (FK)FieldBirdId (IE4)ObservationRemarks

produces

Project

ProjectId

Protocol

ProtocolId

conducts

Researcher

ResearcherId

has

defines

ProtocolDetectionCue

ProtocolId (FK)DetectionCueId

generates

has

ProtocolTimeBin

ProtocolId (FK)TimeBinId

has

containsProtocolDistanceBin

ProtocolId (FK)DistanceBinId

has

is observer distance from

Page 14: AKN and Node Data Architecture

NodeDb: Taxonomy Focus

TaxonomyTaxonomy

Taxon

TaxonId (FK,FK,FK,IE1)

TaxonName TaxonKingdomId TaxonRankId EncompassingTaxonId (IE1)ITIS_TaxonSerialNumber AOU_BirdSpeciesId AOU_PhylogeneticSequenceNo USDA_PlantSpeciesId BandSizeNote ReclassifiedTaxonId ReclassificationDt ScientificAuthority IdentificationQualifier CommonName (FK)Language (FK)

TaxonCommonName

TaxonId CommonName Language

TaxonDimension

TaxonId

EncompassingTaxonId TaxonRankId Kingdom Phylum Subphylum Class Order Suborder Family Subfamily Genus Species Subspecies CommonName

encompasses

has

has

Page 15: AKN and Node Data Architecture

NodeDb: Data Dictionary

DictionaryTable

TableId

SamplingEventTypeCd (IE1)TableName (IE2)TablePurpose ViewInd

DictionaryTableColumn

TableId (FK)ColumnId (FK,IE1)

ColumnCounter RequiredInd AutoIncrementInd PrimaryKeyInd ForeignKeyInd VirtualInd

contains

DictionaryColumn

ColumnId

ColumnName ColumnDefinition ColumnDataTypeCd ColumnUnitsCd ColumnLength DecimalPrecision RoleNameInd BaseColumnId (IE1)CodeColumnInd CodeSetId (IE2)

DictionaryCode

CodeSetId

CodeSetName CodeSetDescription ColumnId (FK)

DictionaryCodeValue

CodeSetId (FK)CodeValue

CodeValueDescription

has

is in

has

AnyNodeDbTable

SomeDataCd

table defined in

column(s) defined in

column inventory in

set of allowed values in

allowed values in

Page 16: AKN and Node Data Architecture

NodeDb: Data Dictionary ExampleSamplingEvent_AreaSearch

ProjectId (FK)SamplingUnitId (FK,IE1)SamplingEventTypeCd (FK,IE2)ProtocolId (FK,IE3)SamplingEventCounter (FK)

SamplingEventGroupId SamplingEventDataStatusCd SamplingEventDt SamplingEventStartTm SamplingEventEndTm SamplingEventYear SamplingEventMonth SamplingEventDay TotalBirdsCounted SamplingEventRemarks

DictionaryColumn

ColumnId TableName CodeSetId

221 SamplingEventDataStatusCd 23

DictionaryCode

CodeSetID CodeSetName 23 SamplingEventDataStatus

DictionaryCodeValueCodeSetId CodeSetValue Description

23 RAW Raw Input - Unreviewed23 CLEAN Biologist Reviewed

23 APPROVED Available for analysis - internal PRBO/contributing institution use only

23 AVAILABLE Level 1 Stored in the AKN’s primary data warehouse, with the option that their data are not exposed to the public at all.

23 AVAILABLE Level 2 Data can be used in certain publicly available, predefined visualizations (i.e. maps and graphs), but direct access to the data is restricted.

23 AVAILABLE Level 3Data can be used in publicly available, predefined visualizations (i.e. maps and graphs). Additionally, data is made available to existing bioinformatic efforts (GBIF and ORNIS).

23 AVAILABLE Level 4

Data are used in publicly available, predefined visualizations (i.e. maps and graphs), and are available to existing bioinformatic efforts. Additionally, the complete BMDE data set is available upon request from the original data provider.

23 AVAILABLE Level 5Data are used in publicly available, predefined visualizations (i.e. maps and graphs), and are available to existing bioinformatic efforts. Additionally, the complete BMDE data set is available for download directly via the Internet.

23 RESTRICTED Restricted access - clean data but must contact PRBO or contributing institution project leader or division director before using