the modern synthesis population genetics systematics paleontology botany and zoology

41

Upload: clare-baldwin

Post on 13-Jan-2016

228 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology
Page 2: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

The Modern Synthesis

Population genetics

SystematicsPaleontology

Botany and Zoology

Page 3: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

ModernPost-modern

Page 4: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

• Address grand challenge questions in evolutionary biology

• Focus on areas ripe for conceptual synthesis

• Allow the wealth of existing data to be fully utilized

• Facilitate collaboration among disciplines and institutions

• Support scholars pursuing synthetic research

Page 5: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Sponsored science at NESCent

• Visiting postdoctoral and sabbatical scholars

• Catalysis meetings (~30 participants; highly interdisciplinary, to spawn collaborations)

• Working groups (~15 participants, more focused topic, multiple meetings, specific outcomes)

Page 6: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Education & Outreach

• Staff– Kristin Jenkins – Jory Weintraub

• Disseminate NESCent science• Promote evolution education at institutions

serving under-represented groups• Organize annual symposium for National

Association of Biology Teachers (with AIBS)• Support NESCent postdoc professional

development• Consult on broader impacts

Page 7: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Business & Logistics

• Staff– Karen Henry, Assistant Director for Administration

– Jeff Sturkey, Logistics Manager– Barbara Mitchell, Business Manager– Marcia Painter, Administrative Assistant

Page 8: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Informatics @ NESCentMission

– Support for sponsored science– “Cyberinfrastructure” to enable evolutionary synthesis• Data sharing/exchange and database technology• Software development• Training, dissemination and user support

Resources– IT and bioinformatics staff (currently hiring!)– Hardware for HPC, software development, web/db services

– Logistical and technical support for collaborative development projects

Page 9: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Thinking about a proposal?Here’s what to expect

• We can – Provide infrastructure for collaborative projects

– Prototype databases, software, etc. meant for public dissemination

– Partner with you on external grants to go beyond the prototype stage

• We can’t– Provide long-term maintenance for public resources (except through SDSC Data Central)

– Adopt all possible technologies – Substitute for data experts or database curators

Page 10: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Major initiatives

• Where do they come from?– Working groups and center visitors– Whitepapers - come to us with your dreams!– Advisory groups

• Leverage a vast array of existing resources– Open source software initiatives– Expertise and energy within the evolution community (and beyond)

• This is your center - use us!

Page 11: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Evolutionary model organisms

Page 12: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Evolutionary model system databases

Interface of genomics and evolutionary biology– Genome models are venturing into phylogenetics and population genetics

– Evolutionary biologists are acquiring genomic information for their own model systems

How many times must we invent the same genome database?– Expensive– Distracting– Not easy to do, but easy to do poorly

Page 13: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

QuickTime™ and aTIFF (LZW) decompressor

are needed to see this picture.

QuickTime™ and aTIFF (LZW) decompressor

are needed to see this picture.

Tailoring GMOD to evolutionary model

systems

• Extension of schema and visualization tools for genetic and phenotypic variation, geographic information and phylogenetics

• Providing user support

Page 14: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Brokering between natural phenotypic diversity and model organism genetics

Working group led by Paula Mabee (U. South Dakota) & Monte Westerfield (U. of Oregon)

Page 15: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Phenotypic reasoning

• What zebrafish mutants differ from the wild type the same way that this apomorphy distinguishes this clade?

• Are characters that are phylogenetically correlated also genetically correlated in zebrafish mutants?

Page 16: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

What is an ontology?

• A way for a community to agree upon the meanings of terms and relations in order to reliably share knowledge about a specific domain

• Automated processes that share this knowledge representation can perform simple reasoning

Page 17: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Elements of an ontology

• Types: Collections of objects that share common attributes

• Attributes: properties that objects can have and share

• Instances: Specific objects that manifest a type

• Relations: ways that objects can be related to one another

Page 18: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Gene Ontology

from yeastgenome.org

Page 19: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Representing phenotypes

• Entity-Qua[l|nt]ity (EQ) syntax – Entities come from an organism-specific anatomy ontology (AO)•Several are being developed for genomic model organisms

– Qualities come from the Phenotype Attributes and Trait Ontology (PATO)

Page 20: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Use of PATO

from arabidopsis.info

Page 21: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Use your center

• Upcoming calls for proposals– Postdoc/sabbaticals - December– Working groups/catalysis meetings - Dec & June

• Submit a whitepaper (whenever)• Consider us for collaborative projects• Contact us:

– Todd Vision (Assoc. Director): [email protected]

– Hilmar Lapp (Asst. Director): [email protected]

Page 22: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology
Page 23: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Globally Unique Identifiers (GUIDs)

• With – Taxonomic Data Working Group (TDWG) – Global Biodiversity Information Facility (GBIF)

• The aim: to have a persistent, unique, resolvable identifier for a biological resource that can be distinguished from its location on the web– Museum specimens– Taxonomic concepts– Publication (e.g. DOI)

• Adopting Life Science Identifiers (LSIDs)

Page 24: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Software initiatives

• Sponsoring working groups and hackathons in phyloinformatics and population genetics

• Training– Summer course in evolutionary informatics

Page 25: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Outline

• The importance of informatics to evolutionary synthesis

• What kinds of informatics support NESCent set up to do (and not to do)

• Leveraging open source software initiatives and training the future community of evolutionary informaticists

• The role of databases and our efforts to tailor GMOD to evolutionary model organisms

• Some semantic web technologies (e.g. ontologies) that every evolutionary biologist should care about

• The cypriniformes phenotype comparison engine as an example that ties these themes together

Page 26: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

DRIADE Stakeholders Wkshp

5 Dec 2006

Page 27: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Published data today

Page 28: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology
Page 29: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Where we’d like to be tomorrow

Page 30: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Digital Repository for Information and Data in

Evolution (DRIADE)

Page 31: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Proposed role for DRIADE

Journals

Specialized data repositories

(eg Genbank, Morphbank, PaleoDB Treebase)

Societies

Researchers

PublishersEditors

ReviewersDigital DataRepository

Page 32: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Data registries and repositories

Page 33: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology
Page 34: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology
Page 35: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology
Page 36: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Role of metadata

• Two central questions– What metadata to keep?– How to get the metadata that’s needed?

• Partnership with the <MRC>– Director: Jane Greenberg

Page 37: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Many other decisions to be made

• Technical– What software solutions to adopt?– Centralized or distributed physical storage?– How to interface with journals and specialized databases?

• Social– Is deposition mandatory?– Are there additional incentives?

• Intellectual Property– Are there any limits on data use?

• Business– How to ensure long-term financial stewardship?– How much to invest in metadata curation?

Page 38: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Other disciplinary data repositories and data integration projects

• caBio (Cancer Biology)• Conservation Commons (Conservation Biology)

• GEON (Geology)• ICPSR (Inter-university Consortium for Political and Social Research)

• Marine Metadata Initiative• Virtual Observatory (Astronomy)• World Data Center System (Geography)

Page 39: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

A reality-based approach

Page 40: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

DRIADE: Near-term plans

• Requirements gathering and evaluation phase– Technical consultations

• Workshop for information science experts in March 2007

• To learn from the experiences of other disciplines

– Stakeholder consultations• Major design meeting in May 2007

• Implementation– Phase I: Stopgap solution?– Phase 2: for which external funding will be required

– Feasible milestones?

Page 41: The Modern Synthesis Population genetics Systematics Paleontology Botany and Zoology

Schedule9:00-10:00 - BackgroundHilmar Lapp - Objectives and requirements gatheringJane Greenberg & Co. - Issues regarding metadata Ahrash Bissell - OpenContext

10:00 - Brainstorming sessionWhat use cases do we imagine?What are the requirements and

priorities?How to continue gathering requirements?What experts and stakeholders to

include?12:00 Lunch