graduate curriculum for biological information specialists: a key to integration of scale in biology...

20
Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science University of Illinois at Urbana- P. Bryan Heidorn Carole L. Palmer Dan Wright Melissa H. Cragin Graduate School of Library and Information Science University of Illinois at Urbana-Champaign 2nd International Digital Curation Conference Digital Data Curation in Practice 22 November 2006

Upload: ira-blair

Post on 11-Jan-2016

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology

Graduate School of Library and Information Science University of Illinois at Urbana-Champaign

P. Bryan HeidornCarole L. Palmer

Dan WrightMelissa H. Cragin

Graduate School of Library and Information ScienceUniversity of Illinois at Urbana-Champaign

2nd International Digital Curation ConferenceDigital Data Curation in Practice

22 November 2006

Page 2: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Outline

● Scientific Collaboration Initiative

● Information challenges of biological data

● Research foundation of our program

● Biological Information Specialists

● Data Curation Concentration in MSLIS

● Partners and internships

● Integration of research and practice

Page 3: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Scientific Communication Initiative (SCI)

Given: The ever growing universe of information resources, informatics

tools, and scholarly communication options that need to be understood, assessed, and coordinated

SCI Aims: Improve information transfer and integration, technology

development and sustainability, and collaboration in the practice of science through

basic and applied research

education to train of information specialists to work cooperatively with research scientists

● Complement, not duplicate, expertise of natural & computational scientists.

Page 4: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Information challenges in biology

● Emergent complexity is hallmark of modern biology.

Complexity of the data is of greater consequence for scientific discovery than the volume of the data

● New data practices must make sure that data at many scales is interoperable to support this kind of research

● Data is active and part of with the scientific process in a changing producer-consumer economy

● Scientific inquiry requires integration of lab data, procedures, literature and reference work

● A critical shortage of personnel trained to manage biological information and data

Page 5: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Research foundation

Modeling and computational neuroscience• Information and Discovery in Neuroscience

Palmer, NSF IIS-0222848

Automated metadata extraction and inference• Automatic Museum Label Metatdata Extraction

Heidorn, NSF DBI-9982849, NSF, DBI-0345387• Georeferencing Museum Specimen Sources

Heidorn, Moore 2005-2929-00

Terminology, schema, and ontology development• Plant Description Standards

Heidorn, IMLS NR-00-01-0017-01

Collaborative data collection• BioDiversity Survey Collaboration and Verification

Heidorn and Palmer, NSF BDI-0113918

Page 6: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Modeling and computational neuroscience

● Large user group for experimental biological data, yet rarely (if ever) generate their own data sets

● One of the communities making the most use of shared data repositories

● Difficulties in re-purposing data collected under a specific set of experimental circumstances and constraints

Metadata difficult to gather

● Needs generally not taken into account in planning, collection, and storage of experimental data

Page 7: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Automated metadata extraction and inference

● Historical collections in botany, zoology, and entomology have been curated for centuries, along with rich metadata (~ 2 billion specimen labels)

● Manual extraction is making collections globally accessible and usable (Darwin core/ABCD, DigIR, TCS)

● Automated metadata extraction: HERBIS and Biogeomancer approach

● Implication: Predictive ecological modeling under climate change

Page 8: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Terminology, schema, and ontology development

Serving goals of knowledge representation, discovery, and data integration

● Biodiversity: Informatics Core Ontology

Taxonomic Databases Working Group standards

● Neuroscience: ontology and vocabulary development work aimed at integrating animal and human imaging data.

● Text mining to ontology from texts

Page 9: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Biological Information Specialists

At present:

● Biologists at all degree levels self-trained in information technology

● Information technologists at all degree levels self-trained in biology

(both with gaps in knowledge for many months, years)

● Differing roles of BIS in large and small

Page 10: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Master of Science in Biological Informatics

Part of campus-wide bioinformatics masters program

curriculum development funded by

NSF/CISE/IIS, Education Research and Curriculum Development, 0534567 (Palmer, PI)

Degree Program began September 2006

Combines Biology, Bioinformatics, Computer Science core

with LIS courses from GSLIS long-standing top ranked program.

Page 11: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

What does a BIS need to know?

Biological training and interest in solving biological research problems

Information skills

● Evaluation and implementation of information systems: user based assessment and continual quality improvement for the development of tools that work and are used.

● Information acquisition, management, and dissemination: development of digital libraries, data archives, institutional repositories, and related tools.

● Information organization and integration: ontology development, structuring information for optimal use and sharing, and standards development.

Page 12: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

LIS orientation

● LIS is the only field concerned with the full landscape of scientific information and the interactions among fields

● Focus on information needs of users, rather then internal criteria such as technical elegance

● Tradition of the training scientific information professionals as informationist

“The informationist concept meets a critical need for an intermediary between the expanding information universe and practitioners and researchers. Successful informationists may come from a variety of backgrounds and perform a variety of roles,

but must have knowledge about both a subject domain and the process of locating, analyzing, and synthesizing information.”

(Giuse et al. 2005, p. 2) emphasis added.

Page 13: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

UIUC bioinformatics core coursework

Cross-disciplinary course distribution requirement

Example courses include:

Bioinformatics: Computing in Molecular BiologyAlgorithms in BioinformaticsPrinciples of Systematics

Computer Science: AlgorithmsDatabase Systems

Biology:Human GeneticsIntroductory BiochemistryMacromolecular Modeling

Page 14: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Sample existing LIS courses

Health Sciences Information Services and Resources

Electronic PublishingUse and Users of Information

Implementation of Information Retrieval Systems

Information Sources and Services in the Sciences

Architecture of Networked Information Systems

Information Modeling

Indexing and AbstractingBuilding Digital Libraries

Interfaces to Information Systems

Representing and Organizing Information

Page 15: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

New and proposed courses

Ontologies in the Natural Sciences (Renear)

Biodiversity Informatics (Heidorn)

Information Transfer and Collaboration in Science (Palmer)

Metadata in Theory and Practice (McDonough)

Scientific Data and Procedure Standards

Scientific Classification and Vocabulary

Discovery Informatics and Data Mining

Scientific Literatures and Bibliometrics

Bioinformatics Resources and Tools Open Access Repositories

Page 16: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

MSLIS Data Curation Concentration

Data Curation Educational Program (DCEP)

IMLS – Laura Bush 21st Century Librarian Program,

RE-05-06-0036-06 (Heidorn, PI)

Students with the DC concentration will be trained to add value to data and promote sharing across labs and disciplinary specializations

Page 17: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Integration of research and practice

● Cooperating institutions:

Biomedical Informatics Research Network (UCSD)

Arrowsmith literature mining project (University of Illinois at Chicago, Neuroscience Dept.)

Smithsonian Institution

American Museum of Natural History

Missouri Botanical Garden

U.S. Army Strategic Environmental Management Program

MIT Data Services Librarian

● Identify information problems and collect best practices from our partners to provide a broad understanding of information and data techniques, issues, and needs

● Place students in internships with our partners at biological science institutions to gain real-world biological research experience

● Cultivate new partners and new collaborative reseearch

Page 18: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

New research directions

Focus on integration and scale

Informatics infrastructure as competitive edge

Sample areas of development

Landinformatics Group

Atmospheric science, hydrology, nutrient balance, carbon cycle, ecology, agronomy

Critical Zones Observatory

Focus on data integration problems across larger ranger sciences

Page 19: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

References

Giuse, N.B., Sathe, N., and Jerome, R. (2005). Envisioning the Information Specialist in Context (ISIC): A Multi-Center Study to Articulate Roles and Training Models. Medical Library Association.

Palmer, C.L., Cragin, M.H., and Hogan, T.P. (2004). Information at the Intersections of Discovery: Case Studies in Neuroscience. Proceedings of the American Society for Information Science and Technology annual meeting 41: 448-455.

Greenberg, Jane, P. Bryan Heidorn and Stephen Seiberling (2005). Growing Vocabularies for Plant Identification and Scientific Learning. International Conference on Dublin Core and Metadata Applications (DC-2005, Sept 15, 2005), Madrid, Spain.

Page 20: Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology Graduate School of Library and Information Science

Acknowledgements

● Research grants:

IIS 022848, DBI 0345387

● GSLIS Research Writing Group

_____________________

● Scientific Communication Initiative: http://sci.lis.uiuc.edu/under development