biological d ata management and preservation richard slayden, phd
DESCRIPTION
Biological D ata Management and Preservation Richard Slayden, PhD. Microbiology, Immunology & Pathology. CURRENT DATA MANAGEMENT & PRESERVATION STRATEGIES USED BY BIOLOGISTS. Data Preservation. Data Management. What experimental data makes up information?. Study Design Experiment - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/1.jpg)
Biological Data Management and Preservation
Richard Slayden, PhD.Microbiology, Immunology & Pathology
![Page 2: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/2.jpg)
CURRENT DATA MANAGEMENT & PRESERVATION STRATEGIES USED BY BIOLOGISTS
Data PreservationData Management
![Page 3: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/3.jpg)
Information
Study DesignExperimentComplexityData ReductionAnalysis
What experimental data makes up information?
![Page 4: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/4.jpg)
Sequencing: 1-400 genomes (bacterial)
Analysis: reference annotation vs re-annotation
Source of data: Historical data or newly generated
Integration of biological information, data complexity and “version”
Current data issues
![Page 5: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/5.jpg)
Example of where data is coming from: Next Generation Sequencing Technology
![Page 6: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/6.jpg)
Reference or De Novo Genome sequence data
Resequencing/SNP Analysis
Whole Transcriptome/small RNA/microbial RNA/human RNA
Epigenetics
Gene Essentiality
Metagenomic studies
Examples of biological data: Not limited to genome sequencing
![Page 7: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/7.jpg)
Schu4
Isolate #1
Isolate #2
LVS
Integration of data: Genome Analysis-Genome structure and arrangement
![Page 8: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/8.jpg)
Capturing and Updating Biological information and Function
![Page 9: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/9.jpg)
RESOLUTION OF DATA-UNIQUE DATA FROM A SINGLE INFECTION
![Page 10: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/10.jpg)
0500
10001500200025003000350040004500
0500
10001500200025003000350040004500
RPKM
Non-annotated open reading frames
Annotated open reading frames
RPKM
IDENTIFICATION OF NEW GENOMIC INFORMATION: Assignment of Function
![Page 11: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/11.jpg)
INTEGRATION OF DIFFERENT SOURCES OF DATA
Whole genome essential gene mapping
NS SNPs+ORFs-ORFs
Genome size: ~1.9 million bases
Input pool: 196,044 mutations (~10%)
Bacteria from lung: 179,782 mutations
Bacteria from Spleen: 77,806 mutations
Mapped 1,419 unique non-synonymous SNPs across the genome
74% are within proposed open reading frames
![Page 12: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/12.jpg)
Combine bioinformatics [promoter families] and transcriptional analysis
GENOME AND “FUNCTIONAL” INFORMATION
![Page 13: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/13.jpg)
Envisioned Needs in context of the BIOLOGIST:
1. Data Storage-where is the data
2. Access & maintenance-has it been changed, if so in what way, and by who, and for what reason
3. Access to data files-interface with data for manipulation and data analysis & output
4. Distribution of data files-Provide data in “universal” format where state of analysis is embedded and can be integrated with other data
5. Compatibility of analytical software and future interfaces
Future data management and preservation
![Page 14: Biological D ata Management and Preservation Richard Slayden, PhD](https://reader034.vdocuments.site/reader034/viewer/2022051518/5681602d550346895dcf4001/html5/thumbnails/14.jpg)
Questions: