lecture-25 · 2013-08-23 · 2012$%$bmmb$597d:$analyzing$next$genera;on$sequencing$data$ $...

14

2012 % BMMB 597D: Analyzing Next Genera;on Sequencing Data Week 13, Lecture 25 István Albert Biochemistry and Molecular Biology and Bioinforma;cs Consul;ng Center Penn State

Upload: others

Post on 03-Aug-2020

1 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$

$$Week$13,$Lecture$25$

István'Albert''

Biochemistry$and$Molecular$Biology$$and$Bioinforma;cs$Consul;ng$Center$

$Penn$State$

Page 2: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Final$Project$

•  Will$account$for$50%$of$your$the$grade.$

•  It$will$consist$of$one$or$more$datasets$and$a$number$of$hypotheses$that$need$to$be$tested$

•  Project$given$out$Thursday,$Dec$6th$and$is$due$Tuesday,$Dec$11th$

$

Page 3: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Data$and$scripts$for$this$lecture$

•  Data,$code$and$scripts$are$packaged$in$lec25.tar.gz'

•  have$to$master$too$many$tools$and$it$is$easy$to$get$stuck$

•  Use$it$as$a$guide$%$$don’t$just$copy/paste$$

•  Develop$your$own$mini$libraries$of$tools,$shell$scripts$and$methods$$

Page 4: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Chip%Seq$peak%caller$Recap$

1.$A$peak$caller$transforms$aligned$read$coverages$to$intervals$of$enrichment$

Page 5: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

The$most$common$misconcep;on$Cau;on:$this$a$personal$opinion$that$others$disagree$with$

Most'confusion'in'peak'calling'arises'from'combining'the'peak'calling'with'sta>s>cal'tes>ng'$•  A$peaks$means$a$region$that$appears$to$have$an$enrichment.$$

$•  The$opposite$of$peak$is$a$region$with$no'data'

$•  Some$peaks$may$be$caused$by$random$agglomera;on$of$data$–$but$that$

those$are$s;ll$peaks$–$only$that$they$are$peaks$occurring$by$random$chance$$

•  Most$published$peak$callers$will$o]en$remove$peaks$based$on$o]en$non%obvious$reasons$–$$

Page 6: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Crazy$peaks$by$MACS$

Page 7: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Understanding$peak$calling$

•  We$don’t$need$to$use$the$en;re$read!$Only$the$5’$end$ma`ers.$$

•  Transform$your$data$into$reads$that$are$1$bp$long$around$the$start$sites!$

•  See$the$README.txt$for$step$by$step$instruc;ons$

$

Page 8: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

README.txt$in$the$data$

Page 9: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Effects$of$increase$of$smoothing$

Page 10: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Isolate$peak$calling$by$strands$

•  Tools$that$merge$strands$always$need$to$make$assump;ons$on$the$data$–$some;mes$it$is$not$obvious$what$these$are$

•  The$best$approach$is$to$operate$on$strand$individually,$then$combine$the$results$

•  Caveat:$low$coverage,$error$prone$data$will$be$even$more$difficult$to$analyze$once$split$$

Page 11: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Inves;ga;ng$two$error$models$$

Fidng$and$predic;ons$smoothing=5$

Fidng$and$predic;ons$smoothing=20$

DNA$fragment$length$=$factor$footprint$

Page 12: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Peaks$with$no$exclusion$zone$around$them$

Page 13: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

How$to$get$the$average$fragment$size?$

1.  Need$to$find$peak$pairs$2.  Compute$distance$between$their$limits$$Step$by$step:$•  Expand$each$fragment$to$the$le]$by$a$limit$•  Intersect$fragment$on$the$opposite$strand$•  Compute$distances$

Page 14: lecture-25 · 2013-08-23 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$13,$Lecture$25$ István'Albert' ' Biochemistry$and$Molecular$Biology$$ and$Bioinforma;cs$Consul;ng

Homework$25$

•  Run$the$pipeline$described$in$README.txt$$

•  Report$the$average$fragment$size$that$it$generates.$

TCI 2011 Publications Final - Cleveland Clinic...networks for head and neck squamous cell carcinoma. J Clin Bioinforma. 2011;1(1):21. Bego MG, Keyes LR, Maciejewski J, St Jeor SC

*M SJOWJP QSFHJVEJ[JBMF BMMB $PSUF EJ (JVTUJ[JB DPNF

ÖĞRENCİ NO ÖĞRENCİNİN ADI SOYADI BÖLÜM DERSİN ...bhi.nku.edu.tr/basinyonetim/resim/images/editorresimleri/...1180607040 Er** Ce* ÖZ*** Biyomedikal Mühendisliği BMMB 309

BMMB#852 Lecturer:&Istvan#Albert# ([email protected])&& … · 8/25/15 1 BMMB#852:&Applied&Bioinformacs& &Week&1,&Lecture&1& István#Albert# # Bioinformacs&Consul5ng&Center& Penn&State,&2015&

Hierarchical Temporal Memory as a Means for Image Recognition by Wesley Bruning CHEM/CSE 597D Final Project Presentation December 10, 2008

CMSC423: Bioinforma#c Algorithms, Databases and Tools

Week1,Lecture1 - Pennsylvania State University · 2014-08-26 · BMMB#852:"Applied"Bioinforma0cs" "Week"1,"Lecture"1" István#Albert# # Bioinforma0cs"Consul0ng"Center" Penn"State,"2014"

lecture-2 - Pennsylvania State University · 2012-08-30 · 2012$%$BMMB$597D:$Analyzing$Next$Genera;on$Sequencing$Data$ $ $Week$1,$Lecture$2$ István'Albert' ' Biochemistry$and$Molecular$Biology$$

MEMORANDUM - portal.unimap.edu.myportal.unimap.edu.my/portal/page/portal30/FP_NEWS_EVENTS/MORE_NE... · senarai nama pelajar : kad unidebit bmmb 30 nov 2017 box 1 1. dalilah binti

จังหวัดปัตตานีo bbom bmmb od dddd bbmm [email protected] (Training Roadmap)" (Performance) (Potential) (Training Roadmap) 3 (Competency Gap) (Potential)

Memahami Urutan Genom - biogen.litbang.pertanian.go.idbiogen.litbang.pertanian.go.id/terbitan/Kuliah_Umum/Kuliah Umum-22... · Nebulizer (gas) Sonicator (suara) ... Analisa data/bioinforma

BHARATHIAR UNIVERSITY:: COIMBATORE – 641046 …syllabus.b-u.ac.in/syl_college/ug_an30asnt9.pdfDEGREE COURSE WITH COMPULSORY DIPLOMA IN BIOINFORMA TICS SEMESTER SYSTEM (WITH EFFECT

Rigid-Body Registration · 2003-10-03 · 1 Rigid-Body Registration COS 597D Mike Burns Paul Calamia September 30, 2003 • What is registration? – Finding a one-to-one mapping

Guide économique 20130205 - Quebec · BMMB – Guide d’analyse économique 4 février 2013 iii ... 1.1 Les objectifs du guide d’analyse économique.....3 1.2 La clientèle visée

ChIP:%Quick%Overview% Chroma=n%Immuno'Precipita=on% · 11/21/13% 1% 2012%'%BMMB%597D:%Analyzing%Next%Genera=on%Sequencing%Data% % %Week%13,%Lecture%26% István'Albert' ' Biochemistry%and%Molecular%Biology%%

Facadebrochure Sweden korr1€¦ · 4pn tmvutuszlojohtgÊsh qÌ bmmb njofsbmjtlb voefsmbh tbnu qÌ voefsmbh bw bmmb uzqfs bw hbnnbm cÊslsbgujh gÊsh jodmvtjwf tjmjlbugÊsh nfe voeboubh

Welcome to BMMB 597D! - Pennsylvania State Universitybiopieces - Biopieces is a bioinformatic framework of tools easily used and easily created. - Google Project Hosting - File Edit

Analyse de rentabilité économique des éclaircies ... · bois (BMMB) par Laurent Gagné, coordonnateur du chantier d’éclaircie commerciale pour la CRÉ du Bas-St-Laurent. L’objectif

BLAST:’Basic’Local’Alignment’’ 2014’)’BMMB’852:’Applied

studentsrepo.um.edu.mystudentsrepo.um.edu.my/1997/8/BIBLIOGRAFI.pdf · Mualamah Banking Dept. (1994), Muamalah Banking Products, Kuala Lumpur. BMMB

Dedication and Commitment Flash/AE 597D/FINAL SUBMISSI… · on this project. Being a leader in this movement through edu-cating students and community based on the systems implemented

BMMB MUAMALAT MALAYSIA BERHAD QR PAYMENT MOBILE … · 1.1.3 The Merchant acknowledge that the BMMB shall not be liable and Merchant shall indemnify the BMMB for any loss or damage

Bioinforma)cs,in,a,Box. 04/18/15. - Vermont Genetics Network · Summer Bioinformatics Intensive Internship ! NM Bioinformatics, Science and Technology (NMBIST) ... conference ! Sequencing

saraburi · VI (Training Roadmap)" 6) lgdbG) G). (Training Roadmap)" 10. "mea (Training Roadmap)" G) - tunnîû d,doo o bbom bmmb od dddd Igbmm 0 bbom bmmb od dddd bbmm

2017 - UC Irvine Oﬃce of Informa on Technology Customer Sa ...€¦ · Informa on Systems, Bioinforma cs, & parallel & serial programming needs; provide support for the usage of

BMMB 3033 Teori Sastera Barat

$06/53: 3*4, %BMMB UFPSJB BMMB QSBUJDBimages.vc.camcom.it/f/Varie/30/3001_CCIAAVC_2882012.pdfIndirizzo: Piazza Poli, 37/42 – 00187 Roma. E-mail: [email protected] 2 Indice Introduzione

Analyse de rentabilité économique des plantations au Québec...Simon Guay (BFEC) Annie Malenfant (DCAR 11) Sébastien Meunier (DEX 07) Édith Tremblay (BMMB) Révision linguistique

Rigid-Body Registration · 1 Rigid-Body Registration COS 597D Mike Burns Paul Calamia September 30, 2003 • What is registration? – Finding a one-to-one mapping between two or

Bioinforma)cs Series: Designing An NGS Study For My ... · Comprehensive Molecular Characterization of Papillary Renal-Cell Carcinoma, The Cancer Genome Atlas Research Network, N

BMMB 3033 Kesusasteraan Melayu

105€¦ · t 01&/*/( 4&44*0/ #fowfovup f dpoejwjtjpof jogpsnb[jpoj ufdojdif bodif jo sfmb[jpof bmmb qpttjcjmjuË ej joufswfojsf qpssf epnboef bj sfmbupsj

Introduc)on*to*Molecular*Biology* and*Bioinforma)cs*Workshop

%Week%6,%Lecture%12% - Pennsylvania State University · 2014. 10. 2. · 2014%&%BMMB%852:%Applied%Bioinformacs% % %Week%6,%Lecture%12% István’Albert’ ’ Bioinformacs%Consul8ng%Center%

Introduction to Bioinformatics analysis of Metabarcoding data...Genomics, Proteomics Bioinforma., 13, 278–289. Sanger ABI Ion Torrent 454 Illumina-400 Illumina MiSeq 2nd 2 x 300