2007 paul vanraden 1, jeff o’connell 2, george wiggans 1, kent weigel 3 1 animal improvement...

23
200 7 Paul VanRaden Paul VanRaden 1 , Jeff O’Connell , Jeff O’Connell 2 , George Wiggans , George Wiggans 1 , , Kent Weigel Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland School of Medicine, Baltimore, MD, USA 3 University of Wisconsin Dept. Dairy Science, Madison, WI, USA [email protected] 201 0 Combining Different Marker Combining Different Marker Densities in Genomic Densities in Genomic Evaluation Evaluation

Upload: bathsheba-floyd

Post on 08-Jan-2018

216 views

Category:

Documents


1 download

DESCRIPTION

Interbull meeting, Riga, Latvia May 2010 (3)Paul VanRaden 2010 Mixing Different Chips

TRANSCRIPT

Page 1: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

2007

Paul VanRadenPaul VanRaden11, Jeff O’Connell, Jeff O’Connell22, George Wiggans, George Wiggans11, Kent Weigel, Kent Weigel33 1Animal Improvement Programs Lab, USDA, Beltsville, MD, USA2University of Maryland School of Medicine, Baltimore, MD, USA3University of Wisconsin Dept. Dairy Science, Madison, WI, USA

[email protected]

2010

Combining Different Marker Combining Different Marker Densities in Genomic EvaluationDensities in Genomic Evaluation

Page 2: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (2) Paul VanRaden201

0

TopicsTopics

Filling missing SNPs (imputation)• Find haplotypes from genotypes• Use lower density to track higher• Programs implemented April 2010

Actual mixes of 3K with 50K Simulated mixes of 50K with 500K Calculating reliabilities

Page 4: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (4) Paul VanRaden201

0

What is imputation?What is imputation? Genotypes indicate how many copies of

each allele were inherited Haplotypes indicate which alleles are

on which chromosome Use observed genotypes to impute

unknown haplotypes• Pedigree haplotyping uses relatives• Population haplotyping finds matching

allele patterns

Page 5: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (5) Paul VanRaden201

0

Why impute haplotypes?Why impute haplotypes?

Predict unknown SNP from known• Measure 3,000, predict 50,000 SNP• Measure 50,000, predict 500,000• Measure each haplotype at highest

density only a few times Predict dam from progeny SNP Increase reliabilities for less cost

Page 6: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (6) Paul VanRaden201

0

Haplotyping ProgramHaplotyping Programfindhap.f90findhap.f90

Begin with population haplotyping• Divide chromosomes into segments,

~250 SNP / segment• List haplotypes by genotype match• Similar to FastPhase, IMPUTE

End with pedigree haplotyping• Detect crossover, fix noninheritance• Impute nongenotyped ancestors

Page 7: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (7) Paul VanRaden201

0

Recent Program RevisionsRecent Program Revisions

Imputation and GEBV reliability are better than in 9WCGALP paper

Changes since January 2010• Use known haplotype if second is unknown• Use current instead of base frequency• Combine parent haplotypes if crossover is

detected• Begin search with parent or grandparent

haplotypes

Page 8: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (8) Paul VanRaden201

0

Most Frequent HaplotypesMost Frequent Haplotypes

5.16% 0222222220200200220020202000200002002020000220222222022204.37% 0220202202022000200220222000022002002000002002222000022024.36% 0220200222022002000220202200002202022000022002222002022203.67% 0220202220202220020220222020200002022200002000020200020023.66% 0222222220202220220202002200000202222020000020202200020223.65% 0220200222022002000220202200002202022000022002222002022223.51% 0220022220202220220220202202002220022000000020222200022203.42% 0220022220022200220220202200202002022020002020200200020203.24% 0222222220202000000220202200202002022020002020200200020203.22% 022002222002220022002020002220000202200000202022020202220

Most frequent haplotype in first segment of chromosome 15 for Holsteins had 4,316 copies= 41,822 * 2 * .0516

Page 9: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (9) Paul VanRaden201

0

Example Bull: O-StyleExample Bull: O-StyleUSA137611441, Sire = O-ManUSA137611441, Sire = O-Man

Read genotypes, write haplotypes

Page 10: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (10) Paul VanRaden201

0

Find Haplotypes – AB codingFind Haplotypes – AB coding

Genotypes:

Oman BB,AA,AA,AB,AA,AB,AB,AA,AA,AB

Ostyle BB,AA,AA,AB,AB,AA,AA,AA,AA,AB

Haplotypes:

OStyle (pat) B A A _ A A A A A _OStyle (mat) B A A _ B A A A A _

Page 11: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (11) Paul VanRaden201

0

Find Haplotypes – 0,1,2 codingFind Haplotypes – 0,1,2 coding

Genotypes: codes 0 = BB, 1 = AB or BA, 2 = AA

Oman 0 2 2 1 2 1 1 2 2 1

Ostyle 0 2 2 1 1 2 2 2 2 1

Haplotypes: codes 0 = B , 1 = unknown, 2 = A

OStyle (pat) 0 2 2 1 2 2 2 2 2 1OStyle (mat) 0 2 2 1 0 2 2 2 2 1

Page 12: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (12) Paul VanRaden201

0

O-Style HaplotypesO-Style Haplotypeschromosome 15chromosome 15

Page 13: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (13) Paul VanRaden201

0

How does imputation work?How does imputation work?

Identify haplotypes in population using many markers

Track haplotypes with fewer markers e.g., use 5 SNP to track 25 SNP

• 5 SNP: 22020

• 25 SNP: 2022020002002002000202200

Page 14: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (14) Paul VanRaden201

0

Imputed DamsImputed Dams

If progeny and sire both genotyped• First progeny inherits 1 of dam’s 2

haplotypes• Second progeny has 50:50 chance to

get same or other haplotype• Haplotypes known with 1, 2, 3, etc.

progeny are ~50%, 75%, 87%, etc.

Page 15: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (15) Paul VanRaden201

0

Better Communication is NeededBetter Communication is Needed “Progeny genotypes should affect dam,

but programs are not yet available” Jan 2009 USDA Changes Memo

“Programs are available to impute 1300 dams” Oct 2009 USDA report to Council

“Encourage USDA to use genotypes, derived by imputation, in genetic evaluation” Oct 2009 Holstein USA Board of Directors (in Holstein Pulse)

Page 16: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (16) Paul VanRaden201

0

Haplotyping Tests – Real DataHaplotyping Tests – Real Data

Half of young animals assigned 3K• Proven bulls, cows all had 50K• Dams imputed using 50K and 3K

Half of ALL animals assigned 3K• Could 3K reference animals help?• 10,000 proven bulls yet to genotype• Should cows with 3K be predictors?

Page 17: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (17) Paul VanRaden201

0

CorrelationsCorrelations22 of 3K and PA with 50K of 3K and PA with 50K Half ofHalf of YOUNG animals had 3K PTA, half 50K PTAYOUNG animals had 3K PTA, half 50K PTA

Trait Corr(3K,50K)2 Corr(PA,50K)2 GainNM$ .899 .518 79%Milk .920 .523 83%Fat .920 .516 83%Prot .920 .555 82%PL .933 .498 87%SCS .912 .417 85%DPR .937 .539 86%

Page 18: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (18) Paul VanRaden201

0

Using 3K as Reference Genotypes Using 3K as Reference Genotypes Half ofHalf of ALL animal NM$ were from 3K, half 50KALL animal NM$ were from 3K, half 50K

REL Gain as compared to all 50K

Breed 50K prog 3K prog Imputed dams

HO 90% 73% 36%

JE 82% 56% 44%

BS 84% 72% 55%

Page 19: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (19) Paul VanRaden201

0

Simulated 500K GenotypesSimulated 500K Genotypes

Linkage in base population• Similar to actual linkage reported by:

– De Roos et al, 2008 Genetics 179:1503De Roos et al, 2008 Genetics 179:1503– Villa-Angulo et al, 2009 BMC Genetics 10:19Villa-Angulo et al, 2009 BMC Genetics 10:19

• Underlying linkage corresponds to D’ Three subsets of mixed 50K and 500K:

• Of 33,414, only 1,586 (young) had 500K • Also bulls > 99% REL, total 3,726• Also bulls > 90% REL, total 7,398

Page 20: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (20) Paul VanRaden201

0

Results from 500K SimulationResults from 500K Simulation

Density Single Mixed Single

Chips 50K 50K and 500K 500K

Missing N = 0 1,586 3,726 7,398 33,414

Before 1% 88% 80% 70% 1%

After .05% 5.3% 2.3% 1.5% .05%

REL 82.6 83.4 83.6 83.7 84.0

Page 21: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (21) Paul VanRaden201

0

REL Using Only 3K, 50K, or 500KREL Using Only 3K, 50K, or 500Kwith increasing numbers of bullswith increasing numbers of bulls

0102030405060708090

100

PA only 2,500 10,000 25,000 100,000

3K50K500K

Page 22: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (22) Paul VanRaden201

0

ConclusionsConclusions Genomic evaluations can mix different

chip densities to save $ (or € or ¥)• New programs implemented in April 2010

Only a few thousand of highest density genotypes needed, and other animals imputed

More animals can be genotyped to increase selection differential and size of reference population

Page 23: 2007 Paul VanRaden 1, Jeff O’Connell 2, George Wiggans 1, Kent Weigel 3 1 Animal Improvement Programs Lab, USDA, Beltsville, MD, USA 2 University of Maryland

Interbull meeting, Riga, Latvia May 2010 (23) Paul VanRaden201

0

AcknowledgmentsAcknowledgments

Curt Van Tassell of BFGL selected the 3,209 low density SNP

Bob Schnabel of U. Missouri fixed map locations for several SNP

Mel Tooker assisted with computation