advanced quantitative research methodology, lecture notes ...readings king, gary; james honaker; ann...

250
Advanced Quantitative Research Methodology, Lecture Notes: Missing Data 1 Gary King http://GKing.Harvard.Edu April 16, 2012 1 c Copyright 2012 Gary King, All Rights Reserved. Gary King http://GKing.Harvard.Edu () Advanced Quantitative Research Methodology, Lecture Notes: April 16, 2012 1 / 42

Upload: others

Post on 01-Mar-2021

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Advanced Quantitative Research Methodology, LectureNotes: Missing Data1

Gary Kinghttp://GKing.Harvard.Edu

April 16, 2012

1 c©Copyright 2012 Gary King, All Rights Reserved.Gary King http://GKing.Harvard.Edu () Advanced Quantitative Research Methodology, Lecture Notes: Missing DataApril 16, 2012 1 / 42

Page 2: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Readings

King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001.“Analyzing Incomplete Political Science Data: An AlternativeAlgorithm for Multiple Imputation,” APSR 95, 1 (March, 2001):49-69.

James Honaker and Gary King. “What to do about Missing Values inTime Series Cross-Section Data,” AJPS 54, 2 (April, 2010): 561-581

Amelia II: A Program for Missing Data

http://gking.harvard.edu/amelia

Gary King () Missing Data 2 / 42

Page 3: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Summary

Some common but biased or inefficient missing data practices:

Make up numbers: e.g., changing Party ID “don’t knows” to“independent”Listwise deletion: used by 94% pre-2000 in AJPS/APSR/BJPSVarious other ad hoc approaches

Application-specific methods: efficient, but model-dependent andhard to develop and use

An easy-to-use and statistically appropriate alternative, Multipleimputation: (1) fill in five data sets with different imputations formissing values, (2) analyze each one as you would withoutmissingness, and (3) use a special method to combine the results.[We need to show you how to create imputations and how to combinethem]

Gary King () Missing Data 3 / 42

Page 4: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Summary

Some common but biased or inefficient missing data practices:

Make up numbers: e.g., changing Party ID “don’t knows” to“independent”

Listwise deletion: used by 94% pre-2000 in AJPS/APSR/BJPSVarious other ad hoc approaches

Application-specific methods: efficient, but model-dependent andhard to develop and use

An easy-to-use and statistically appropriate alternative, Multipleimputation: (1) fill in five data sets with different imputations formissing values, (2) analyze each one as you would withoutmissingness, and (3) use a special method to combine the results.[We need to show you how to create imputations and how to combinethem]

Gary King () Missing Data 3 / 42

Page 5: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Summary

Some common but biased or inefficient missing data practices:

Make up numbers: e.g., changing Party ID “don’t knows” to“independent”Listwise deletion: used by 94% pre-2000 in AJPS/APSR/BJPS

Various other ad hoc approaches

Application-specific methods: efficient, but model-dependent andhard to develop and use

An easy-to-use and statistically appropriate alternative, Multipleimputation: (1) fill in five data sets with different imputations formissing values, (2) analyze each one as you would withoutmissingness, and (3) use a special method to combine the results.[We need to show you how to create imputations and how to combinethem]

Gary King () Missing Data 3 / 42

Page 6: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Summary

Some common but biased or inefficient missing data practices:

Make up numbers: e.g., changing Party ID “don’t knows” to“independent”Listwise deletion: used by 94% pre-2000 in AJPS/APSR/BJPSVarious other ad hoc approaches

Application-specific methods: efficient, but model-dependent andhard to develop and use

An easy-to-use and statistically appropriate alternative, Multipleimputation: (1) fill in five data sets with different imputations formissing values, (2) analyze each one as you would withoutmissingness, and (3) use a special method to combine the results.[We need to show you how to create imputations and how to combinethem]

Gary King () Missing Data 3 / 42

Page 7: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Summary

Some common but biased or inefficient missing data practices:

Make up numbers: e.g., changing Party ID “don’t knows” to“independent”Listwise deletion: used by 94% pre-2000 in AJPS/APSR/BJPSVarious other ad hoc approaches

Application-specific methods: efficient, but model-dependent andhard to develop and use

An easy-to-use and statistically appropriate alternative, Multipleimputation: (1) fill in five data sets with different imputations formissing values, (2) analyze each one as you would withoutmissingness, and (3) use a special method to combine the results.[We need to show you how to create imputations and how to combinethem]

Gary King () Missing Data 3 / 42

Page 8: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Summary

Some common but biased or inefficient missing data practices:

Make up numbers: e.g., changing Party ID “don’t knows” to“independent”Listwise deletion: used by 94% pre-2000 in AJPS/APSR/BJPSVarious other ad hoc approaches

Application-specific methods: efficient, but model-dependent andhard to develop and use

An easy-to-use and statistically appropriate alternative, Multipleimputation: (1) fill in five data sets with different imputations formissing values, (2) analyze each one as you would withoutmissingness, and (3) use a special method to combine the results.[We need to show you how to create imputations and how to combinethem]

Gary King () Missing Data 3 / 42

Page 9: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Notation

D =

1 2.5 432 05 3.2 543 12 7.4 219 16 1.9 234 13 1.2 108 00 7.7 95 1

, M =

1 1 1 11 1 1 11 0 1 00 1 0 10 1 1 10 1 1 1

Dmis = missing elements in D (in Red)Dobs = observed elements in D

Missing elements must exist (what’s your view on the National HeliumReserve?)

Gary King () Missing Data 4 / 42

Page 10: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Notation

D =

1 2.5 432 05 3.2 543 12 7.4 219 16 1.9 234 13 1.2 108 00 7.7 95 1

, M =

1 1 1 11 1 1 11 0 1 00 1 0 10 1 1 10 1 1 1

Dmis = missing elements in D (in Red)Dobs = observed elements in D

Missing elements must exist (what’s your view on the National HeliumReserve?)

Gary King () Missing Data 4 / 42

Page 11: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Notation

D =

1 2.5 432 05 3.2 543 12 7.4 219 16 1.9 234 13 1.2 108 00 7.7 95 1

, M =

1 1 1 11 1 1 11 0 1 00 1 0 10 1 1 10 1 1 1

Dmis = missing elements in D (in Red)

Dobs = observed elements in D

Missing elements must exist (what’s your view on the National HeliumReserve?)

Gary King () Missing Data 4 / 42

Page 12: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Notation

D =

1 2.5 432 05 3.2 543 12 7.4 219 16 1.9 234 13 1.2 108 00 7.7 95 1

, M =

1 1 1 11 1 1 11 0 1 00 1 0 10 1 1 10 1 1 1

Dmis = missing elements in D (in Red)Dobs = observed elements in D

Missing elements must exist (what’s your view on the National HeliumReserve?)

Gary King () Missing Data 4 / 42

Page 13: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Notation

D =

1 2.5 432 05 3.2 543 12 7.4 219 16 1.9 234 13 1.2 108 00 7.7 95 1

, M =

1 1 1 11 1 1 11 0 1 00 1 0 10 1 1 10 1 1 1

Dmis = missing elements in D (in Red)Dobs = observed elements in D

Missing elements must exist (what’s your view on the National HeliumReserve?)

Gary King () Missing Data 4 / 42

Page 14: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Possible Assumptions

You can predictAssumption Acronym M with:

Missing Completely At Random MCAR —Missing At Random MAR Dobs

Nonignorable NI Dobs & Dmis

• Reasons for the odd terminology are historical.

Gary King () Missing Data 5 / 42

Page 15: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Possible Assumptions

You can predictAssumption Acronym M with:

Missing Completely At Random MCAR —Missing At Random MAR Dobs

Nonignorable NI Dobs & Dmis

• Reasons for the odd terminology are historical.

Gary King () Missing Data 5 / 42

Page 16: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)

P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 17: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)

P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 18: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)

P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 19: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)

P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 20: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)

P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 21: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)

e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)

P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 22: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)

P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 23: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)

P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 24: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)P(M|D) doesn’t simplify

e.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 25: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variables

Adding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 26: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Missingness Assumptions, again

1. MCAR: Coin flips determine whether to answer survey questions (naive)

P(M|D) = P(M)

2. MAR: missingness is a function of measured variables (empirical)

P(M|D) ≡ P(M|Dobs ,Dmis) = P(M|Dobs)

e.g., Independents are less likely to answer vote choice question (withPID measured)e.g., Some occupations are less likely to answer the income question(with occupation measured)

3. NI: missingness depends on unobservables (fatalistic)P(M|D) doesn’t simplifye.g., censoring income if income is > $100K and you can’t predict highincome with other measured variablesAdding variables to predict income can change NI to MAR

Gary King () Missing Data 6 / 42

Page 27: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How Bad Is Listwise Deletion?

Goal: estimate β1, where X2 has λ missing values (y , X1 are fullyobserved).

E (y) = X1β1 + X2β2

The choice in real research:

Infeasible Estimator Regress y on X1 and a fully observed X2, and use bI1,

the coefficient on X1.

Omitted Variable Estimator Omit X2 and estimate β1 by bO1 , the slope

from regressing y on X1.

Listwise Deletion Estimator Perform listwise deletion on {y ,X1,X2}, andthen estimate β1 as bL

1 , the coefficient on X1 whenregressing y on X1 and X2.

Gary King () Missing Data 7 / 42

Page 28: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How Bad Is Listwise Deletion?

Goal: estimate β1, where X2 has λ missing values (y , X1 are fullyobserved).

E (y) = X1β1 + X2β2

The choice in real research:

Infeasible Estimator Regress y on X1 and a fully observed X2, and use bI1,

the coefficient on X1.

Omitted Variable Estimator Omit X2 and estimate β1 by bO1 , the slope

from regressing y on X1.

Listwise Deletion Estimator Perform listwise deletion on {y ,X1,X2}, andthen estimate β1 as bL

1 , the coefficient on X1 whenregressing y on X1 and X2.

Gary King () Missing Data 7 / 42

Page 29: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How Bad Is Listwise Deletion?

Goal: estimate β1, where X2 has λ missing values (y , X1 are fullyobserved).

E (y) = X1β1 + X2β2

The choice in real research:

Infeasible Estimator Regress y on X1 and a fully observed X2, and use bI1,

the coefficient on X1.

Omitted Variable Estimator Omit X2 and estimate β1 by bO1 , the slope

from regressing y on X1.

Listwise Deletion Estimator Perform listwise deletion on {y ,X1,X2}, andthen estimate β1 as bL

1 , the coefficient on X1 whenregressing y on X1 and X2.

Gary King () Missing Data 7 / 42

Page 30: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How Bad Is Listwise Deletion?

Goal: estimate β1, where X2 has λ missing values (y , X1 are fullyobserved).

E (y) = X1β1 + X2β2

The choice in real research:

Infeasible Estimator Regress y on X1 and a fully observed X2, and use bI1,

the coefficient on X1.

Omitted Variable Estimator Omit X2 and estimate β1 by bO1 , the slope

from regressing y on X1.

Listwise Deletion Estimator Perform listwise deletion on {y ,X1,X2}, andthen estimate β1 as bL

1 , the coefficient on X1 whenregressing y on X1 and X2.

Gary King () Missing Data 7 / 42

Page 31: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How Bad Is Listwise Deletion?

Goal: estimate β1, where X2 has λ missing values (y , X1 are fullyobserved).

E (y) = X1β1 + X2β2

The choice in real research:

Infeasible Estimator Regress y on X1 and a fully observed X2, and use bI1,

the coefficient on X1.

Omitted Variable Estimator Omit X2 and estimate β1 by bO1 , the slope

from regressing y on X1.

Listwise Deletion Estimator Perform listwise deletion on {y ,X1,X2}, andthen estimate β1 as bL

1 , the coefficient on X1 whenregressing y on X1 and X2.

Gary King () Missing Data 7 / 42

Page 32: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How Bad Is Listwise Deletion?

Goal: estimate β1, where X2 has λ missing values (y , X1 are fullyobserved).

E (y) = X1β1 + X2β2

The choice in real research:

Infeasible Estimator Regress y on X1 and a fully observed X2, and use bI1,

the coefficient on X1.

Omitted Variable Estimator Omit X2 and estimate β1 by bO1 , the slope

from regressing y on X1.

Listwise Deletion Estimator Perform listwise deletion on {y ,X1,X2}, andthen estimate β1 as bL

1 , the coefficient on X1 whenregressing y on X1 and X2.

Gary King () Missing Data 7 / 42

Page 33: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

In the best case scenerio for listwise deletion (MCAR),should we delete listwise or omit the variable?

Mean Square Error as a measure of the badness of an estimator a of a.

MSE(a) = E [(a− a)2]

= V (a) + [E (a− a)]2

= Variance(a) + bias(a)2

To compare, compute

MSE (bL1)−MSE(bO

1 ) =

{> 0 when omitting the variable is better

< 0 when listwise deletion is better

Gary King () Missing Data 8 / 42

Page 34: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

In the best case scenerio for listwise deletion (MCAR),should we delete listwise or omit the variable?

Mean Square Error as a measure of the badness of an estimator a of a.

MSE(a) = E [(a− a)2]

= V (a) + [E (a− a)]2

= Variance(a) + bias(a)2

To compare, compute

MSE (bL1)−MSE(bO

1 ) =

{> 0 when omitting the variable is better

< 0 when listwise deletion is better

Gary King () Missing Data 8 / 42

Page 35: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

In the best case scenerio for listwise deletion (MCAR),should we delete listwise or omit the variable?

Mean Square Error as a measure of the badness of an estimator a of a.

MSE(a) = E [(a− a)2]

= V (a) + [E (a− a)]2

= Variance(a) + bias(a)2

To compare, compute

MSE (bL1)−MSE(bO

1 ) =

{> 0 when omitting the variable is better

< 0 when listwise deletion is better

Gary King () Missing Data 8 / 42

Page 36: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

In the best case scenerio for listwise deletion (MCAR),should we delete listwise or omit the variable?

Mean Square Error as a measure of the badness of an estimator a of a.

MSE(a) = E [(a− a)2]

= V (a) + [E (a− a)]2

= Variance(a) + bias(a)2

To compare, compute

MSE (bL1)−MSE(bO

1 ) =

{> 0 when omitting the variable is better

< 0 when listwise deletion is better

Gary King () Missing Data 8 / 42

Page 37: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

In the best case scenerio for listwise deletion (MCAR),should we delete listwise or omit the variable?

Mean Square Error as a measure of the badness of an estimator a of a.

MSE(a) = E [(a− a)2]

= V (a) + [E (a− a)]2

= Variance(a) + bias(a)2

To compare, compute

MSE (bL1)−MSE(bO

1 ) =

{> 0 when omitting the variable is better

< 0 when listwise deletion is better

Gary King () Missing Data 8 / 42

Page 38: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

In the best case scenerio for listwise deletion (MCAR),should we delete listwise or omit the variable?

Mean Square Error as a measure of the badness of an estimator a of a.

MSE(a) = E [(a− a)2]

= V (a) + [E (a− a)]2

= Variance(a) + bias(a)2

To compare, compute

MSE (bL1)−MSE(bO

1 ) =

{> 0 when omitting the variable is better

< 0 when listwise deletion is better

Gary King () Missing Data 8 / 42

Page 39: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

In the best case scenerio for listwise deletion (MCAR),should we delete listwise or omit the variable?

Mean Square Error as a measure of the badness of an estimator a of a.

MSE(a) = E [(a− a)2]

= V (a) + [E (a− a)]2

= Variance(a) + bias(a)2

To compare, compute

MSE (bL1)−MSE(bO

1 ) =

{> 0 when omitting the variable is better

< 0 when listwise deletion is better

Gary King () Missing Data 8 / 42

Page 40: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 41: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 )

=

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 42: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 43: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 44: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 45: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 46: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 47: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles

> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 48: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM Conference

Larger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 49: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

MSE(bL1)−MSE(bO

1 ) =

1− λV(bI

1)

)+ F [V(bI

2)− β2β′2]F

= (Missingness part) + (Observed part)

1. Missingness part (> 0) is an extra tilt away from listwise deletion

2. Observed part is the standard bias-efficiency tradeoff of omittingvariables, even without missingness

3. How big is λ usually?

λ ≈ 1/3 on average in real political science articles> 1/2 at a recent SPM ConferenceLarger for authors who work harder to avoid omitted variable bias

Gary King () Missing Data 9 / 42

Page 50: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

4. If λ ≈ 0.5, the contribution of the missingness (tilting away fromchoosing listwise deletion over omitting variables) is

RMSE difference =

√λ

1− λV (bI

1) =

√0.5

1− 0.5SE(bI

1) = SE(bI1)

(The sqrt of only one piece, for simplicity, not the difference.)

5. Result: The point estimate in the average political science article isabout an additional standard error farther away from the truth becauseof listwise deletion (as compared to omitting X2 entirely).

6. Conclusion: Listwise deletion is often as bad a problem as the muchbetter known omitted variable bias — in the best case scenerio (MCAR)

Gary King () Missing Data 10 / 42

Page 51: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

4. If λ ≈ 0.5, the contribution of the missingness (tilting away fromchoosing listwise deletion over omitting variables) is

RMSE difference =

√λ

1− λV (bI

1) =

√0.5

1− 0.5SE(bI

1) = SE(bI1)

(The sqrt of only one piece, for simplicity, not the difference.)

5. Result: The point estimate in the average political science article isabout an additional standard error farther away from the truth becauseof listwise deletion (as compared to omitting X2 entirely).

6. Conclusion: Listwise deletion is often as bad a problem as the muchbetter known omitted variable bias — in the best case scenerio (MCAR)

Gary King () Missing Data 10 / 42

Page 52: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

4. If λ ≈ 0.5, the contribution of the missingness (tilting away fromchoosing listwise deletion over omitting variables) is

RMSE difference =

√λ

1− λV (bI

1) =

√0.5

1− 0.5SE(bI

1) = SE(bI1)

(The sqrt of only one piece, for simplicity, not the difference.)

5. Result: The point estimate in the average political science article isabout an additional standard error farther away from the truth becauseof listwise deletion (as compared to omitting X2 entirely).

6. Conclusion: Listwise deletion is often as bad a problem as the muchbetter known omitted variable bias — in the best case scenerio (MCAR)

Gary King () Missing Data 10 / 42

Page 53: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

4. If λ ≈ 0.5, the contribution of the missingness (tilting away fromchoosing listwise deletion over omitting variables) is

RMSE difference =

√λ

1− λV (bI

1) =

√0.5

1− 0.5SE(bI

1) = SE(bI1)

(The sqrt of only one piece, for simplicity, not the difference.)

5. Result: The point estimate in the average political science article isabout an additional standard error farther away from the truth becauseof listwise deletion (as compared to omitting X2 entirely).

6. Conclusion: Listwise deletion is often as bad a problem as the muchbetter known omitted variable bias — in the best case scenerio (MCAR)

Gary King () Missing Data 10 / 42

Page 54: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Derivation and Implications

4. If λ ≈ 0.5, the contribution of the missingness (tilting away fromchoosing listwise deletion over omitting variables) is

RMSE difference =

√λ

1− λV (bI

1) =

√0.5

1− 0.5SE(bI

1) = SE(bI1)

(The sqrt of only one piece, for simplicity, not the difference.)

5. Result: The point estimate in the average political science article isabout an additional standard error farther away from the truth becauseof listwise deletion (as compared to omitting X2 entirely).

6. Conclusion: Listwise deletion is often as bad a problem as the muchbetter known omitted variable bias — in the best case scenerio (MCAR)

Gary King () Missing Data 10 / 42

Page 55: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Existing General Purpose Missing Data Methods

Fill in or delete the missing data, and then act as if there were no missingdata. None work in general under MAR.

1. Listwise deletion (RMSE is 1 SE off if MCAR holds; biased under MAR)

2. Best guess imputation (depends on guesser!)

3. Imputing a zero and then adding an additional dummy variable tocontrol for the imputed value (biased)

4. Pairwise deletion (assumes MCAR)

5. Hot deck imputation, (inefficient, standard errors wrong)

6. Mean substitution (attenuates estimated relationships)

Gary King () Missing Data 11 / 42

Page 56: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Existing General Purpose Missing Data Methods

Fill in or delete the missing data, and then act as if there were no missingdata. None work in general under MAR.

1. Listwise deletion (RMSE is 1 SE off if MCAR holds; biased under MAR)

2. Best guess imputation (depends on guesser!)

3. Imputing a zero and then adding an additional dummy variable tocontrol for the imputed value (biased)

4. Pairwise deletion (assumes MCAR)

5. Hot deck imputation, (inefficient, standard errors wrong)

6. Mean substitution (attenuates estimated relationships)

Gary King () Missing Data 11 / 42

Page 57: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Existing General Purpose Missing Data Methods

Fill in or delete the missing data, and then act as if there were no missingdata. None work in general under MAR.

1. Listwise deletion (RMSE is 1 SE off if MCAR holds; biased under MAR)

2. Best guess imputation (depends on guesser!)

3. Imputing a zero and then adding an additional dummy variable tocontrol for the imputed value (biased)

4. Pairwise deletion (assumes MCAR)

5. Hot deck imputation, (inefficient, standard errors wrong)

6. Mean substitution (attenuates estimated relationships)

Gary King () Missing Data 11 / 42

Page 58: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Existing General Purpose Missing Data Methods

Fill in or delete the missing data, and then act as if there were no missingdata. None work in general under MAR.

1. Listwise deletion (RMSE is 1 SE off if MCAR holds; biased under MAR)

2. Best guess imputation (depends on guesser!)

3. Imputing a zero and then adding an additional dummy variable tocontrol for the imputed value (biased)

4. Pairwise deletion (assumes MCAR)

5. Hot deck imputation, (inefficient, standard errors wrong)

6. Mean substitution (attenuates estimated relationships)

Gary King () Missing Data 11 / 42

Page 59: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Existing General Purpose Missing Data Methods

Fill in or delete the missing data, and then act as if there were no missingdata. None work in general under MAR.

1. Listwise deletion (RMSE is 1 SE off if MCAR holds; biased under MAR)

2. Best guess imputation (depends on guesser!)

3. Imputing a zero and then adding an additional dummy variable tocontrol for the imputed value (biased)

4. Pairwise deletion (assumes MCAR)

5. Hot deck imputation, (inefficient, standard errors wrong)

6. Mean substitution (attenuates estimated relationships)

Gary King () Missing Data 11 / 42

Page 60: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Existing General Purpose Missing Data Methods

Fill in or delete the missing data, and then act as if there were no missingdata. None work in general under MAR.

1. Listwise deletion (RMSE is 1 SE off if MCAR holds; biased under MAR)

2. Best guess imputation (depends on guesser!)

3. Imputing a zero and then adding an additional dummy variable tocontrol for the imputed value (biased)

4. Pairwise deletion (assumes MCAR)

5. Hot deck imputation, (inefficient, standard errors wrong)

6. Mean substitution (attenuates estimated relationships)

Gary King () Missing Data 11 / 42

Page 61: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Existing General Purpose Missing Data Methods

Fill in or delete the missing data, and then act as if there were no missingdata. None work in general under MAR.

1. Listwise deletion (RMSE is 1 SE off if MCAR holds; biased under MAR)

2. Best guess imputation (depends on guesser!)

3. Imputing a zero and then adding an additional dummy variable tocontrol for the imputed value (biased)

4. Pairwise deletion (assumes MCAR)

5. Hot deck imputation, (inefficient, standard errors wrong)

6. Mean substitution (attenuates estimated relationships)

Gary King () Missing Data 11 / 42

Page 62: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Existing General Purpose Missing Data Methods

Fill in or delete the missing data, and then act as if there were no missingdata. None work in general under MAR.

1. Listwise deletion (RMSE is 1 SE off if MCAR holds; biased under MAR)

2. Best guess imputation (depends on guesser!)

3. Imputing a zero and then adding an additional dummy variable tocontrol for the imputed value (biased)

4. Pairwise deletion (assumes MCAR)

5. Hot deck imputation, (inefficient, standard errors wrong)

6. Mean substitution (attenuates estimated relationships)

Gary King () Missing Data 11 / 42

Page 63: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

7. y-hat regression imputation, (optimistic: scatter when observed,perfectly linear when unobserved; SEs too small)

8. y-hat + ε regression imputation (assumes no estimation uncertainty,does not help for scattered missingness)

Gary King () Missing Data 12 / 42

Page 64: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

7. y-hat regression imputation, (optimistic: scatter when observed,perfectly linear when unobserved; SEs too small)

8. y-hat + ε regression imputation (assumes no estimation uncertainty,does not help for scattered missingness)

Gary King () Missing Data 12 / 42

Page 65: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

7. y-hat regression imputation, (optimistic: scatter when observed,perfectly linear when unobserved; SEs too small)

8. y-hat + ε regression imputation (assumes no estimation uncertainty,does not help for scattered missingness)

Gary King () Missing Data 12 / 42

Page 66: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Application-specific Methods for Missing Data

1. Base inferences on the likelihood function or posterior distribution, byconditioning on observed data only, P(θ|Yobs).

2. E.g., models of censoring, truncation, etc.

3. Optimal theoretically, if specification is correct

4. Not robust (i.e., sensitive to distributional assumptions) if model isincorrect

5. Often difficult practically

6. Does not handle missingness scattered through X and Y

Gary King () Missing Data 13 / 42

Page 67: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Application-specific Methods for Missing Data

1. Base inferences on the likelihood function or posterior distribution, byconditioning on observed data only, P(θ|Yobs).

2. E.g., models of censoring, truncation, etc.

3. Optimal theoretically, if specification is correct

4. Not robust (i.e., sensitive to distributional assumptions) if model isincorrect

5. Often difficult practically

6. Does not handle missingness scattered through X and Y

Gary King () Missing Data 13 / 42

Page 68: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Application-specific Methods for Missing Data

1. Base inferences on the likelihood function or posterior distribution, byconditioning on observed data only, P(θ|Yobs).

2. E.g., models of censoring, truncation, etc.

3. Optimal theoretically, if specification is correct

4. Not robust (i.e., sensitive to distributional assumptions) if model isincorrect

5. Often difficult practically

6. Does not handle missingness scattered through X and Y

Gary King () Missing Data 13 / 42

Page 69: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Application-specific Methods for Missing Data

1. Base inferences on the likelihood function or posterior distribution, byconditioning on observed data only, P(θ|Yobs).

2. E.g., models of censoring, truncation, etc.

3. Optimal theoretically, if specification is correct

4. Not robust (i.e., sensitive to distributional assumptions) if model isincorrect

5. Often difficult practically

6. Does not handle missingness scattered through X and Y

Gary King () Missing Data 13 / 42

Page 70: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Application-specific Methods for Missing Data

1. Base inferences on the likelihood function or posterior distribution, byconditioning on observed data only, P(θ|Yobs).

2. E.g., models of censoring, truncation, etc.

3. Optimal theoretically, if specification is correct

4. Not robust (i.e., sensitive to distributional assumptions) if model isincorrect

5. Often difficult practically

6. Does not handle missingness scattered through X and Y

Gary King () Missing Data 13 / 42

Page 71: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Application-specific Methods for Missing Data

1. Base inferences on the likelihood function or posterior distribution, byconditioning on observed data only, P(θ|Yobs).

2. E.g., models of censoring, truncation, etc.

3. Optimal theoretically, if specification is correct

4. Not robust (i.e., sensitive to distributional assumptions) if model isincorrect

5. Often difficult practically

6. Does not handle missingness scattered through X and Y

Gary King () Missing Data 13 / 42

Page 72: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Application-specific Methods for Missing Data

1. Base inferences on the likelihood function or posterior distribution, byconditioning on observed data only, P(θ|Yobs).

2. E.g., models of censoring, truncation, etc.

3. Optimal theoretically, if specification is correct

4. Not robust (i.e., sensitive to distributional assumptions) if model isincorrect

5. Often difficult practically

6. Does not handle missingness scattered through X and Y

Gary King () Missing Data 13 / 42

Page 73: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create application-specific methods?

1. We observe M always. Suppose we also see all the contents of D.

2. Then the likelihood is

P(D,M|θ, γ) = P(D|θ)P(M|D, γ),

the likelihood if D were observed, and the model for missingness.

If D and M are observed, when can we drop P(M|D, γ)?Stochastic and parametric independence

3. Suppose now D is observed (as usual) only when M is 1.

Gary King () Missing Data 14 / 42

Page 74: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create application-specific methods?

1. We observe M always. Suppose we also see all the contents of D.

2. Then the likelihood is

P(D,M|θ, γ) = P(D|θ)P(M|D, γ),

the likelihood if D were observed, and the model for missingness.

If D and M are observed, when can we drop P(M|D, γ)?Stochastic and parametric independence

3. Suppose now D is observed (as usual) only when M is 1.

Gary King () Missing Data 14 / 42

Page 75: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create application-specific methods?

1. We observe M always. Suppose we also see all the contents of D.

2. Then the likelihood is

P(D,M|θ, γ) = P(D|θ)P(M|D, γ),

the likelihood if D were observed, and the model for missingness.

If D and M are observed, when can we drop P(M|D, γ)?Stochastic and parametric independence

3. Suppose now D is observed (as usual) only when M is 1.

Gary King () Missing Data 14 / 42

Page 76: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create application-specific methods?

1. We observe M always. Suppose we also see all the contents of D.

2. Then the likelihood is

P(D,M|θ, γ) = P(D|θ)P(M|D, γ),

the likelihood if D were observed, and the model for missingness.

If D and M are observed, when can we drop P(M|D, γ)?Stochastic and parametric independence

3. Suppose now D is observed (as usual) only when M is 1.

Gary King () Missing Data 14 / 42

Page 77: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create application-specific methods?

1. We observe M always. Suppose we also see all the contents of D.

2. Then the likelihood is

P(D,M|θ, γ) = P(D|θ)P(M|D, γ),

the likelihood if D were observed, and the model for missingness.

If D and M are observed, when can we drop P(M|D, γ)?Stochastic and parametric independence

3. Suppose now D is observed (as usual) only when M is 1.

Gary King () Missing Data 14 / 42

Page 78: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create application-specific methods?

1. We observe M always. Suppose we also see all the contents of D.

2. Then the likelihood is

P(D,M|θ, γ) = P(D|θ)P(M|D, γ),

the likelihood if D were observed, and the model for missingness.

If D and M are observed, when can we drop P(M|D, γ)?

Stochastic and parametric independence

3. Suppose now D is observed (as usual) only when M is 1.

Gary King () Missing Data 14 / 42

Page 79: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create application-specific methods?

1. We observe M always. Suppose we also see all the contents of D.

2. Then the likelihood is

P(D,M|θ, γ) = P(D|θ)P(M|D, γ),

the likelihood if D were observed, and the model for missingness.

If D and M are observed, when can we drop P(M|D, γ)?Stochastic and parametric independence

3. Suppose now D is observed (as usual) only when M is 1.

Gary King () Missing Data 14 / 42

Page 80: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create application-specific methods?

1. We observe M always. Suppose we also see all the contents of D.

2. Then the likelihood is

P(D,M|θ, γ) = P(D|θ)P(M|D, γ),

the likelihood if D were observed, and the model for missingness.

If D and M are observed, when can we drop P(M|D, γ)?Stochastic and parametric independence

3. Suppose now D is observed (as usual) only when M is 1.

Gary King () Missing Data 14 / 42

Page 81: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 82: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ)

=

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 83: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 84: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 85: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 86: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 87: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 88: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 89: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 90: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Then the likelihood integrates out the missing observations

P(Dobs ,M|θ, γ) =

∫P(D|θ)P(M|D, γ)dDmis

and if assume MAR (D and M are stochastically and parametricallyindependent), then

= P(Dobs |θ)P(M|Dobs , γ),

∝ P(Dobs |θ)

because P(M|Dobs , γ) is constant w.r.t. θ

5. Without the MAR assumption, the missingness model can’t bedropped; its an NI model.

6. Specifying the missingness mechanism is hard. Little theory is available

7. NI models (Heckman, many others) haven’t always done well whentruth is known

Gary King () Missing Data 15 / 42

Page 91: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 92: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 93: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 94: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR

(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 95: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components

(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 96: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations

(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 97: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 98: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 99: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets

(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 100: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 101: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ much

ii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 102: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 103: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation

Point estimates are consistent, efficient, and the standard errors are right.To compute:

1. Impute m values for each missing element

(a) Imputation method assumes MAR(b) Uses a model with stochastic and systematic components(c) Produces independent imputations(d) (We’ll give you a model to impute later)

2. Create m completed data sets

(a) Observed data are the same across the data sets(b) Imputations of missing data differ

i. Cells we can predict well don’t differ muchii. Cells we can’t predict well differ a lot

3. Run whatever statistical method you would have with no missing datafor each completed data set

Gary King () Missing Data 16 / 42

Page 104: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Overall Point estimate: average individual point estimates, qj

(j = 1, . . . ,m):

q =1

m

m∑j=1

qj

Standard error: use this equation:

SE(q)2 = mean(SE2j ) + variance(qj) (1 + 1/m)

= within + between

Last piece vanishes as m increases

5. Easier by simulation: draw 1/m sims from each data set of the QOI,combine (i.e., concatenate into a larger set of simulations), and makeinferences as usual.

Gary King () Missing Data 17 / 42

Page 105: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Overall Point estimate: average individual point estimates, qj

(j = 1, . . . ,m):

q =1

m

m∑j=1

qj

Standard error: use this equation:

SE(q)2 = mean(SE2j ) + variance(qj) (1 + 1/m)

= within + between

Last piece vanishes as m increases

5. Easier by simulation: draw 1/m sims from each data set of the QOI,combine (i.e., concatenate into a larger set of simulations), and makeinferences as usual.

Gary King () Missing Data 17 / 42

Page 106: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Overall Point estimate: average individual point estimates, qj

(j = 1, . . . ,m):

q =1

m

m∑j=1

qj

Standard error: use this equation:

SE(q)2 = mean(SE2j ) + variance(qj) (1 + 1/m)

= within + between

Last piece vanishes as m increases

5. Easier by simulation: draw 1/m sims from each data set of the QOI,combine (i.e., concatenate into a larger set of simulations), and makeinferences as usual.

Gary King () Missing Data 17 / 42

Page 107: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Overall Point estimate: average individual point estimates, qj

(j = 1, . . . ,m):

q =1

m

m∑j=1

qj

Standard error: use this equation:

SE(q)2 = mean(SE2j ) + variance(qj) (1 + 1/m)

= within + between

Last piece vanishes as m increases

5. Easier by simulation: draw 1/m sims from each data set of the QOI,combine (i.e., concatenate into a larger set of simulations), and makeinferences as usual.

Gary King () Missing Data 17 / 42

Page 108: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Overall Point estimate: average individual point estimates, qj

(j = 1, . . . ,m):

q =1

m

m∑j=1

qj

Standard error: use this equation:

SE(q)2 = mean(SE2j ) + variance(qj) (1 + 1/m)

= within + between

Last piece vanishes as m increases

5. Easier by simulation: draw 1/m sims from each data set of the QOI,combine (i.e., concatenate into a larger set of simulations), and makeinferences as usual.

Gary King () Missing Data 17 / 42

Page 109: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Overall Point estimate: average individual point estimates, qj

(j = 1, . . . ,m):

q =1

m

m∑j=1

qj

Standard error: use this equation:

SE(q)2 = mean(SE2j ) + variance(qj) (1 + 1/m)

= within + between

Last piece vanishes as m increases

5. Easier by simulation: draw 1/m sims from each data set of the QOI,combine (i.e., concatenate into a larger set of simulations), and makeinferences as usual.

Gary King () Missing Data 17 / 42

Page 110: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

4. Overall Point estimate: average individual point estimates, qj

(j = 1, . . . ,m):

q =1

m

m∑j=1

qj

Standard error: use this equation:

SE(q)2 = mean(SE2j ) + variance(qj) (1 + 1/m)

= within + between

Last piece vanishes as m increases

5. Easier by simulation: draw 1/m sims from each data set of the QOI,combine (i.e., concatenate into a larger set of simulations), and makeinferences as usual.

Gary King () Missing Data 17 / 42

Page 111: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 112: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 113: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ)

(a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 114: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 115: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 116: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 117: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 118: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 119: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 120: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

A General Model for Imputations

1. If data were complete, we could use:

L(µ,Σ|D) ∝n∏

i=1

N(Di |µ,Σ) (a SURM model without X )

2. With missing data, this becomes:

L(µ,Σ|Dobs) ∝n∏

i=1

∫N(Di |µ,Σ)dDmis

=n∏

i=1

N(Di,obs |µobs ,Σobs)

since marginals of MVN’s are normal.

3. Simple theoretically: merely a likelihood model for data (Dobs ,M) and sameparameters as when fully observed (µ,Σ).

4. Difficult computationally: Di,obs has different elements observed for each i andso each observation is informative about different pieces of (µ,Σ).

Gary King () Missing Data 18 / 42

Page 121: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. Difficult Statistically: number of parameters increases quickly in the number ofvariables (p, columns of D):

parameters = parameters(µ) + parameters(Σ)

= p + p(p + 1)/2 = p(p + 3)/2.

E.g., for p = 5, parameters= 20; for p = 40 parameters= 860 (Compare to n.)

6. More appropriate models, such as for categorical or mixed variables, are harderto apply and do not usually perform better than this model (If you’re going touse a difficult imputation method, you might as well use anapplication-specific method. Our goal is an easy-to-apply, generally applicable,method even if 2nd best.)

7. For social science survey data, which mostly contain ordinal scales, this is areasonable choice for imputation, even though it may not be a good choice foranalysis.

Gary King () Missing Data 19 / 42

Page 122: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. Difficult Statistically: number of parameters increases quickly in the number ofvariables (p, columns of D):

parameters = parameters(µ) + parameters(Σ)

= p + p(p + 1)/2 = p(p + 3)/2.

E.g., for p = 5, parameters= 20; for p = 40 parameters= 860 (Compare to n.)

6. More appropriate models, such as for categorical or mixed variables, are harderto apply and do not usually perform better than this model (If you’re going touse a difficult imputation method, you might as well use anapplication-specific method. Our goal is an easy-to-apply, generally applicable,method even if 2nd best.)

7. For social science survey data, which mostly contain ordinal scales, this is areasonable choice for imputation, even though it may not be a good choice foranalysis.

Gary King () Missing Data 19 / 42

Page 123: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. Difficult Statistically: number of parameters increases quickly in the number ofvariables (p, columns of D):

parameters = parameters(µ) + parameters(Σ)

= p + p(p + 1)/2

= p(p + 3)/2.

E.g., for p = 5, parameters= 20; for p = 40 parameters= 860 (Compare to n.)

6. More appropriate models, such as for categorical or mixed variables, are harderto apply and do not usually perform better than this model (If you’re going touse a difficult imputation method, you might as well use anapplication-specific method. Our goal is an easy-to-apply, generally applicable,method even if 2nd best.)

7. For social science survey data, which mostly contain ordinal scales, this is areasonable choice for imputation, even though it may not be a good choice foranalysis.

Gary King () Missing Data 19 / 42

Page 124: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. Difficult Statistically: number of parameters increases quickly in the number ofvariables (p, columns of D):

parameters = parameters(µ) + parameters(Σ)

= p + p(p + 1)/2 = p(p + 3)/2.

E.g., for p = 5, parameters= 20; for p = 40 parameters= 860 (Compare to n.)

6. More appropriate models, such as for categorical or mixed variables, are harderto apply and do not usually perform better than this model (If you’re going touse a difficult imputation method, you might as well use anapplication-specific method. Our goal is an easy-to-apply, generally applicable,method even if 2nd best.)

7. For social science survey data, which mostly contain ordinal scales, this is areasonable choice for imputation, even though it may not be a good choice foranalysis.

Gary King () Missing Data 19 / 42

Page 125: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. Difficult Statistically: number of parameters increases quickly in the number ofvariables (p, columns of D):

parameters = parameters(µ) + parameters(Σ)

= p + p(p + 1)/2 = p(p + 3)/2.

E.g., for p = 5, parameters= 20; for p = 40 parameters= 860 (Compare to n.)

6. More appropriate models, such as for categorical or mixed variables, are harderto apply and do not usually perform better than this model (If you’re going touse a difficult imputation method, you might as well use anapplication-specific method. Our goal is an easy-to-apply, generally applicable,method even if 2nd best.)

7. For social science survey data, which mostly contain ordinal scales, this is areasonable choice for imputation, even though it may not be a good choice foranalysis.

Gary King () Missing Data 19 / 42

Page 126: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. Difficult Statistically: number of parameters increases quickly in the number ofvariables (p, columns of D):

parameters = parameters(µ) + parameters(Σ)

= p + p(p + 1)/2 = p(p + 3)/2.

E.g., for p = 5, parameters= 20; for p = 40 parameters= 860 (Compare to n.)

6. More appropriate models, such as for categorical or mixed variables, are harderto apply and do not usually perform better than this model (If you’re going touse a difficult imputation method, you might as well use anapplication-specific method. Our goal is an easy-to-apply, generally applicable,method even if 2nd best.)

7. For social science survey data, which mostly contain ordinal scales, this is areasonable choice for imputation, even though it may not be a good choice foranalysis.

Gary King () Missing Data 19 / 42

Page 127: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. Difficult Statistically: number of parameters increases quickly in the number ofvariables (p, columns of D):

parameters = parameters(µ) + parameters(Σ)

= p + p(p + 1)/2 = p(p + 3)/2.

E.g., for p = 5, parameters= 20; for p = 40 parameters= 860 (Compare to n.)

6. More appropriate models, such as for categorical or mixed variables, are harderto apply and do not usually perform better than this model (If you’re going touse a difficult imputation method, you might as well use anapplication-specific method. Our goal is an easy-to-apply, generally applicable,method even if 2nd best.)

7. For social science survey data, which mostly contain ordinal scales, this is areasonable choice for imputation, even though it may not be a good choice foranalysis.

Gary King () Missing Data 19 / 42

Page 128: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]

4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 129: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }

2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]

4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 130: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]

4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 131: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 132: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 133: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]

4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 134: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 135: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 136: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 137: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 138: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

How to create imputations from this model

1. E.g., suppose D has only 2 variables, D = {X ,Y }2. X is fully observed, Y has some missingness.

3. Then D = {Y ,X} is bivariate normal:

D ∼ N(D|µ,Σ)

= N

[(YX

) ∣∣∣∣ (µy

µx

),

(σy σxy

σxy σx

)]4. Conditionals of bivariate normals are normal:

Y |X ∼ N (y |E (Y |X ),V (Y |X ))

E (Y |X ) = µy + β(X − µx) (a linear regression!)

β = σxy/σx

V (Y |X ) = σy − σ2xy/σx

Gary King () Missing Data 20 / 42

Page 139: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:

(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 140: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 141: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 142: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 143: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density

(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 144: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically

(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 145: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 146: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 147: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 148: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. To create imputations:(a) Estimate the posterior density of µ and Σ

i. Could do the usual: maximize likelihood, assume CLT applies, and drawfrom the normal approximation. (Hard to do, and CLT isn’t a goodasymptotic approximation due to large number of parameters.)

ii. We will improve on this shortly

(b) Draw µ and Σ from their posterior density(c) Compute simulations of E (Y |X ) and V (Y |X ) deterministically(d) Draw a simulation of the missing Y from the conditional normal

6. In this simple example (X fully observed), this is equivalent tosimulating from a linear regression of Y on X ,

yi = xi β + εi ,

with estimation and fundamental uncertainty

Gary King () Missing Data 21 / 42

Page 149: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 150: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 151: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 152: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim

(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 153: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 154: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.

ii. If β were known, imputation would be easy: draw ε from normal, and usey = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 155: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 156: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 157: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, β

ii. Estimate parameters β (by LS) on data filled in with current imputationsfor Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 158: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 159: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Computational Algorithms

1. Optim with hundreds of parameters would work but slowly.

2. EM (expectation maximization): another algorithm for finding themaximum.

(a) Much faster than optim(b) Intuition:

i. Without missingness, estimating β would be easy: run LS.ii. If β were known, imputation would be easy: draw ε from normal, and use

y = xβ + ε.

(c) EM works by iterating between

i. Impute Y with x β, given current estimates, βii. Estimate parameters β (by LS) on data filled in with current imputations

for Y

(d) Can easily do imputation via x β + ε, but SEs too small due to noestimation uncertainty (β 6= β); i.e., we need to draw β from its posteriorfirst

Gary King () Missing Data 22 / 42

Page 160: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

3. EMs: EM for maximization and then simulation (as usual) fromasymptotic normal posterior

(a) EMs adds estimation uncertainty to EM imputations by drawing β fromits asymptotic normal distribution, N(β, V (β))

(b) The central limit theorem guarantees that this works as n →∞, but forreal sample sizes it may be inadequate.

4. EMis: EM with simulation via importance resampling (probabilisticrejection sampling to draw from the posterior)

Keep θ1 with probability ∝ a/b (the importance ratio). Keep θ2 withprobability 1.

Gary King () Missing Data 23 / 42

Page 161: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

3. EMs: EM for maximization and then simulation (as usual) fromasymptotic normal posterior

(a) EMs adds estimation uncertainty to EM imputations by drawing β fromits asymptotic normal distribution, N(β, V (β))

(b) The central limit theorem guarantees that this works as n →∞, but forreal sample sizes it may be inadequate.

4. EMis: EM with simulation via importance resampling (probabilisticrejection sampling to draw from the posterior)

Keep θ1 with probability ∝ a/b (the importance ratio). Keep θ2 withprobability 1.

Gary King () Missing Data 23 / 42

Page 162: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

3. EMs: EM for maximization and then simulation (as usual) fromasymptotic normal posterior

(a) EMs adds estimation uncertainty to EM imputations by drawing β fromits asymptotic normal distribution, N(β, V (β))

(b) The central limit theorem guarantees that this works as n →∞, but forreal sample sizes it may be inadequate.

4. EMis: EM with simulation via importance resampling (probabilisticrejection sampling to draw from the posterior)

Keep θ1 with probability ∝ a/b (the importance ratio). Keep θ2 withprobability 1.

Gary King () Missing Data 23 / 42

Page 163: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

3. EMs: EM for maximization and then simulation (as usual) fromasymptotic normal posterior

(a) EMs adds estimation uncertainty to EM imputations by drawing β fromits asymptotic normal distribution, N(β, V (β))

(b) The central limit theorem guarantees that this works as n →∞, but forreal sample sizes it may be inadequate.

4. EMis: EM with simulation via importance resampling (probabilisticrejection sampling to draw from the posterior)

Keep θ1 with probability ∝ a/b (the importance ratio). Keep θ2 withprobability 1.

Gary King () Missing Data 23 / 42

Page 164: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

3. EMs: EM for maximization and then simulation (as usual) fromasymptotic normal posterior

(a) EMs adds estimation uncertainty to EM imputations by drawing β fromits asymptotic normal distribution, N(β, V (β))

(b) The central limit theorem guarantees that this works as n →∞, but forreal sample sizes it may be inadequate.

4. EMis: EM with simulation via importance resampling (probabilisticrejection sampling to draw from the posterior)

Keep θ1 with probability ∝ a/b (the importance ratio). Keep θ2 withprobability 1.

Gary King () Missing Data 23 / 42

Page 165: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

3. EMs: EM for maximization and then simulation (as usual) fromasymptotic normal posterior

(a) EMs adds estimation uncertainty to EM imputations by drawing β fromits asymptotic normal distribution, N(β, V (β))

(b) The central limit theorem guarantees that this works as n →∞, but forreal sample sizes it may be inadequate.

4. EMis: EM with simulation via importance resampling (probabilisticrejection sampling to draw from the posterior)

Keep θ1 with probability ∝ a/b (the importance ratio). Keep θ2 withprobability 1.

Gary King () Missing Data 23 / 42

Page 166: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. IP: Imputation-posterior

(a) A stochastic version of EM(b) The algorithm. For θ = {µ,Σ},

i. I-Step: draw Dmis from P(Dmis |Dobs , θ) (i.e., y = x β + ε) given currentdraw of θ.

ii. P-Step: draw θ from P(θ|Dobs , Dmis), given current imputation Dmis

(c) Example of MCMC (Markov-Chain Monte Carlo) methods, one of themost important developments in statistics in the 1990s

(d) MCMC enabled statisticians to do things they never previously dreamedpossible, but it requires considerable expertise to use and so didn’t helpothers do these things. (Few MCMC routines have appeared in cannedpackages.)

(e) Hard to know when its over. Convergence is asymptotic in iterations.Plot traces are hard to interpret, and the worst-converging parametercontrols the system.

Gary King () Missing Data 24 / 42

Page 167: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. IP: Imputation-posterior

(a) A stochastic version of EM

(b) The algorithm. For θ = {µ,Σ},

i. I-Step: draw Dmis from P(Dmis |Dobs , θ) (i.e., y = x β + ε) given currentdraw of θ.

ii. P-Step: draw θ from P(θ|Dobs , Dmis), given current imputation Dmis

(c) Example of MCMC (Markov-Chain Monte Carlo) methods, one of themost important developments in statistics in the 1990s

(d) MCMC enabled statisticians to do things they never previously dreamedpossible, but it requires considerable expertise to use and so didn’t helpothers do these things. (Few MCMC routines have appeared in cannedpackages.)

(e) Hard to know when its over. Convergence is asymptotic in iterations.Plot traces are hard to interpret, and the worst-converging parametercontrols the system.

Gary King () Missing Data 24 / 42

Page 168: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. IP: Imputation-posterior

(a) A stochastic version of EM(b) The algorithm. For θ = {µ,Σ},

i. I-Step: draw Dmis from P(Dmis |Dobs , θ) (i.e., y = x β + ε) given currentdraw of θ.

ii. P-Step: draw θ from P(θ|Dobs , Dmis), given current imputation Dmis

(c) Example of MCMC (Markov-Chain Monte Carlo) methods, one of themost important developments in statistics in the 1990s

(d) MCMC enabled statisticians to do things they never previously dreamedpossible, but it requires considerable expertise to use and so didn’t helpothers do these things. (Few MCMC routines have appeared in cannedpackages.)

(e) Hard to know when its over. Convergence is asymptotic in iterations.Plot traces are hard to interpret, and the worst-converging parametercontrols the system.

Gary King () Missing Data 24 / 42

Page 169: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. IP: Imputation-posterior

(a) A stochastic version of EM(b) The algorithm. For θ = {µ,Σ},

i. I-Step: draw Dmis from P(Dmis |Dobs , θ) (i.e., y = x β + ε) given currentdraw of θ.

ii. P-Step: draw θ from P(θ|Dobs , Dmis), given current imputation Dmis

(c) Example of MCMC (Markov-Chain Monte Carlo) methods, one of themost important developments in statistics in the 1990s

(d) MCMC enabled statisticians to do things they never previously dreamedpossible, but it requires considerable expertise to use and so didn’t helpothers do these things. (Few MCMC routines have appeared in cannedpackages.)

(e) Hard to know when its over. Convergence is asymptotic in iterations.Plot traces are hard to interpret, and the worst-converging parametercontrols the system.

Gary King () Missing Data 24 / 42

Page 170: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. IP: Imputation-posterior

(a) A stochastic version of EM(b) The algorithm. For θ = {µ,Σ},

i. I-Step: draw Dmis from P(Dmis |Dobs , θ) (i.e., y = x β + ε) given currentdraw of θ.

ii. P-Step: draw θ from P(θ|Dobs , Dmis), given current imputation Dmis

(c) Example of MCMC (Markov-Chain Monte Carlo) methods, one of themost important developments in statistics in the 1990s

(d) MCMC enabled statisticians to do things they never previously dreamedpossible, but it requires considerable expertise to use and so didn’t helpothers do these things. (Few MCMC routines have appeared in cannedpackages.)

(e) Hard to know when its over. Convergence is asymptotic in iterations.Plot traces are hard to interpret, and the worst-converging parametercontrols the system.

Gary King () Missing Data 24 / 42

Page 171: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. IP: Imputation-posterior

(a) A stochastic version of EM(b) The algorithm. For θ = {µ,Σ},

i. I-Step: draw Dmis from P(Dmis |Dobs , θ) (i.e., y = x β + ε) given currentdraw of θ.

ii. P-Step: draw θ from P(θ|Dobs , Dmis), given current imputation Dmis

(c) Example of MCMC (Markov-Chain Monte Carlo) methods, one of themost important developments in statistics in the 1990s

(d) MCMC enabled statisticians to do things they never previously dreamedpossible, but it requires considerable expertise to use and so didn’t helpothers do these things. (Few MCMC routines have appeared in cannedpackages.)

(e) Hard to know when its over. Convergence is asymptotic in iterations.Plot traces are hard to interpret, and the worst-converging parametercontrols the system.

Gary King () Missing Data 24 / 42

Page 172: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. IP: Imputation-posterior

(a) A stochastic version of EM(b) The algorithm. For θ = {µ,Σ},

i. I-Step: draw Dmis from P(Dmis |Dobs , θ) (i.e., y = x β + ε) given currentdraw of θ.

ii. P-Step: draw θ from P(θ|Dobs , Dmis), given current imputation Dmis

(c) Example of MCMC (Markov-Chain Monte Carlo) methods, one of themost important developments in statistics in the 1990s

(d) MCMC enabled statisticians to do things they never previously dreamedpossible, but it requires considerable expertise to use and so didn’t helpothers do these things. (Few MCMC routines have appeared in cannedpackages.)

(e) Hard to know when its over. Convergence is asymptotic in iterations.Plot traces are hard to interpret, and the worst-converging parametercontrols the system.

Gary King () Missing Data 24 / 42

Page 173: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

5. IP: Imputation-posterior

(a) A stochastic version of EM(b) The algorithm. For θ = {µ,Σ},

i. I-Step: draw Dmis from P(Dmis |Dobs , θ) (i.e., y = x β + ε) given currentdraw of θ.

ii. P-Step: draw θ from P(θ|Dobs , Dmis), given current imputation Dmis

(c) Example of MCMC (Markov-Chain Monte Carlo) methods, one of themost important developments in statistics in the 1990s

(d) MCMC enabled statisticians to do things they never previously dreamedpossible, but it requires considerable expertise to use and so didn’t helpothers do these things. (Few MCMC routines have appeared in cannedpackages.)

(e) Hard to know when its over. Convergence is asymptotic in iterations.Plot traces are hard to interpret, and the worst-converging parametercontrols the system.

Gary King () Missing Data 24 / 42

Page 174: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

MCMC Convergence Issues

Gary King () Missing Data 25 / 42

Page 175: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

One more algorithm: EMB: EM With Bootstrap

Randomly draw n obs (with replacement) from the data

Use EM to estimate β and Σ in each (for estimation uncertainty)

Impute Dmis from each from the model (for fundamental uncertainty)

Lightning fast; works with very large data sets

Basis for Amelia II

Gary King () Missing Data 26 / 42

Page 176: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

One more algorithm: EMB: EM With Bootstrap

Randomly draw n obs (with replacement) from the data

Use EM to estimate β and Σ in each (for estimation uncertainty)

Impute Dmis from each from the model (for fundamental uncertainty)

Lightning fast; works with very large data sets

Basis for Amelia II

Gary King () Missing Data 26 / 42

Page 177: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

One more algorithm: EMB: EM With Bootstrap

Randomly draw n obs (with replacement) from the data

Use EM to estimate β and Σ in each (for estimation uncertainty)

Impute Dmis from each from the model (for fundamental uncertainty)

Lightning fast; works with very large data sets

Basis for Amelia II

Gary King () Missing Data 26 / 42

Page 178: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

One more algorithm: EMB: EM With Bootstrap

Randomly draw n obs (with replacement) from the data

Use EM to estimate β and Σ in each (for estimation uncertainty)

Impute Dmis from each from the model (for fundamental uncertainty)

Lightning fast; works with very large data sets

Basis for Amelia II

Gary King () Missing Data 26 / 42

Page 179: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

One more algorithm: EMB: EM With Bootstrap

Randomly draw n obs (with replacement) from the data

Use EM to estimate β and Σ in each (for estimation uncertainty)

Impute Dmis from each from the model (for fundamental uncertainty)

Lightning fast; works with very large data sets

Basis for Amelia II

Gary King () Missing Data 26 / 42

Page 180: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

One more algorithm: EMB: EM With Bootstrap

Randomly draw n obs (with replacement) from the data

Use EM to estimate β and Σ in each (for estimation uncertainty)

Impute Dmis from each from the model (for fundamental uncertainty)

Lightning fast; works with very large data sets

Basis for Amelia II

Gary King () Missing Data 26 / 42

Page 181: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation: Amelia Style

incomplete data?? ?

??

?? ?

??

??

?

bootstrap

bootstrapped data

imputed datasetsEM

analysis

combination

final results

Gary King () Missing Data 27 / 42

Page 182: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation: Amelia Style

incomplete data?? ?

??

?? ?

??

??

?

bootstrap

bootstrapped data

imputed datasetsEM

analysis

combination

final results

Gary King () Missing Data 27 / 42

Page 183: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation: Amelia Style

incomplete data?? ?

??

?? ?

??

??

?

bootstrap

bootstrapped data

imputed datasetsEM

analysis

combination

final results

Gary King () Missing Data 27 / 42

Page 184: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation: Amelia Style

incomplete data?? ?

??

?? ?

??

??

?

bootstrap

bootstrapped data

imputed datasetsEM

analysis

combination

final results

Gary King () Missing Data 27 / 42

Page 185: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation: Amelia Style

incomplete data?? ?

??

?? ?

??

??

?

bootstrap

bootstrapped data

imputed datasetsEM

analysis

combination

final results

Gary King () Missing Data 27 / 42

Page 186: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Multiple Imputation: Amelia Style

incomplete data?? ?

??

?? ?

??

??

?

bootstrap

bootstrapped data

imputed datasetsEM

analysis

combination

final results

Gary King () Missing Data 27 / 42

Page 187: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Comparisons of Posterior Density Approximations

−0.4 −0.2 0.0 0.2 0.4

β0

−0.4 −0.2 0.0 0.2 0.4

β0

−0.4 −0.2 0.0 0.2 0.4

β0

−0.4 −0.2 0.0 0.2 0.4

β0

−0.4 −0.2 0.0 0.2 0.4

β1

Den

sity

−0.4 −0.2 0.0 0.2 0.4

β1

Den

sity

−0.4 −0.2 0.0 0.2 0.4

β1

Den

sity

−0.4 −0.2 0.0 0.2 0.4

β1

Den

sity

−0.4 −0.2 0.0 0.2 0.4

β2

−0.4 −0.2 0.0 0.2 0.4

β2

−0.4 −0.2 0.0 0.2 0.4

β2

−0.4 −0.2 0.0 0.2 0.4

β2

EMBIP − EMisComplete DataList−wise Del.

Gary King () Missing Data 28 / 42

Page 188: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do

Inference is learning about facts we don’t have with facts we have; weassume the 2 are related!

Imputation and analysis are estimated separately robustnessbecause imputation affects only missing observations. Highmissingness reduces the property.

Include at least as much information in the imputation model as inthe analysis model: all vars in analysis model; others that would helppredict (e.g., All measures of a variable, post-treatment variables)

Fit imputation model distributional assumptions by transformation tounbounded scales:

√counts, ln(p/(1− p)), ln(money), etc.

Code ordinal variables as close to interval as possible.

Gary King () Missing Data 29 / 42

Page 189: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do

Inference is learning about facts we don’t have with facts we have; weassume the 2 are related!

Imputation and analysis are estimated separately robustnessbecause imputation affects only missing observations. Highmissingness reduces the property.

Include at least as much information in the imputation model as inthe analysis model: all vars in analysis model; others that would helppredict (e.g., All measures of a variable, post-treatment variables)

Fit imputation model distributional assumptions by transformation tounbounded scales:

√counts, ln(p/(1− p)), ln(money), etc.

Code ordinal variables as close to interval as possible.

Gary King () Missing Data 29 / 42

Page 190: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do

Inference is learning about facts we don’t have with facts we have; weassume the 2 are related!

Imputation and analysis are estimated separately robustnessbecause imputation affects only missing observations. Highmissingness reduces the property.

Include at least as much information in the imputation model as inthe analysis model: all vars in analysis model; others that would helppredict (e.g., All measures of a variable, post-treatment variables)

Fit imputation model distributional assumptions by transformation tounbounded scales:

√counts, ln(p/(1− p)), ln(money), etc.

Code ordinal variables as close to interval as possible.

Gary King () Missing Data 29 / 42

Page 191: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do

Inference is learning about facts we don’t have with facts we have; weassume the 2 are related!

Imputation and analysis are estimated separately robustnessbecause imputation affects only missing observations. Highmissingness reduces the property.

Include at least as much information in the imputation model as inthe analysis model: all vars in analysis model; others that would helppredict (e.g., All measures of a variable, post-treatment variables)

Fit imputation model distributional assumptions by transformation tounbounded scales:

√counts, ln(p/(1− p)), ln(money), etc.

Code ordinal variables as close to interval as possible.

Gary King () Missing Data 29 / 42

Page 192: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do

Inference is learning about facts we don’t have with facts we have; weassume the 2 are related!

Imputation and analysis are estimated separately robustnessbecause imputation affects only missing observations. Highmissingness reduces the property.

Include at least as much information in the imputation model as inthe analysis model: all vars in analysis model; others that would helppredict (e.g., All measures of a variable, post-treatment variables)

Fit imputation model distributional assumptions by transformation tounbounded scales:

√counts, ln(p/(1− p)), ln(money), etc.

Code ordinal variables as close to interval as possible.

Gary King () Missing Data 29 / 42

Page 193: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do

Inference is learning about facts we don’t have with facts we have; weassume the 2 are related!

Imputation and analysis are estimated separately robustnessbecause imputation affects only missing observations. Highmissingness reduces the property.

Include at least as much information in the imputation model as inthe analysis model: all vars in analysis model; others that would helppredict (e.g., All measures of a variable, post-treatment variables)

Fit imputation model distributional assumptions by transformation tounbounded scales:

√counts, ln(p/(1− p)), ln(money), etc.

Code ordinal variables as close to interval as possible.

Gary King () Missing Data 29 / 42

Page 194: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do (continued)

Represent severe nonlinear relationships in the imputation model withtransformations or added quadratic terms.

If imputation model has as much information as the analysis model,but the specification (such as the functional form) differs, CIs areconservative (e.g., ≥ 95% CIs)

When imputation model includes more information than analysismodel, it can be more efficient than the “optimal” application-specificmodel (known as “super-efficiency”)

Bad intuitions

If X is randomly imputed why no attenuation (the usual consequenceof random measurement error in an explanatory variable)?If X is imputed with information from Y , why no endogeneity?Answer to both: the draws are from the joint posterior and put backinto the data. Nothing is being changed.

Gary King () Missing Data 30 / 42

Page 195: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do (continued)

Represent severe nonlinear relationships in the imputation model withtransformations or added quadratic terms.

If imputation model has as much information as the analysis model,but the specification (such as the functional form) differs, CIs areconservative (e.g., ≥ 95% CIs)

When imputation model includes more information than analysismodel, it can be more efficient than the “optimal” application-specificmodel (known as “super-efficiency”)

Bad intuitions

If X is randomly imputed why no attenuation (the usual consequenceof random measurement error in an explanatory variable)?If X is imputed with information from Y , why no endogeneity?Answer to both: the draws are from the joint posterior and put backinto the data. Nothing is being changed.

Gary King () Missing Data 30 / 42

Page 196: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do (continued)

Represent severe nonlinear relationships in the imputation model withtransformations or added quadratic terms.

If imputation model has as much information as the analysis model,but the specification (such as the functional form) differs, CIs areconservative (e.g., ≥ 95% CIs)

When imputation model includes more information than analysismodel, it can be more efficient than the “optimal” application-specificmodel (known as “super-efficiency”)

Bad intuitions

If X is randomly imputed why no attenuation (the usual consequenceof random measurement error in an explanatory variable)?If X is imputed with information from Y , why no endogeneity?Answer to both: the draws are from the joint posterior and put backinto the data. Nothing is being changed.

Gary King () Missing Data 30 / 42

Page 197: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do (continued)

Represent severe nonlinear relationships in the imputation model withtransformations or added quadratic terms.

If imputation model has as much information as the analysis model,but the specification (such as the functional form) differs, CIs areconservative (e.g., ≥ 95% CIs)

When imputation model includes more information than analysismodel, it can be more efficient than the “optimal” application-specificmodel (known as “super-efficiency”)

Bad intuitions

If X is randomly imputed why no attenuation (the usual consequenceof random measurement error in an explanatory variable)?If X is imputed with information from Y , why no endogeneity?Answer to both: the draws are from the joint posterior and put backinto the data. Nothing is being changed.

Gary King () Missing Data 30 / 42

Page 198: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do (continued)

Represent severe nonlinear relationships in the imputation model withtransformations or added quadratic terms.

If imputation model has as much information as the analysis model,but the specification (such as the functional form) differs, CIs areconservative (e.g., ≥ 95% CIs)

When imputation model includes more information than analysismodel, it can be more efficient than the “optimal” application-specificmodel (known as “super-efficiency”)

Bad intuitions

If X is randomly imputed why no attenuation (the usual consequenceof random measurement error in an explanatory variable)?If X is imputed with information from Y , why no endogeneity?Answer to both: the draws are from the joint posterior and put backinto the data. Nothing is being changed.

Gary King () Missing Data 30 / 42

Page 199: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do (continued)

Represent severe nonlinear relationships in the imputation model withtransformations or added quadratic terms.

If imputation model has as much information as the analysis model,but the specification (such as the functional form) differs, CIs areconservative (e.g., ≥ 95% CIs)

When imputation model includes more information than analysismodel, it can be more efficient than the “optimal” application-specificmodel (known as “super-efficiency”)

Bad intuitions

If X is randomly imputed why no attenuation (the usual consequenceof random measurement error in an explanatory variable)?

If X is imputed with information from Y , why no endogeneity?Answer to both: the draws are from the joint posterior and put backinto the data. Nothing is being changed.

Gary King () Missing Data 30 / 42

Page 200: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do (continued)

Represent severe nonlinear relationships in the imputation model withtransformations or added quadratic terms.

If imputation model has as much information as the analysis model,but the specification (such as the functional form) differs, CIs areconservative (e.g., ≥ 95% CIs)

When imputation model includes more information than analysismodel, it can be more efficient than the “optimal” application-specificmodel (known as “super-efficiency”)

Bad intuitions

If X is randomly imputed why no attenuation (the usual consequenceof random measurement error in an explanatory variable)?If X is imputed with information from Y , why no endogeneity?

Answer to both: the draws are from the joint posterior and put backinto the data. Nothing is being changed.

Gary King () Missing Data 30 / 42

Page 201: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

What Can Go Wrong and What to Do (continued)

Represent severe nonlinear relationships in the imputation model withtransformations or added quadratic terms.

If imputation model has as much information as the analysis model,but the specification (such as the functional form) differs, CIs areconservative (e.g., ≥ 95% CIs)

When imputation model includes more information than analysismodel, it can be more efficient than the “optimal” application-specificmodel (known as “super-efficiency”)

Bad intuitions

If X is randomly imputed why no attenuation (the usual consequenceof random measurement error in an explanatory variable)?If X is imputed with information from Y , why no endogeneity?Answer to both: the draws are from the joint posterior and put backinto the data. Nothing is being changed.

Gary King () Missing Data 30 / 42

Page 202: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

The Best Case for Listwise Deletion

Listwise deletion is better than MI when all 4 hold:

1 The analysis model is conditional on X (like regression) andfunctional form is correct (so listwise deletion is consistent and thecharacteristic robustness of regression is not lost when applied to datawith slight measurement error, endogeneity, nonlinearity, etc.).

2 NI missingness in X and no external variables are available that couldbe used in an imputation stage to fix the problem.

3 Missingness in X is not a function of Y

4 The n left after listwise deletion is so large that the efficiency lossdoes not counter balance the biases induced by the other conditions.

I.e., you don’t trust data to impute Dmis but still trust it to analyze Dobs

Gary King () Missing Data 31 / 42

Page 203: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

The Best Case for Listwise Deletion

Listwise deletion is better than MI when all 4 hold:

1 The analysis model is conditional on X (like regression) andfunctional form is correct (so listwise deletion is consistent and thecharacteristic robustness of regression is not lost when applied to datawith slight measurement error, endogeneity, nonlinearity, etc.).

2 NI missingness in X and no external variables are available that couldbe used in an imputation stage to fix the problem.

3 Missingness in X is not a function of Y

4 The n left after listwise deletion is so large that the efficiency lossdoes not counter balance the biases induced by the other conditions.

I.e., you don’t trust data to impute Dmis but still trust it to analyze Dobs

Gary King () Missing Data 31 / 42

Page 204: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

The Best Case for Listwise Deletion

Listwise deletion is better than MI when all 4 hold:

1 The analysis model is conditional on X (like regression) andfunctional form is correct (so listwise deletion is consistent and thecharacteristic robustness of regression is not lost when applied to datawith slight measurement error, endogeneity, nonlinearity, etc.).

2 NI missingness in X and no external variables are available that couldbe used in an imputation stage to fix the problem.

3 Missingness in X is not a function of Y

4 The n left after listwise deletion is so large that the efficiency lossdoes not counter balance the biases induced by the other conditions.

I.e., you don’t trust data to impute Dmis but still trust it to analyze Dobs

Gary King () Missing Data 31 / 42

Page 205: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

The Best Case for Listwise Deletion

Listwise deletion is better than MI when all 4 hold:

1 The analysis model is conditional on X (like regression) andfunctional form is correct (so listwise deletion is consistent and thecharacteristic robustness of regression is not lost when applied to datawith slight measurement error, endogeneity, nonlinearity, etc.).

2 NI missingness in X and no external variables are available that couldbe used in an imputation stage to fix the problem.

3 Missingness in X is not a function of Y

4 The n left after listwise deletion is so large that the efficiency lossdoes not counter balance the biases induced by the other conditions.

I.e., you don’t trust data to impute Dmis but still trust it to analyze Dobs

Gary King () Missing Data 31 / 42

Page 206: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

The Best Case for Listwise Deletion

Listwise deletion is better than MI when all 4 hold:

1 The analysis model is conditional on X (like regression) andfunctional form is correct (so listwise deletion is consistent and thecharacteristic robustness of regression is not lost when applied to datawith slight measurement error, endogeneity, nonlinearity, etc.).

2 NI missingness in X and no external variables are available that couldbe used in an imputation stage to fix the problem.

3 Missingness in X is not a function of Y

4 The n left after listwise deletion is so large that the efficiency lossdoes not counter balance the biases induced by the other conditions.

I.e., you don’t trust data to impute Dmis but still trust it to analyze Dobs

Gary King () Missing Data 31 / 42

Page 207: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

The Best Case for Listwise Deletion

Listwise deletion is better than MI when all 4 hold:

1 The analysis model is conditional on X (like regression) andfunctional form is correct (so listwise deletion is consistent and thecharacteristic robustness of regression is not lost when applied to datawith slight measurement error, endogeneity, nonlinearity, etc.).

2 NI missingness in X and no external variables are available that couldbe used in an imputation stage to fix the problem.

3 Missingness in X is not a function of Y

4 The n left after listwise deletion is so large that the efficiency lossdoes not counter balance the biases induced by the other conditions.

I.e., you don’t trust data to impute Dmis but still trust it to analyze Dobs

Gary King () Missing Data 31 / 42

Page 208: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

The Best Case for Listwise Deletion

Listwise deletion is better than MI when all 4 hold:

1 The analysis model is conditional on X (like regression) andfunctional form is correct (so listwise deletion is consistent and thecharacteristic robustness of regression is not lost when applied to datawith slight measurement error, endogeneity, nonlinearity, etc.).

2 NI missingness in X and no external variables are available that couldbe used in an imputation stage to fix the problem.

3 Missingness in X is not a function of Y

4 The n left after listwise deletion is so large that the efficiency lossdoes not counter balance the biases induced by the other conditions.

I.e., you don’t trust data to impute Dmis but still trust it to analyze Dobs

Gary King () Missing Data 31 / 42

Page 209: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Root Mean Square Error Comparisons

•Listwise IP EMis Complete

.05

.1

.15

.2

.25

Roo

t Mea

n Sq

uare

Err

or

MCAR-1MCAR-2

MAR-1

MAR-2

ΝΙ

Each point is RMSE averaged over two regression coefficients in each of100 simulated data sets. (IP and EMis have the same RMSE, which islower than listwise deletion and higher than the complete data; its thesame for EMB.)

Gary King () Missing Data 32 / 42

Page 210: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 211: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 212: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 213: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 214: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 215: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 216: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 217: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 218: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Detailed Example: Support for Perot

1. Research question: were voters who did not share in the economicrecovery more likely to support Perot in the 1996 presidential election?

2. Analysis model: linear regression

3. Data: 1996 National Election Survey (n=1714)

4. Dependent variable: Perot Feeling Thermometer

5. Key explanatory variables: retrospective and propsective evaluations ofnational economic performance and personal financial circumstances

6. Control variables: age, education, family income, race, gender, unionmembership, ideology

7. Extra variables included in the imputation model to help prediction:attention to the campaign; feeling thermometers for Clinton, Dole,Democrats, Republicans; PID; Partisan moderation; vote intention;martial status; Hispanic; party contact, number of organizations R is apaying member of, and active member of.

8. Include nonlinear terms: age2

Gary King () Missing Data 33 / 42

Page 219: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE(b) The MI estimator is 4 times larger(c) Based on listwise deletion, there is no evidence that perception of poor

economic performance is related to support for Perot(d) Based on the MI estimator, R’s with negative retrospective economic

evaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 220: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE(b) The MI estimator is 4 times larger(c) Based on listwise deletion, there is no evidence that perception of poor

economic performance is related to support for Perot(d) Based on the MI estimator, R’s with negative retrospective economic

evaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 221: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE(b) The MI estimator is 4 times larger(c) Based on listwise deletion, there is no evidence that perception of poor

economic performance is related to support for Perot(d) Based on the MI estimator, R’s with negative retrospective economic

evaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 222: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE(b) The MI estimator is 4 times larger(c) Based on listwise deletion, there is no evidence that perception of poor

economic performance is related to support for Perot(d) Based on the MI estimator, R’s with negative retrospective economic

evaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 223: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE(b) The MI estimator is 4 times larger(c) Based on listwise deletion, there is no evidence that perception of poor

economic performance is related to support for Perot(d) Based on the MI estimator, R’s with negative retrospective economic

evaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 224: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE

(b) The MI estimator is 4 times larger(c) Based on listwise deletion, there is no evidence that perception of poor

economic performance is related to support for Perot(d) Based on the MI estimator, R’s with negative retrospective economic

evaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 225: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE(b) The MI estimator is 4 times larger

(c) Based on listwise deletion, there is no evidence that perception of pooreconomic performance is related to support for Perot

(d) Based on the MI estimator, R’s with negative retrospective economicevaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 226: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE(b) The MI estimator is 4 times larger(c) Based on listwise deletion, there is no evidence that perception of poor

economic performance is related to support for Perot

(d) Based on the MI estimator, R’s with negative retrospective economicevaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 227: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

9. Transform variables to more closely approximate distributionalassumptions: logged number of organizations participating in.

10. Run Amelia to generate 5 imputed data sets.

11. Key substantive result is the coefficient on retrospective economicevaluations (ranges from 1 to 5):

Listwise deletion .43(.90)

Multiple imputation 1.65(.72)

so (5− 1)× 1.65 = 6.6, which is also a percentage of the range of Y .

(a) MI estimator is more efficient, with a smaller SE(b) The MI estimator is 4 times larger(c) Based on listwise deletion, there is no evidence that perception of poor

economic performance is related to support for Perot(d) Based on the MI estimator, R’s with negative retrospective economic

evaluations are more likely to have favorable views of Perot.

Gary King () Missing Data 34 / 42

Page 228: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

MI in Time Series Cross-Section Data

●●

●●

●●

●●

●●

●●●●●●

75 80 85 90 95 100

1600

2200

2800

Cameroon

year

gdp

●●

●●●●

●●●

●●●●●

●●●

●●●

●●

75 80 85 90 95 100

1200

1600

2000

Rep.Congo

year

gdp ●●●

●●

●●

●●●

●●●●●

●●●●

●●

●●●●

●●

75 80 85 90 95 100

2000

2400

2800

Cote d'Ivoire

year

gdp

●●

●●●

●●●●

●●●●

●●●

●●

●●●

75 80 85 90 95 100

1100

1300

1500

Ghana

year

gdp

●●●●

●●●

●●●

●●

●●●●●●

●●

●●●

75 80 85 90 95 100

800

1200

1800

Mozambique

year

gdp

●●

●●

●●

●●

●●

●●●●●

●●

●●●●

75 80 85 90 95 100

800

1000

1300

Zambia

year

gdp

Include: (1) fixed effects, (2) time trends, and (3) priors for cellsRead: James Honaker and Gary King, ”What to do About Missing Valuesin Time Series Cross-Section Data,”http://gking.harvard.edu/files/abs/pr-abs.shtml

Gary King () Missing Data 35 / 42

Page 229: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

MI in Time Series Cross-Section Data

●●

●●

●●

●●

●●

●●●●●●

75 80 85 90 95 100

1600

2200

2800

Cameroon

year

gdp

●●

●●●●

●●●

●●●●●

●●●

●●●

●●

75 80 85 90 95 100

1200

1600

2000

Rep.Congo

year

gdp ●●●

●●

●●

●●●

●●●●●

●●●●

●●

●●●●

●●

75 80 85 90 95 100

2000

2400

2800

Cote d'Ivoire

year

gdp

●●

●●●

●●●●

●●●●

●●●

●●

●●●

75 80 85 90 95 100

1100

1300

1500

Ghana

year

gdp

●●●●

●●●

●●●

●●

●●●●●●

●●

●●●

75 80 85 90 95 100

800

1200

1800

Mozambique

year

gdp

●●

●●

●●

●●

●●

●●●●●

●●

●●●●

75 80 85 90 95 10080

010

0013

00

Zambia

year

gdp

Include: (1) fixed effects, (2) time trends, and (3) priors for cellsRead: James Honaker and Gary King, ”What to do About Missing Valuesin Time Series Cross-Section Data,”http://gking.harvard.edu/files/abs/pr-abs.shtml

Gary King () Missing Data 35 / 42

Page 230: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

MI in Time Series Cross-Section Data

●●

●●

●●

●●

●●

●●●●●●

75 80 85 90 95 100

1600

2200

2800

Cameroon

year

gdp

●●

●●●●

●●●

●●●●●

●●●

●●●

●●

75 80 85 90 95 100

1200

1600

2000

Rep.Congo

year

gdp ●●●

●●

●●

●●●

●●●●●

●●●●

●●

●●●●

●●

75 80 85 90 95 100

2000

2400

2800

Cote d'Ivoire

year

gdp

●●

●●●

●●●●

●●●●

●●●

●●

●●●

75 80 85 90 95 100

1100

1300

1500

Ghana

year

gdp

●●●●

●●●

●●●

●●

●●●●●●

●●

●●●

75 80 85 90 95 100

800

1200

1800

Mozambique

year

gdp

●●

●●

●●

●●

●●

●●●●●

●●

●●●●

75 80 85 90 95 10080

010

0013

00

Zambia

year

gdp

Include: (1) fixed effects, (2) time trends, and (3) priors for cells

Read: James Honaker and Gary King, ”What to do About Missing Valuesin Time Series Cross-Section Data,”http://gking.harvard.edu/files/abs/pr-abs.shtml

Gary King () Missing Data 35 / 42

Page 231: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

MI in Time Series Cross-Section Data

●●

●●

●●

●●

●●

●●●●●●

75 80 85 90 95 100

1600

2200

2800

Cameroon

year

gdp

●●

●●●●

●●●

●●●●●

●●●

●●●

●●

75 80 85 90 95 100

1200

1600

2000

Rep.Congo

year

gdp ●●●

●●

●●

●●●

●●●●●

●●●●

●●

●●●●

●●

75 80 85 90 95 100

2000

2400

2800

Cote d'Ivoire

year

gdp

●●

●●●

●●●●

●●●●

●●●

●●

●●●

75 80 85 90 95 100

1100

1300

1500

Ghana

year

gdp

●●●●

●●●

●●●

●●

●●●●●●

●●

●●●

75 80 85 90 95 100

800

1200

1800

Mozambique

year

gdp

●●

●●

●●

●●

●●

●●●●●

●●

●●●●

75 80 85 90 95 10080

010

0013

00

Zambia

year

gdp

Include: (1) fixed effects, (2) time trends, and (3) priors for cellsRead: James Honaker and Gary King, ”What to do About Missing Valuesin Time Series Cross-Section Data,”http://gking.harvard.edu/files/abs/pr-abs.shtml

Gary King () Missing Data 35 / 42

Page 232: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Imputation one Observation at a time

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

Cameroon

matrix(allyears[j] − epsilon, 2, 1)

Cameroon

matrix(allyears[j], 2, 1)

Cameroon

matrix(allyears[j] + epsilon, 2, 1)

75 80 85 90 95

010

0020

0030

00

●●

●●

●●

●●

●●

●●●

●●●

Cameroon

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Rep.Congo

matrix(allyears[j], 2, 1)

GD

P

Rep.Congo

matrix(allyears[j] + epsilon, 2, 1)

GD

P

75 80 85 90 95

010

0020

0030

00

●●

●●●●

●●●

●●●●●

●●●

●●●

●●●

Rep.Congo

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] − epsilon, 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j], 2, 1)

GD

P

Cote d'Ivoire

matrix(allyears[j] + epsilon, 2, 1)

GD

P75 80 85 90 95

010

0020

0030

00

●●●

●●

●●

●●●

●●●●●

●●●●

●●

●●●●

●●

Cote d'Ivoire

GD

PGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhanaGhana

75 80 85 90 95

010

0020

0030

00

●●

●●

●●●●●

●●●●●●●●●●●●

●●●●●

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

Mozambique

GD

P

75 80 85 90 95

010

0020

0030

00

●●

●●

●●●

●●●

●●

●●●●●●

●●

●●●

GD

P

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

ZambiaG

DP

75 80 85 90 95

010

0020

0030

00

●●●●

●●●●

●●

●●●●●●●●●●

●●

●●●●●

GD

P

Circles=true GDP; green=no time trends; blue=polynomials; red=LOESS

Gary King () Missing Data 36 / 42

Page 233: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Priors on Cell Values

Recall: p(θ|y) = p(θ)∏n

i=1 Li (θ|y)

take logs: ln p(θ|y) = ln[p(θ)] +∑n

i=1 ln Li (θ|y)

=⇒ Suppose prior is of the same form, p(θ|y) = Li (θ|y); then itsjust another observation: ln p(θ|y) =

∑n+1i=1 ln Li (θ|y)

Honaker and King show how to modify these “data augmentationpriors” to put priors on missing values rather than on µ and σ (or β).

Gary King () Missing Data 37 / 42

Page 234: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Priors on Cell Values

Recall: p(θ|y) = p(θ)∏n

i=1 Li (θ|y)

take logs: ln p(θ|y) = ln[p(θ)] +∑n

i=1 ln Li (θ|y)

=⇒ Suppose prior is of the same form, p(θ|y) = Li (θ|y); then itsjust another observation: ln p(θ|y) =

∑n+1i=1 ln Li (θ|y)

Honaker and King show how to modify these “data augmentationpriors” to put priors on missing values rather than on µ and σ (or β).

Gary King () Missing Data 37 / 42

Page 235: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Priors on Cell Values

Recall: p(θ|y) = p(θ)∏n

i=1 Li (θ|y)

take logs: ln p(θ|y) = ln[p(θ)] +∑n

i=1 ln Li (θ|y)

=⇒ Suppose prior is of the same form, p(θ|y) = Li (θ|y); then itsjust another observation: ln p(θ|y) =

∑n+1i=1 ln Li (θ|y)

Honaker and King show how to modify these “data augmentationpriors” to put priors on missing values rather than on µ and σ (or β).

Gary King () Missing Data 37 / 42

Page 236: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Priors on Cell Values

Recall: p(θ|y) = p(θ)∏n

i=1 Li (θ|y)

take logs: ln p(θ|y) = ln[p(θ)] +∑n

i=1 ln Li (θ|y)

=⇒ Suppose prior is of the same form, p(θ|y) = Li (θ|y); then itsjust another observation: ln p(θ|y) =

∑n+1i=1 ln Li (θ|y)

Honaker and King show how to modify these “data augmentationpriors” to put priors on missing values rather than on µ and σ (or β).

Gary King () Missing Data 37 / 42

Page 237: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Priors on Cell Values

Recall: p(θ|y) = p(θ)∏n

i=1 Li (θ|y)

take logs: ln p(θ|y) = ln[p(θ)] +∑n

i=1 ln Li (θ|y)

=⇒ Suppose prior is of the same form, p(θ|y) = Li (θ|y); then itsjust another observation: ln p(θ|y) =

∑n+1i=1 ln Li (θ|y)

Honaker and King show how to modify these “data augmentationpriors” to put priors on missing values rather than on µ and σ (or β).

Gary King () Missing Data 37 / 42

Page 238: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Posterior imputation: mean=0, prior mean=5

σ12 = 0.95

−6 −4 −2 0 2 4 6

λ = 0.001

−6 −4 −2 0 2 4 6

σ12 = 0.85

−6 −4 −2 0 2 4 6

λ = 0.1

−6 −4 −2 0 2 4 6

σ12 = 0.5

−6 −4 −2 0 2 4 6

λ = 1

−6 −4 −2 0 2 4 6

σ12 = 0

−6 −4 −2 0 2 4 6

λ = 10

−6 −4 −2 0 2 4 6

Distribution of imputed values for one observation with prior µ= 5

Left column: holds prior N(5, λ) constant (λ = 1) and changes predictivestrength (the covariance, σ12).

Right column: holds predictive strength of data constant (at σ12 = 0.5)and changes the strength of the prior (λ).

Gary King () Missing Data 38 / 42

Page 239: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Posterior imputation: mean=0, prior mean=5

σ12 = 0.95

−6 −4 −2 0 2 4 6

λ = 0.001

−6 −4 −2 0 2 4 6

σ12 = 0.85

−6 −4 −2 0 2 4 6

λ = 0.1

−6 −4 −2 0 2 4 6

σ12 = 0.5

−6 −4 −2 0 2 4 6

λ = 1

−6 −4 −2 0 2 4 6

σ12 = 0

−6 −4 −2 0 2 4 6

λ = 10

−6 −4 −2 0 2 4 6

Distribution of imputed values for one observation with prior µ= 5

Left column: holds prior N(5, λ) constant (λ = 1) and changes predictivestrength (the covariance, σ12).Right column: holds predictive strength of data constant (at σ12 = 0.5)and changes the strength of the prior (λ).

Gary King () Missing Data 38 / 42

Page 240: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Model Parameters Respond to Prior on a Cell Value−

0.15

−0.

10−

0.05

0.00

0.05

0.10

ln(λ)

µ 2

0.01 0.1 1 10 100

Prior: p(x12) = N(5, λ). The parameter approaches the theoretical limits(dashed lines), upper bound is what is generated when the missing value isfilled in with the expectation; lower bound is the parameter when themodel is estimated without priors. The overall movement is small.

Gary King () Missing Data 39 / 42

Page 241: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Replication of Baum and Lake; Imputation Model Fit

1975 1980 1985 1990 1995

810

1214

1618

Barbados

Year

Wel

fare

Per

cent

1975 1980 1985 1990 1995

810

1214

1618

Barbados

Year

Wel

fare

Per

cent

● ●●

●●

● ●

● ●

●●

● ●

● ●●

1975 1980 1985 1990 1995

810

1214

1618

Barbados

Year

Wel

fare

Per

cent

● ●

● ●●

●●

● ●

1975 1980 1985 1990 1995

810

1214

1618

Barbados

Year

Wel

fare

Per

cent

1975 1980 1985 1990 1995

1015

2025

Chile

Year

Wel

fare

Per

cent

1975 1980 1985 1990 1995

1015

2025

Chile

Year

Wel

fare

Per

cent

●●

● ●

● ●

●●

●●

1975 1980 1985 1990 1995

1015

2025

Chile

Year

Wel

fare

Per

cent

● ●

● ● ●● ●

1975 1980 1985 1990 1995

1015

2025

Chile

Year

Wel

fare

Per

cent

1975 1980 1985 1990 1995

02

46

810

12

Ghana

Year

Wel

fare

Per

cent

1975 1980 1985 1990 1995

02

46

810

12

Ghana

Year

Wel

fare

Per

cent

●●

●●

● ●

● ●●

1975 1980 1985 1990 1995

02

46

810

12

Ghana

Year

Wel

fare

Per

cent

●●

● ● ●● ●

●●

1975 1980 1985 1990 1995

02

46

810

12

Ghana

Year

Wel

fare

Per

cent

1975 1980 1985 1990 1995

1012

1416

1820

22

Iceland

Year

Wel

fare

Per

cent

1975 1980 1985 1990 1995

1012

1416

1820

22

Iceland

Year

Wel

fare

Per

cent

●●

●●

●●

●●

1975 1980 1985 1990 1995

1012

1416

1820

22

Iceland

Year

Wel

fare

Per

cent

●●

● ●

●●

●●

1975 1980 1985 1990 1995

1012

1416

1820

22

Iceland

Year

Wel

fare

Per

cent

1960 1970 1980 1990 2000

020

4060

Ecuador

Year

Fem

ale

Sec

onda

ry S

choo

ling

1960 1970 1980 1990 2000

020

4060

Ecuador

Year

Fem

ale

Sec

onda

ry S

choo

ling

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●●

● ●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●●●

●●

1960 1970 1980 1990 2000

020

4060

Ecuador

Year

Fem

ale

Sec

onda

ry S

choo

ling

●●

●●

●●●●

●●

1960 1970 1980 1990 2000

020

4060

Ecuador

Year

Fem

ale

Sec

onda

ry S

choo

ling

1960 1970 1980 1990 2000

2040

6080

100

Greece

Year

Fem

ale

Sec

onda

ry S

choo

ling

1960 1970 1980 1990 2000

2040

6080

100

Greece

Year

Fem

ale

Sec

onda

ry S

choo

ling

●●

●●

●●●

●●

●●

●●

●● ●

●●

●●●

●●●

●●

●●

●●●

●●

●●

●●●

●●●

●●

●●

●●

●●

●●

●● ●●

●●●●

●●

●●●

●●

1960 1970 1980 1990 2000

2040

6080

100

Greece

Year

Fem

ale

Sec

onda

ry S

choo

ling

●●

●●●

●●●●●●●●

●●●

1960 1970 1980 1990 2000

2040

6080

100

Greece

Year

Fem

ale

Sec

onda

ry S

choo

ling

1960 1970 1980 1990 2000

2040

6080

100

Mongolia

Year

Fem

ale

Sec

onda

ry S

choo

ling

1960 1970 1980 1990 2000

2040

6080

100

Mongolia

Year

Fem

ale

Sec

onda

ry S

choo

ling

●●●

●●

●●

●●●

●●

●●

●●●●

●●

●●

●●●

●●●●●

●●

●●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●

●●●

● ●

●●

●●

1960 1970 1980 1990 2000

2040

6080

100

Mongolia

Year

Fem

ale

Sec

onda

ry S

choo

ling

● ●

●●●●●●●●●●●

●●●●

1960 1970 1980 1990 2000

2040

6080

100

Mongolia

Year

Fem

ale

Sec

onda

ry S

choo

ling

1960 1970 1980 1990 2000

010

2030

4050

Nepal

Year

Fem

ale

Sec

onda

ry S

choo

ling

1960 1970 1980 1990 2000

010

2030

4050

Nepal

Year

Fem

ale

Sec

onda

ry S

choo

ling

●●●

● ●●●●

●●

●● ●●●●

●●●● ●●

●●●●●

●●●

●●

●●●● ●●●● ●

●●●●

●●

●●●● ●●●

●●

●●● ●

●●●

●●●● ●●●

● ●●●

●●

●●●

1960 1970 1980 1990 2000

010

2030

4050

Nepal

Year

Fem

ale

Sec

onda

ry S

choo

ling

● ● ●●

●●

●●

●●●

●●

●●

●●

●●

1960 1970 1980 1990 2000

010

2030

4050

Nepal

Year

Fem

ale

Sec

onda

ry S

choo

ling

Black = observed. Blue circles = five imputations; Bars = 95% CIs

Gary King () Missing Data 40 / 42

Page 242: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Listwise Deletion Multiple Imputation

Life ExpectancyRich Democracies −.072 .233

(.179) (.037)Poor Democracies −.082 .120

(.040) (.099)N 1789 5627

Secondary EducationRich Democracies .948 .948

(.002) (.019)Poor Democracies .373 .393

(.094) (.081)N 1966 5627

Replication of Baum and Lake; the effect of being a democracy on lifeexpectancy and on the percentage enrolled in secondary education (withp-values in parentheses).

Gary King () Missing Data 41 / 42

Page 243: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Making Multiple Imputation Useful

1. MI was invented > 20 years ago by Donald Rubin. Despite having wonmany theoretical wars over its appropriateness, it was not often used.

2. Frequentists don’t like it because he gave it a Bayesian justification

3. Bayesians don’t like it because once you do the hard work you might aswell just use an application-specific method.

4. The idea was that data providers would use inside information to makeimputations, but this is rare

5. Applied people love the idea, since it doesn’t disrupt analysis methods

But it wasn’t used because creating proper imputations is hard withouteasy algorithms & software.

6. The new algorithms allow users to use create the imputationsthemselves.

Gary King () Missing Data 42 / 42

Page 244: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Making Multiple Imputation Useful

1. MI was invented > 20 years ago by Donald Rubin. Despite having wonmany theoretical wars over its appropriateness, it was not often used.

2. Frequentists don’t like it because he gave it a Bayesian justification

3. Bayesians don’t like it because once you do the hard work you might aswell just use an application-specific method.

4. The idea was that data providers would use inside information to makeimputations, but this is rare

5. Applied people love the idea, since it doesn’t disrupt analysis methods

But it wasn’t used because creating proper imputations is hard withouteasy algorithms & software.

6. The new algorithms allow users to use create the imputationsthemselves.

Gary King () Missing Data 42 / 42

Page 245: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Making Multiple Imputation Useful

1. MI was invented > 20 years ago by Donald Rubin. Despite having wonmany theoretical wars over its appropriateness, it was not often used.

2. Frequentists don’t like it because he gave it a Bayesian justification

3. Bayesians don’t like it because once you do the hard work you might aswell just use an application-specific method.

4. The idea was that data providers would use inside information to makeimputations, but this is rare

5. Applied people love the idea, since it doesn’t disrupt analysis methods

But it wasn’t used because creating proper imputations is hard withouteasy algorithms & software.

6. The new algorithms allow users to use create the imputationsthemselves.

Gary King () Missing Data 42 / 42

Page 246: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Making Multiple Imputation Useful

1. MI was invented > 20 years ago by Donald Rubin. Despite having wonmany theoretical wars over its appropriateness, it was not often used.

2. Frequentists don’t like it because he gave it a Bayesian justification

3. Bayesians don’t like it because once you do the hard work you might aswell just use an application-specific method.

4. The idea was that data providers would use inside information to makeimputations, but this is rare

5. Applied people love the idea, since it doesn’t disrupt analysis methods

But it wasn’t used because creating proper imputations is hard withouteasy algorithms & software.

6. The new algorithms allow users to use create the imputationsthemselves.

Gary King () Missing Data 42 / 42

Page 247: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Making Multiple Imputation Useful

1. MI was invented > 20 years ago by Donald Rubin. Despite having wonmany theoretical wars over its appropriateness, it was not often used.

2. Frequentists don’t like it because he gave it a Bayesian justification

3. Bayesians don’t like it because once you do the hard work you might aswell just use an application-specific method.

4. The idea was that data providers would use inside information to makeimputations, but this is rare

5. Applied people love the idea, since it doesn’t disrupt analysis methods

But it wasn’t used because creating proper imputations is hard withouteasy algorithms & software.

6. The new algorithms allow users to use create the imputationsthemselves.

Gary King () Missing Data 42 / 42

Page 248: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Making Multiple Imputation Useful

1. MI was invented > 20 years ago by Donald Rubin. Despite having wonmany theoretical wars over its appropriateness, it was not often used.

2. Frequentists don’t like it because he gave it a Bayesian justification

3. Bayesians don’t like it because once you do the hard work you might aswell just use an application-specific method.

4. The idea was that data providers would use inside information to makeimputations, but this is rare

5. Applied people love the idea, since it doesn’t disrupt analysis methods

But it wasn’t used because creating proper imputations is hard withouteasy algorithms & software.

6. The new algorithms allow users to use create the imputationsthemselves.

Gary King () Missing Data 42 / 42

Page 249: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Making Multiple Imputation Useful

1. MI was invented > 20 years ago by Donald Rubin. Despite having wonmany theoretical wars over its appropriateness, it was not often used.

2. Frequentists don’t like it because he gave it a Bayesian justification

3. Bayesians don’t like it because once you do the hard work you might aswell just use an application-specific method.

4. The idea was that data providers would use inside information to makeimputations, but this is rare

5. Applied people love the idea, since it doesn’t disrupt analysis methodsBut it wasn’t used because creating proper imputations is hard withouteasy algorithms & software.

6. The new algorithms allow users to use create the imputationsthemselves.

Gary King () Missing Data 42 / 42

Page 250: Advanced Quantitative Research Methodology, Lecture Notes ...Readings King, Gary; James Honaker; Ann Joseph; Kenneth Scheve. 2001. “Analyzing Incomplete Political Science Data: An

Making Multiple Imputation Useful

1. MI was invented > 20 years ago by Donald Rubin. Despite having wonmany theoretical wars over its appropriateness, it was not often used.

2. Frequentists don’t like it because he gave it a Bayesian justification

3. Bayesians don’t like it because once you do the hard work you might aswell just use an application-specific method.

4. The idea was that data providers would use inside information to makeimputations, but this is rare

5. Applied people love the idea, since it doesn’t disrupt analysis methodsBut it wasn’t used because creating proper imputations is hard withouteasy algorithms & software.

6. The new algorithms allow users to use create the imputationsthemselves.

Gary King () Missing Data 42 / 42