econometrics session 2 – introduction: inference amine ouazad, asst. prof. of economics

28
Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Upload: colten-beans

Post on 01-Apr-2015

231 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Econometrics

Session 2 – Introduction: Inference

Amine Ouazad,Asst. Prof. of Economics

Page 2: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Outline of the course

1. Introduction: Identification

2. Introduction: Inference

3. Linear Regression

4. Identification Issues in Linear Regressions

5. Inference Issues in Linear Regressions

Page 3: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Previous session: Identification

• Golden Benchmark: Randomization

– D = E(Y(1)|D=1) – E(Y(0)|D=0)

• We do not in fact observe E(Y(d)|D=d)…

• But we observe:�̂� ( Y ( d )| d )= 1

𝑁𝑑∑𝑘=1

𝑁 𝑑

𝑌 𝑑 ,𝑘

Page 4: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

This sessionIntroduction: Inference

• What problems appear because of the limited number of observations?

• Hands-on problem #1:– At the dinner table, your brother-in-law

suggests playing heads or tails using a coin. You suspect he is cheating. How do you prove that the coin is unbalanced?

Page 5: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

This sessionIntroduction: Inference

• Hands-on problem #2:– Using a survey of 1,248 subjects in

Singapore, you determine that the average income is $29,041 per year. How close is this mean to the true average income of Singaporeans? Do we have enough data?

Page 6: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

This sessionIntroduction: Inference

• Convergence• The Law of Large Numbers• The Central Limit Theorem• Hypothesis Testing• Inference for the estimation of

treatment effects.

Page 7: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

1. CONVERGENCESession 2 - Inference

Page 8: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Warning (you can ignore this)

• Proofs of the LLN and the CLT are omitted since most of their details are irrelevant to daily econometric practice.

• There are multiple flavors of the LLN and the CLT. I only introduce one flavor per theorem. I will introduce more versions as needed in the following sessions, but do not put too much emphasis on the distinctions (Appendix D of the Greene).

Page 9: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Notations• An estimator of a quantity is a function of

the observations in the sample.• Examples:– Estimator of the fraction of women in Singapore.– Estimator of the average salary of Chinese

CEOs.– Estimator of the effect of a medication of

patients’ health.

• An estimator is typically noted with a hat.• An estimator sometimes has an index n for

the number of observations in the sample.

Page 10: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Convergence

• Convergence in probability.– An estimator qn of q is converging in

probability to q if for all epsilon, P(|qn-q|>e) -> 0 as n->∞.

–We write plim qn = q

• An estimator of q is consistent if it converges in probability to .q

Page 11: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

2. LAW OF LARGE NUMBERS

Session 2 - Inference

Page 12: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Law of Large Numbers

• Let X1, …, Xn be an independent sequence of random variables, with finite expected value mu = E(Xj), and finite variance sigma^2 = V(Xj). Let Sn = X1+…+Xn. Then, for any epsilon>0,

• As n->infinity

Page 13: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Law of large numbers

• The empirical mean of a series of random variables X1, …, Xn converges in probability to the actual expectancy of the sequence of random variables.

• Application: What is the fraction of women in Singapore?– Xi = 1 if an individual is a woman.– EXi is the fraction of women in the population.– Empirical mean is arbitrarily close to the true

fraction of women in Singapore.– Subtlety?

Page 14: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Another application• Load the micro census data.

• Take 100, 5% samples of the dataset. Calculate the fraction of women in the dataset, for each dataset.

• Consider the approximation that the fraction of women is 51% exactly.

• Illustrate that for epsilon = 0.5%, the number of samples with a mean above 51+-0.5 is shrinking as the size of the sample increases.

Page 15: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

2. CENTRAL LIMIT THEOREM

Session 2 - Inference

Page 16: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Central Limit Theorem

Lindeberg-Levy Central Limit Theorem:• If x1,…,xn are an independent random

sample from a probability distribution with finite mean m and finite variance s2, and then

• Proof: Rao (1973, p.127) using characteristic functions

),0()( 2 Nxn dn

Page 17: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Applications: Central Limit Theorem

• Exercise #1:– You observe

heads,tails,tails,heads,tails,heads.– Give an estimate of the probability of

heads, with a 95% confidence interval.

• Exercise #2:– Solve the hands-on problem #2 at the

beginning of these slides.– Discuss the assumptions of the CLT.

Page 18: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

3. HYPOTHESIS TESTINGSession 2 - Inference

Page 19: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Hypothesis testing

• Null hypothesis H0.

• Alternative hypothesis Ha.

• Unknown parameter q.

• Typical null hypothesis:– Is q = 0 ?– Is q > 3 ?– Is q = f ? (if f is another unknown

parameter).– Is q = 4 f ?

Page 20: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Hypothesis Testing: Applications

• Application #1 (Coin toss): is the coin balanced?–Write the null hypothesis.– Given the information presented before,

can we reject the null hypothesis at 95%?

• Application #2 (Average income): is the average income greater than $29,000 ?–Write the null hypothesis.– Given the information presented before,

can we reject the null hypothesis at 90%?

Page 21: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

t-test• From the Central Limit Theorem, if

the standard deviation were known, under the null hypothesis:

• But the s.d. is estimated, and, under the null hypothesis:

𝑋−𝑚0

𝜎 /√𝑛→𝑁 (0,1)

𝑋−𝑚0

𝜎 /√𝑛→𝑆(𝑛−1)

Page 22: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Critical region

• Region for which the null hypothesis is rejected.

• If the null hypothesis is true, then the null is rejected in 5% of cases if the critical region is:

• Where cq is the qth quantile of the student distribution with n-1 degrees of freedom.

𝐶=(−∞,+∞ )−[𝑚−𝑐2.5𝑠 ;𝑚+𝑐97.5𝑠 ]

Page 23: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Flavors of t-tests

• One-sample, two-sided.– See previous slides.

• One-sample, one-sided.

• Two-sample, two-sided.– Equal and unequal variances.

• Two-sample, one-sided.– Equal and unequal variances.

Page 24: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Errors

• E.g. in judicial trials, medical tests, security checks.• Power of a test 1-b: probability of rejecting the

null when the null is false. • Size of a test a: proba of type I error.

Null hypothesisis not rejected

Null hypothesisis rejected

Null hypothesis is true

Cool, no worries Type I errorProbability a

Null hypothesis is wrong

Type II errorProbability b

Cool, no worries

Page 25: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Quirky

• Many papers run a large number of tests on the same data.

• Many papers report only significant tests…– What is wrong with this approach?

• Many papers run “robustness checks”, i.e. tests where the null hypothesis should not to be rejected.– What is wrong with this approach?

• Conclusion:– This is wrong, but common practice.

For more , see January 2012 of Strategic Management Journal.

Page 26: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

4. INFERENCE FOR TREATMENT EFFECTS

Session 2 - Inference

Page 27: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Treatment effects:Inference (inspired by Lazear)

• There are two groups, a treatment and a control group.

• 128 employees are randomly allocated to the treatment and to the control.

• Treatment employees: piece rate payoff.• Control employees: fixed pay.• Treatment workers is 38.3 pieces per

hour in the treatment group, and is 23.1 in the control group.

Page 28: Econometrics Session 2 – Introduction: Inference Amine Ouazad, Asst. Prof. of Economics

Questions

1. Why do we perform a randomized experiment?

2. Do we have enough information to get an estimator of the treatment effect?

3. Is the estimator consistent?4. Is the estimator asymptotically

normal?5. Do we have enough information to get a

95% confidence interval around the estimator of the treatment effect?

6. Test the hypothesis that the medication is effective at raising the health index.