Topic 11: Matrix Approach to Linear Regression


Topic 11: Matrix Approach to Linear Regression

Outline

• Linear Regression in Matrix Form

The Model in Scalar Form

• Yi = β0 + β1Xi + ei

• The ei are independent Normally distributed random variables with mean 0 and variance σ2

• Consider writing the observations:

Y1 = β0 + β1X1 + e1
Y2 = β0 + β1X2 + e2
⋮
Yn = β0 + β1Xn + en

The Model in Matrix Form

\begin{bmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_n \end{bmatrix} = \begin{bmatrix} \beta_0 + \beta_1 X_1 \\ \beta_0 + \beta_1 X_2 \\ \vdots \\ \beta_0 + \beta_1 X_n \end{bmatrix} + \begin{bmatrix} e_1 \\ e_2 \\ \vdots \\ e_n \end{bmatrix}

The Model in Matrix Form II

\begin{bmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_n \end{bmatrix} = \begin{bmatrix} 1 & X_1 \\ 1 & X_2 \\ \vdots & \vdots \\ 1 & X_n \end{bmatrix} \begin{bmatrix} \beta_0 \\ \beta_1 \end{bmatrix} + \begin{bmatrix} e_1 \\ e_2 \\ \vdots \\ e_n \end{bmatrix}

The Design Matrix

X_{n \times 2} = \begin{bmatrix} 1 & X_1 \\ 1 & X_2 \\ \vdots & \vdots \\ 1 & X_n \end{bmatrix}

Vector of Parameters

\beta_{2 \times 1} = \begin{bmatrix} \beta_0 \\ \beta_1 \end{bmatrix}

Vector of error terms

\varepsilon_{n \times 1} = \begin{bmatrix} e_1 \\ e_2 \\ \vdots \\ e_n \end{bmatrix}

Vector of responses

Y_{n \times 1} = \begin{bmatrix} Y_1 \\ Y_2 \\ \vdots \\ Y_n \end{bmatrix}

Simple Linear Regression in Matrix Form

Y_{n \times 1} = X_{n \times 2}\, \beta_{2 \times 1} + \varepsilon_{n \times 1}

or simply Y = Xβ + ε.
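
As a concrete illustration of this setup (not part of the original slides), here is a minimal NumPy sketch that builds the design matrix X with a leading column of ones and generates responses from Y = Xβ + ε; the sample size, β values, and σ are made-up assumptions.

import numpy as np

rng = np.random.default_rng(0)

n = 20
x = rng.uniform(0, 10, size=n)            # hypothetical explanatory variable X_i
X = np.column_stack([np.ones(n), x])      # design matrix (n x 2): column of ones, then the X_i

beta = np.array([2.0, 0.5])               # assumed (beta_0, beta_1)
sigma = 1.0                               # assumed error standard deviation
e = rng.normal(0.0, sigma, size=n)        # e_i independent N(0, sigma^2)

Y = X @ beta + e                          # the model in matrix form: Y = X beta + e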

Variance-Covariance Matrix

\sigma^2\{Y\}_{n \times n} = \begin{bmatrix} \sigma^2\{Y_1\} & \sigma\{Y_1, Y_2\} & \cdots & \sigma\{Y_1, Y_n\} \\ \sigma\{Y_2, Y_1\} & \sigma^2\{Y_2\} & \cdots & \sigma\{Y_2, Y_n\} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma\{Y_n, Y_1\} & \sigma\{Y_n, Y_2\} & \cdots & \sigma^2\{Y_n\} \end{bmatrix}

Main diagonal values are the variances and off-diagonal values are the covariances.

Covariance Matrix of e

Cov\{e\}_{n \times n} = \sigma^2\{e\} = \begin{bmatrix} \sigma^2 & 0 & \cdots & 0 \\ 0 & \sigma^2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \sigma^2 \end{bmatrix}

Independent errors mean that the covariance of any two error terms is zero. Common variance implies that the main diagonal values are all equal.

Covariance Matrix of Y

Cov\{Y\}_{n \times n} = \begin{bmatrix} \sigma^2 & 0 & \cdots & 0 \\ 0 & \sigma^2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \sigma^2 \end{bmatrix} = \sigma^2 I_{n \times n}

Since Xβ is a vector of constants, Cov{Y} = Cov{Xβ + ε} = Cov{ε} = σ²I.

Distributional Assumptions in Matrix Form

• e ~ N(0, σ²I)

• I is an n × n identity matrix

• Ones in the diagonal elements specify that the variance of each ei is 1 × σ² = σ²

• Zeros in the off-diagonal elements specify that the covariance between different ei is zero

• This implies that the correlations are zero
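
To make the assumption e ~ N(0, σ²I) concrete, here is a small sketch (dimensions and σ assumed for illustration) showing that one draw from the multivariate Normal with covariance σ²I is the same, in distribution, as n independent N(0, σ²) draws:

import numpy as np

rng = np.random.default_rng(1)
n, sigma = 5, 2.0

cov = sigma**2 * np.eye(n)                          # sigma^2 I: equal variances, zero covariances
e_vec = rng.multivariate_normal(np.zeros(n), cov)   # one draw of the error vector e ~ N(0, sigma^2 I)

e_indep = rng.normal(0.0, sigma, size=n)            # equivalent: n independent N(0, sigma^2) draws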

Least Squares

• We want to minimize (Y − Xβ)′(Y − Xβ)

• We take the derivative with respect to the (vector) β

• This is like a quadratic function

• Recall the function we minimized using the scalar form

Least Squares

• By the chain rule, the derivative is 2 times the derivative of (Y − Xβ) with respect to β, transposed, times (Y − Xβ)

• In other words, −2X′(Y − Xβ)

• We set this equal to 0 (a vector of zeros)

• So, −2X′(Y − Xβ) = 0

• Or, X′Y = X′Xβ (the normal equations)

Normal Equations

• X′Y = (X′X)β

• Solving for β gives the least squares solution b = (b0, b1)

• b = (X′X)⁻¹X′Y

• See KNNL p 199 for details

• This same matrix approach works for multiple regression!
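
A minimal sketch of solving the normal equations numerically, using the same kind of made-up data as the earlier sketch; solving the linear system (X′X)b = X′Y directly is generally preferred to forming the inverse explicitly.

import numpy as np

# Hypothetical data, as in the earlier sketch
rng = np.random.default_rng(0)
n = 20
x = rng.uniform(0, 10, size=n)
X = np.column_stack([np.ones(n), x])
Y = X @ np.array([2.0, 0.5]) + rng.normal(0.0, 1.0, size=n)

# Normal equations: (X'X) b = X'Y
b = np.linalg.solve(X.T @ X, X.T @ Y)       # least squares solution (b0, b1)

# Equivalent form, as on the slide, but numerically less stable:
b_inv = np.linalg.inv(X.T @ X) @ (X.T @ Y)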

Fitted Values

\hat{Y} = \begin{bmatrix} \hat{Y}_1 \\ \hat{Y}_2 \\ \vdots \\ \hat{Y}_n \end{bmatrix} = Xb = \begin{bmatrix} 1 & X_1 \\ 1 & X_2 \\ \vdots & \vdots \\ 1 & X_n \end{bmatrix} \begin{bmatrix} b_0 \\ b_1 \end{bmatrix} = \begin{bmatrix} b_0 + b_1 X_1 \\ b_0 + b_1 X_2 \\ \vdots \\ b_0 + b_1 X_n \end{bmatrix}

Hat Matrix

\hat{Y} = Xb

\hat{Y} = X(X'X)^{-1}X'Y

\hat{Y} = HY, \quad \text{where } H = X(X'X)^{-1}X'

We’ll use this matrix when assessing diagnostics in multiple regression
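
Continuing the same made-up example, a short sketch of forming the hat matrix H = X(X′X)⁻¹X′ and checking that Ŷ = HY matches Ŷ = Xb:

import numpy as np

# Hypothetical data, as in the earlier sketches
rng = np.random.default_rng(0)
n = 20
x = rng.uniform(0, 10, size=n)
X = np.column_stack([np.ones(n), x])
Y = X @ np.array([2.0, 0.5]) + rng.normal(0.0, 1.0, size=n)

b = np.linalg.solve(X.T @ X, X.T @ Y)       # least squares estimates

H = X @ np.linalg.inv(X.T @ X) @ X.T        # hat matrix: projects Y onto the column space of X
Y_hat = H @ Y                               # fitted values via the hat matrix
print(np.allclose(Y_hat, X @ b))            # True: HY equals Xb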

Estimated Covariance Matrix of b

• The vector b is a linear combination of the elements of Y

• These estimates are Normal if Y is Normal

• These estimates will be approximately Normal in general

A Useful Multivariate Theorem

• U ~ N(μ, Σ), a multivariate Normal vector

• V = c + DU, a linear transformation of U

c is a vector and D is a matrix

• Then V ~ N(c + Dμ, DΣD′)

Application to b

• b = (X′X)⁻¹X′Y = ((X′X)⁻¹X′)Y

• Since Y ~ N(Xβ, σ²I), the vector b is Normally distributed with mean (X′X)⁻¹X′Xβ = β and covariance

σ²((X′X)⁻¹X′) I ((X′X)⁻¹X′)′ = σ²(X′X)⁻¹
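
As a sketch of how this covariance matrix is estimated in practice (the unknown σ² is replaced by the mean squared error, MSE = SSE/(n − 2) in simple linear regression), again with the same hypothetical data:

import numpy as np

# Hypothetical data, as in the earlier sketches
rng = np.random.default_rng(0)
n = 20
x = rng.uniform(0, 10, size=n)
X = np.column_stack([np.ones(n), x])
Y = X @ np.array([2.0, 0.5]) + rng.normal(0.0, 1.0, size=n)

b = np.linalg.solve(X.T @ X, X.T @ Y)

resid = Y - X @ b                            # residuals Y_i - Yhat_i
mse = resid @ resid / (n - 2)                # estimate of sigma^2 (two parameters estimated)
cov_b = mse * np.linalg.inv(X.T @ X)         # estimated covariance matrix of b: MSE * (X'X)^{-1}
se_b = np.sqrt(np.diag(cov_b))               # standard errors of b0 and b1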

Background Reading

• We will use this framework to do multiple regression where we have more than one explanatory variable

• Another explanatory variable is comparable to adding another column in the design matrix

• See Chapter 6
