bayesian irt modeling - universiteit twente 25ste irt workshop/albert.pdf · bayesian irt modeling...

127
Some History MCMC IRT Fitting Model Checking The Future Bayesian IRT Modeling Jim Albert Bowling Green State University October 12, 2009

Upload: phamkien

Post on 03-May-2018

223 views

Category:

Documents


1 download

TRANSCRIPT

Some History MCMC IRT Fitting Model Checking The Future

Bayesian IRT Modeling

Jim Albert

Bowling Green State University

October 12, 2009

Some History MCMC IRT Fitting Model Checking The Future

Outline

Some History

MCMC IRT Fitting

Model Checking

The Future

Some History MCMC IRT Fitting Model Checking The Future

18th IRT Workshop (7 years ago)

• Why Bayes?

• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?

• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

18th IRT Workshop (7 years ago)

• Why Bayes?

• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?

• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

18th IRT Workshop (7 years ago)

• Why Bayes?

• Ease of fitting IRT models by MCMC.

• Future of Bayes IRT?• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

18th IRT Workshop (7 years ago)

• Why Bayes?

• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?

• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

18th IRT Workshop (7 years ago)

• Why Bayes?

• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?

• Software to simply MCMC fitting

• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

18th IRT Workshop (7 years ago)

• Why Bayes?

• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?

• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling

• Development in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

18th IRT Workshop (7 years ago)

• Why Bayes?

• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?

• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

What’s Happened in 7 Years?

• R – the standard for the development ofstatistical software

• R packages are available for fitting BayesianIRT models

• Developments in Bayesian IRT modeling

• Developments in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

What’s Happened in 7 Years?

• R – the standard for the development ofstatistical software

• R packages are available for fitting BayesianIRT models

• Developments in Bayesian IRT modeling

• Developments in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

What’s Happened in 7 Years?

• R – the standard for the development ofstatistical software

• R packages are available for fitting BayesianIRT models

• Developments in Bayesian IRT modeling

• Developments in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

What’s Happened in 7 Years?

• R – the standard for the development ofstatistical software

• R packages are available for fitting BayesianIRT models

• Developments in Bayesian IRT modeling

• Developments in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

What’s Happened in 7 Years?

• R – the standard for the development ofstatistical software

• R packages are available for fitting BayesianIRT models

• Developments in Bayesian IRT modeling

• Developments in Bayesian model checking

Some History MCMC IRT Fitting Model Checking The Future

Bayesian IRT Literature

• Bayesian modeling by MCMC

• Data augmentation and Gibbs sampling(Albert, Patz and Junker)

• Provides a framework for more sophisticatedBayesian modeling.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian IRT Literature

• Bayesian modeling by MCMC

• Data augmentation and Gibbs sampling(Albert, Patz and Junker)

• Provides a framework for more sophisticatedBayesian modeling.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian IRT Literature

• Bayesian modeling by MCMC

• Data augmentation and Gibbs sampling(Albert, Patz and Junker)

• Provides a framework for more sophisticatedBayesian modeling.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian IRT Literature

• Bayesian modeling by MCMC

• Data augmentation and Gibbs sampling(Albert, Patz and Junker)

• Provides a framework for more sophisticatedBayesian modeling.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Multilevel Modeling

• Modeling of ability parameters (Fox and Glas).

• Combining IRT models across groups.

• Modeling clustering patterns such as testscomposed of testlets.

• Natural to think of multilevel models from aBayesian perspective.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Multilevel Modeling

• Modeling of ability parameters (Fox and Glas).

• Combining IRT models across groups.

• Modeling clustering patterns such as testscomposed of testlets.

• Natural to think of multilevel models from aBayesian perspective.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Multilevel Modeling

• Modeling of ability parameters (Fox and Glas).

• Combining IRT models across groups.

• Modeling clustering patterns such as testscomposed of testlets.

• Natural to think of multilevel models from aBayesian perspective.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Multilevel Modeling

• Modeling of ability parameters (Fox and Glas).

• Combining IRT models across groups.

• Modeling clustering patterns such as testscomposed of testlets.

• Natural to think of multilevel models from aBayesian perspective.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Multilevel Modeling

• Modeling of ability parameters (Fox and Glas).

• Combining IRT models across groups.

• Modeling clustering patterns such as testscomposed of testlets.

• Natural to think of multilevel models from aBayesian perspective.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Modeling Checking

• Posterior predictive checking (Sinharray)

• Comparing IRT models by Bayes factors?

• Seems like there is a need for further work inthis area.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Modeling Checking

• Posterior predictive checking (Sinharray)

• Comparing IRT models by Bayes factors?

• Seems like there is a need for further work inthis area.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Modeling Checking

• Posterior predictive checking (Sinharray)

• Comparing IRT models by Bayes factors?

• Seems like there is a need for further work inthis area.

Some History MCMC IRT Fitting Model Checking The Future

Bayesian Modeling Checking

• Posterior predictive checking (Sinharray)

• Comparing IRT models by Bayes factors?

• Seems like there is a need for further work inthis area.

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Popular?

• Ayala, The Theory and Practice of ItemResponse Theory, 2009.

• Mentions MCMC as a footnote on page 97.

• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”

• “However, MCMC implementation software isnot as user-friendly”

• “Given this book’s orientation, MCMCmethods are not covered”

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Popular?

• Ayala, The Theory and Practice of ItemResponse Theory, 2009.

• Mentions MCMC as a footnote on page 97.

• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”

• “However, MCMC implementation software isnot as user-friendly”

• “Given this book’s orientation, MCMCmethods are not covered”

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Popular?

• Ayala, The Theory and Practice of ItemResponse Theory, 2009.

• Mentions MCMC as a footnote on page 97.

• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”

• “However, MCMC implementation software isnot as user-friendly”

• “Given this book’s orientation, MCMCmethods are not covered”

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Popular?

• Ayala, The Theory and Practice of ItemResponse Theory, 2009.

• Mentions MCMC as a footnote on page 97.

• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”

• “However, MCMC implementation software isnot as user-friendly”

• “Given this book’s orientation, MCMCmethods are not covered”

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Popular?

• Ayala, The Theory and Practice of ItemResponse Theory, 2009.

• Mentions MCMC as a footnote on page 97.

• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”

• “However, MCMC implementation software isnot as user-friendly”

• “Given this book’s orientation, MCMCmethods are not covered”

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Popular?

• Ayala, The Theory and Practice of ItemResponse Theory, 2009.

• Mentions MCMC as a footnote on page 97.

• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”

• “However, MCMC implementation software isnot as user-friendly”

• “Given this book’s orientation, MCMCmethods are not covered”

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Dangerous?

• WinBUGS software User Manual states in UserManual:

• “Beware: MCMC sampling can be dangerous!”

• Some truth to this.

• Have to think about starting values, monitorconvergence, etc.

• But MCMC fitting works well for manyapplications.

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Dangerous?

• WinBUGS software User Manual states in UserManual:

• “Beware: MCMC sampling can be dangerous!”

• Some truth to this.

• Have to think about starting values, monitorconvergence, etc.

• But MCMC fitting works well for manyapplications.

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Dangerous?

• WinBUGS software User Manual states in UserManual:

• “Beware: MCMC sampling can be dangerous!”

• Some truth to this.

• Have to think about starting values, monitorconvergence, etc.

• But MCMC fitting works well for manyapplications.

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Dangerous?

• WinBUGS software User Manual states in UserManual:

• “Beware: MCMC sampling can be dangerous!”

• Some truth to this.

• Have to think about starting values, monitorconvergence, etc.

• But MCMC fitting works well for manyapplications.

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Dangerous?

• WinBUGS software User Manual states in UserManual:

• “Beware: MCMC sampling can be dangerous!”

• Some truth to this.

• Have to think about starting values, monitorconvergence, etc.

• But MCMC fitting works well for manyapplications.

Some History MCMC IRT Fitting Model Checking The Future

Is Bayesian MCMC Dangerous?

• WinBUGS software User Manual states in UserManual:

• “Beware: MCMC sampling can be dangerous!”

• Some truth to this.

• Have to think about starting values, monitorconvergence, etc.

• But MCMC fitting works well for manyapplications.

Some History MCMC IRT Fitting Model Checking The Future

Example: BGSU Math Placement Test

• Most freshmen take a math placement test.

• Focus on Form B of the test (students havehad two years of high school algebra).

• Use test results together with other info (ACTmath score, high school GPA) to determineproper placement.

• Procedure seems to be generally effective inplacing students in the “right” math course.

Some History MCMC IRT Fitting Model Checking The Future

Example: BGSU Math Placement Test

• Most freshmen take a math placement test.

• Focus on Form B of the test (students havehad two years of high school algebra).

• Use test results together with other info (ACTmath score, high school GPA) to determineproper placement.

• Procedure seems to be generally effective inplacing students in the “right” math course.

Some History MCMC IRT Fitting Model Checking The Future

Example: BGSU Math Placement Test

• Most freshmen take a math placement test.

• Focus on Form B of the test (students havehad two years of high school algebra).

• Use test results together with other info (ACTmath score, high school GPA) to determineproper placement.

• Procedure seems to be generally effective inplacing students in the “right” math course.

Some History MCMC IRT Fitting Model Checking The Future

Example: BGSU Math Placement Test

• Most freshmen take a math placement test.

• Focus on Form B of the test (students havehad two years of high school algebra).

• Use test results together with other info (ACTmath score, high school GPA) to determineproper placement.

• Procedure seems to be generally effective inplacing students in the “right” math course.

Some History MCMC IRT Fitting Model Checking The Future

Questions

• What factors are most helpful in predictingstudent success in math courses?

• Can we improve the math placement test?

• Are there redundant questions that can beremoved?

• Are there particular questions that are helpfulin discriminating between good and poorstudents?

Some History MCMC IRT Fitting Model Checking The Future

Questions

• What factors are most helpful in predictingstudent success in math courses?

• Can we improve the math placement test?

• Are there redundant questions that can beremoved?

• Are there particular questions that are helpfulin discriminating between good and poorstudents?

Some History MCMC IRT Fitting Model Checking The Future

Questions

• What factors are most helpful in predictingstudent success in math courses?

• Can we improve the math placement test?

• Are there redundant questions that can beremoved?

• Are there particular questions that are helpfulin discriminating between good and poorstudents?

Some History MCMC IRT Fitting Model Checking The Future

Questions

• What factors are most helpful in predictingstudent success in math courses?

• Can we improve the math placement test?

• Are there redundant questions that can beremoved?

• Are there particular questions that are helpfulin discriminating between good and poorstudents?

Some History MCMC IRT Fitting Model Checking The Future

General Two-Parameter IRT Model

• Have n students and k items, observe datamatrix {yij}.

• The probability that ith student gets jthquestion correct is given by

pij = Φ (βjθi − αj) .

• Likelihood function of all unknowns is

L =∏

i

∏j

pyij

ij (1 − pij)1−yij .

Some History MCMC IRT Fitting Model Checking The Future

General Two-Parameter IRT Model

• Have n students and k items, observe datamatrix {yij}.

• The probability that ith student gets jthquestion correct is given by

pij = Φ (βjθi − αj) .

• Likelihood function of all unknowns is

L =∏

i

∏j

pyij

ij (1 − pij)1−yij .

Some History MCMC IRT Fitting Model Checking The Future

General Two-Parameter IRT Model

• Have n students and k items, observe datamatrix {yij}.

• The probability that ith student gets jthquestion correct is given by

pij = Φ (βjθi − αj) .

• Likelihood function of all unknowns is

L =∏

i

∏j

pyij

ij (1 − pij)1−yij .

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:

• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:• data matrix y

• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:• data matrix y• parameters of priors placed on item parameters

• aspects of MCMC fitting• starting values• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting

• starting values• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values

• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

MCMCpack package in R

• Performs MCMC fitting for a variety ofcommon Bayesian models.

• Function MCMCirt1d implements Gibbssampling for the 2-parameter model

• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations

Some History MCMC IRT Fitting Model Checking The Future

What Priors Does it Allow?

• θ1, ..., θN N(0, sθ)

• (αj , βj) iid N(µ, Σ)

• Default values sθ = 1, µ = 0, Σ = 2I .

Some History MCMC IRT Fitting Model Checking The Future

What Priors Does it Allow?

• θ1, ..., θN N(0, sθ)

• (αj , βj) iid N(µ, Σ)

• Default values sθ = 1, µ = 0, Σ = 2I .

Some History MCMC IRT Fitting Model Checking The Future

What Priors Does it Allow?

• θ1, ..., θN N(0, sθ)

• (αj , βj) iid N(µ, Σ)

• Default values sθ = 1, µ = 0, Σ = 2I .

Some History MCMC IRT Fitting Model Checking The Future

What Priors Does it Allow?

• θ1, ..., θN N(0, sθ)

• (αj , βj) iid N(µ, Σ)

• Default values sθ = 1, µ = 0, Σ = 2I .

Some History MCMC IRT Fitting Model Checking The Future

Run of this R function

• s=apply(y,1,mean)

• theta.start=(s-mean(s))/sd(s)

• fit=MCMCirt1d(y, store.item = TRUE,store.ability = TRUE, mcmc=10000,burnin=0, theta.start=theta.start)

• (76 seconds on my laptop)

Some History MCMC IRT Fitting Model Checking The Future

Run of this R function

• s=apply(y,1,mean)

• theta.start=(s-mean(s))/sd(s)

• fit=MCMCirt1d(y, store.item = TRUE,store.ability = TRUE, mcmc=10000,burnin=0, theta.start=theta.start)

• (76 seconds on my laptop)

Some History MCMC IRT Fitting Model Checking The Future

Run of this R function

• s=apply(y,1,mean)

• theta.start=(s-mean(s))/sd(s)

• fit=MCMCirt1d(y, store.item = TRUE,store.ability = TRUE, mcmc=10000,burnin=0, theta.start=theta.start)

• (76 seconds on my laptop)

Some History MCMC IRT Fitting Model Checking The Future

Run of this R function

• s=apply(y,1,mean)

• theta.start=(s-mean(s))/sd(s)

• fit=MCMCirt1d(y, store.item = TRUE,store.ability = TRUE, mcmc=10000,burnin=0, theta.start=theta.start)

• (76 seconds on my laptop)

Some History MCMC IRT Fitting Model Checking The Future

MCMC Diagnostics

• Have the simulated draws converged to theposterior distribution?

• Use a R package coda that contains a numberof diagnostic functions.

• Typically, this particular algorithm has rapidconvergence.

Some History MCMC IRT Fitting Model Checking The Future

MCMC Diagnostics

• Have the simulated draws converged to theposterior distribution?

• Use a R package coda that contains a numberof diagnostic functions.

• Typically, this particular algorithm has rapidconvergence.

Some History MCMC IRT Fitting Model Checking The Future

MCMC Diagnostics

• Have the simulated draws converged to theposterior distribution?

• Use a R package coda that contains a numberof diagnostic functions.

• Typically, this particular algorithm has rapidconvergence.

Some History MCMC IRT Fitting Model Checking The Future

MCMC Diagnostics

• Have the simulated draws converged to theposterior distribution?

• Use a R package coda that contains a numberof diagnostic functions.

• Typically, this particular algorithm has rapidconvergence.

Some History MCMC IRT Fitting Model Checking The Future

Some coda Output

0 2000 6000 10000

0.1

0.3

0.5

0.7

Iterations

Trace of beta.Q51

0.1 0.3 0.5 0.7

01

23

45

N = 10000 Bandwidth = 0.01277

Density of beta.Q51

0 2000 6000 10000

0.5

0.7

0.9

1.1

Iterations

Trace of beta.Q52

0.4 0.6 0.8 1.0 1.2

01

23

4

N = 10000 Bandwidth = 0.01538

Density of beta.Q52

Figure: Caption for coda1

Some History MCMC IRT Fitting Model Checking The Future

Inferences?

• Have matrix of simulated draws of parameters.

• Say you have particular parameter h(α, β) ofinterest.

• Apply this function to the simulated matrix –get sample from marginal posterior of h.

• In particular, can estimate item response curveF (βjθ − αj).

Some History MCMC IRT Fitting Model Checking The Future

Inferences?

• Have matrix of simulated draws of parameters.

• Say you have particular parameter h(α, β) ofinterest.

• Apply this function to the simulated matrix –get sample from marginal posterior of h.

• In particular, can estimate item response curveF (βjθ − αj).

Some History MCMC IRT Fitting Model Checking The Future

Inferences?

• Have matrix of simulated draws of parameters.

• Say you have particular parameter h(α, β) ofinterest.

• Apply this function to the simulated matrix –get sample from marginal posterior of h.

• In particular, can estimate item response curveF (βjθ − αj).

Some History MCMC IRT Fitting Model Checking The Future

Inferences?

• Have matrix of simulated draws of parameters.

• Say you have particular parameter h(α, β) ofinterest.

• Apply this function to the simulated matrix –get sample from marginal posterior of h.

• In particular, can estimate item response curveF (βjθ − αj).

Some History MCMC IRT Fitting Model Checking The Future

Inferences?

• Have matrix of simulated draws of parameters.

• Say you have particular parameter h(α, β) ofinterest.

• Apply this function to the simulated matrix –get sample from marginal posterior of h.

• In particular, can estimate item response curveF (βjθ − αj).

Some History MCMC IRT Fitting Model Checking The Future

Posterior Means of Difficulty andDiscrimination

−1.5 −1.0 −0.5 0.0 0.5

0.2

0.4

0.6

0.8

1.0

DIFFICULTY

DIS

CR

IMIN

AT

ION

Q51

Q52

Q53

Q54

Q55Q56

Q57

Q58

Q59

Q60

Q61Q62 Q63

Q64

Q65

Q66

Q67

Q68

Q69

Q70

Q71

Q72

Q73

Q74

Q75

Q76

Q77

Q78Q79

Q80

Q81

Q82

Q83

Q84

Q85

Figure: Caption for difficulty.discrimination

Some History MCMC IRT Fitting Model Checking The Future

Some Fitted Item Response Curves

−3 −2 −1 0 1 2 3

−0.2

0.2

0.6

1.0

QUESTION 53

ABILITY

PR

OB

AB

ILIT

Y

−3 −2 −1 0 1 2 3

−0.2

0.2

0.6

1.0

QUESTION 79

ABILITY

PR

OB

AB

ILIT

Y

−3 −2 −1 0 1 2 3

−0.2

0.2

0.6

1.0

QUESTION 66

ABILITY

PR

OB

AB

ILIT

Y

−3 −2 −1 0 1 2 3

−0.2

0.2

0.6

1.0

QUESTION 69

ABILITY

PR

OB

AB

ILIT

Y

Figure: Caption for irtplots

Some History MCMC IRT Fitting Model Checking The Future

Two Questions

• Q 53, Low Difficulty, Low DiscriminationQUESTION: If n × n × n = 30, which of thefollowing best approximates n?

• Q 66, High Difficulty, Low DiscriminationQUESTION: If x < 2, then |2 − x | =

Some History MCMC IRT Fitting Model Checking The Future

Two Questions

• Q 69, Moderate Difficulty, High DiscriminationQUESTION: The length L of a spring is givenby L = 5

6F + 17, where F is the applied force.What force F will produce a length L of 31?

• Q 79, High Difficulty, Moderate DiscriminationQUESTION: In the system of equations{2x + 6y = 5, x − 3y = 8}, find a solution x .

Some History MCMC IRT Fitting Model Checking The Future

Checking the Model

• General idea: is the observed data consistentwith predictions from the model?

• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.

• Simulate draws from the posterior predictivedistribution T (y ∗).

• Compute the probability that T (y) is at leastas extreme as T (y obs).

• If this probability is small, casts doubt on themodel.

Some History MCMC IRT Fitting Model Checking The Future

Checking the Model

• General idea: is the observed data consistentwith predictions from the model?

• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.

• Simulate draws from the posterior predictivedistribution T (y ∗).

• Compute the probability that T (y) is at leastas extreme as T (y obs).

• If this probability is small, casts doubt on themodel.

Some History MCMC IRT Fitting Model Checking The Future

Checking the Model

• General idea: is the observed data consistentwith predictions from the model?

• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.

• Simulate draws from the posterior predictivedistribution T (y ∗).

• Compute the probability that T (y) is at leastas extreme as T (y obs).

• If this probability is small, casts doubt on themodel.

Some History MCMC IRT Fitting Model Checking The Future

Checking the Model

• General idea: is the observed data consistentwith predictions from the model?

• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.

• Simulate draws from the posterior predictivedistribution T (y ∗).

• Compute the probability that T (y) is at leastas extreme as T (y obs).

• If this probability is small, casts doubt on themodel.

Some History MCMC IRT Fitting Model Checking The Future

Checking the Model

• General idea: is the observed data consistentwith predictions from the model?

• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.

• Simulate draws from the posterior predictivedistribution T (y ∗).

• Compute the probability that T (y) is at leastas extreme as T (y obs).

• If this probability is small, casts doubt on themodel.

Some History MCMC IRT Fitting Model Checking The Future

Checking the Model

• General idea: is the observed data consistentwith predictions from the model?

• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.

• Simulate draws from the posterior predictivedistribution T (y ∗).

• Compute the probability that T (y) is at leastas extreme as T (y obs).

• If this probability is small, casts doubt on themodel.

Some History MCMC IRT Fitting Model Checking The Future

Posterior Predictive Checks on R

• Have matrix of simulated draws of item andability parameters from posterior.

• From a simulated draws of θ, α, β, simulateprobability matrix p.

• Simulate future data y ∗ from Bernoullidistributions with probabilities p.

• Compute discrepancy function T (y ∗).

• Repeat process – get sample {T (y ∗)}.

Some History MCMC IRT Fitting Model Checking The Future

Posterior Predictive Checks on R

• Have matrix of simulated draws of item andability parameters from posterior.

• From a simulated draws of θ, α, β, simulateprobability matrix p.

• Simulate future data y ∗ from Bernoullidistributions with probabilities p.

• Compute discrepancy function T (y ∗).

• Repeat process – get sample {T (y ∗)}.

Some History MCMC IRT Fitting Model Checking The Future

Posterior Predictive Checks on R

• Have matrix of simulated draws of item andability parameters from posterior.

• From a simulated draws of θ, α, β, simulateprobability matrix p.

• Simulate future data y ∗ from Bernoullidistributions with probabilities p.

• Compute discrepancy function T (y ∗).

• Repeat process – get sample {T (y ∗)}.

Some History MCMC IRT Fitting Model Checking The Future

Posterior Predictive Checks on R

• Have matrix of simulated draws of item andability parameters from posterior.

• From a simulated draws of θ, α, β, simulateprobability matrix p.

• Simulate future data y ∗ from Bernoullidistributions with probabilities p.

• Compute discrepancy function T (y ∗).

• Repeat process – get sample {T (y ∗)}.

Some History MCMC IRT Fitting Model Checking The Future

Posterior Predictive Checks on R

• Have matrix of simulated draws of item andability parameters from posterior.

• From a simulated draws of θ, α, β, simulateprobability matrix p.

• Simulate future data y ∗ from Bernoullidistributions with probabilities p.

• Compute discrepancy function T (y ∗).

• Repeat process – get sample {T (y ∗)}.

Some History MCMC IRT Fitting Model Checking The Future

Posterior Predictive Checks on R

• Have matrix of simulated draws of item andability parameters from posterior.

• From a simulated draws of θ, α, β, simulateprobability matrix p.

• Simulate future data y ∗ from Bernoullidistributions with probabilities p.

• Compute discrepancy function T (y ∗).

• Repeat process – get sample {T (y ∗)}.

Some History MCMC IRT Fitting Model Checking The Future

Check Conditional IndependenceAssumption

• Given θ, assume responses are independent.

• Suggests using item correlation matrix as adiscrepancy function.

• Use our method to single out pairs of itemsthat have unusually small or large correlations.

• Use criterion Tail Probability < 0.02.

Some History MCMC IRT Fitting Model Checking The Future

Check Conditional IndependenceAssumption

• Given θ, assume responses are independent.

• Suggests using item correlation matrix as adiscrepancy function.

• Use our method to single out pairs of itemsthat have unusually small or large correlations.

• Use criterion Tail Probability < 0.02.

Some History MCMC IRT Fitting Model Checking The Future

Check Conditional IndependenceAssumption

• Given θ, assume responses are independent.

• Suggests using item correlation matrix as adiscrepancy function.

• Use our method to single out pairs of itemsthat have unusually small or large correlations.

• Use criterion Tail Probability < 0.02.

Some History MCMC IRT Fitting Model Checking The Future

Check Conditional IndependenceAssumption

• Given θ, assume responses are independent.

• Suggests using item correlation matrix as adiscrepancy function.

• Use our method to single out pairs of itemsthat have unusually small or large correlations.

• Use criterion Tail Probability < 0.02.

Some History MCMC IRT Fitting Model Checking The Future

Check Conditional IndependenceAssumption

• Given θ, assume responses are independent.

• Suggests using item correlation matrix as adiscrepancy function.

• Use our method to single out pairs of itemsthat have unusually small or large correlations.

• Use criterion Tail Probability < 0.02.

Some History MCMC IRT Fitting Model Checking The Future

Find Questions Pairs that are Similar orDissimilar

• Question Q83 that asks for the value of k whenf (x) = x2 − kx − 3, f (2) = 0, similar toquestion Q82 that asks for one of the solutionsof the equation x2 + 4x = −13.

• Question Q83 that solves a equation isdifferent from question Q53 that has thestudent choose the best approximation to nwhen n × n × n = 30.

Some History MCMC IRT Fitting Model Checking The Future

Find Questions Pairs that are Similar orDissimilar

• Question Q83 that asks for the value of k whenf (x) = x2 − kx − 3, f (2) = 0, similar toquestion Q82 that asks for one of the solutionsof the equation x2 + 4x = −13.

• Question Q83 that solves a equation isdifferent from question Q53 that has thestudent choose the best approximation to nwhen n × n × n = 30.

Some History MCMC IRT Fitting Model Checking The Future

Find Questions Pairs that are Similar orDissimilar

• Question Q83 that asks for the value of k whenf (x) = x2 − kx − 3, f (2) = 0, similar toquestion Q82 that asks for one of the solutionsof the equation x2 + 4x = −13.

• Question Q83 that solves a equation isdifferent from question Q53 that has thestudent choose the best approximation to nwhen n × n × n = 30.

Some History MCMC IRT Fitting Model Checking The Future

Are Discrimination Parameter EstimatesSuitable?

• An estimate of a item’s discrimination ability isby the item total correlation.

• Measure of variability of item total correlationis the standard deviation of these correlations.

• Use this standard deviation as a discrepancyfunction.

• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.

• This suggests our discrimination parameterestimates are too disperse.

Some History MCMC IRT Fitting Model Checking The Future

Are Discrimination Parameter EstimatesSuitable?

• An estimate of a item’s discrimination ability isby the item total correlation.

• Measure of variability of item total correlationis the standard deviation of these correlations.

• Use this standard deviation as a discrepancyfunction.

• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.

• This suggests our discrimination parameterestimates are too disperse.

Some History MCMC IRT Fitting Model Checking The Future

Are Discrimination Parameter EstimatesSuitable?

• An estimate of a item’s discrimination ability isby the item total correlation.

• Measure of variability of item total correlationis the standard deviation of these correlations.

• Use this standard deviation as a discrepancyfunction.

• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.

• This suggests our discrimination parameterestimates are too disperse.

Some History MCMC IRT Fitting Model Checking The Future

Are Discrimination Parameter EstimatesSuitable?

• An estimate of a item’s discrimination ability isby the item total correlation.

• Measure of variability of item total correlationis the standard deviation of these correlations.

• Use this standard deviation as a discrepancyfunction.

• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.

• This suggests our discrimination parameterestimates are too disperse.

Some History MCMC IRT Fitting Model Checking The Future

Are Discrimination Parameter EstimatesSuitable?

• An estimate of a item’s discrimination ability isby the item total correlation.

• Measure of variability of item total correlationis the standard deviation of these correlations.

• Use this standard deviation as a discrepancyfunction.

• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.

• This suggests our discrimination parameterestimates are too disperse.

Some History MCMC IRT Fitting Model Checking The Future

Are Discrimination Parameter EstimatesSuitable?

• An estimate of a item’s discrimination ability isby the item total correlation.

• Measure of variability of item total correlationis the standard deviation of these correlations.

• Use this standard deviation as a discrepancyfunction.

• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.

• This suggests our discrimination parameterestimates are too disperse.

Some History MCMC IRT Fitting Model Checking The Future

Histogram of the Posterior PredictiveDistribution of the Standard Deviation of

the Item Total Correlations

Standard Deviation of Item Score Correlations

Fre

quency

0.08 0.09 0.10 0.11 0.12 0.13

050

100

150

200

Observed

tail−prob = 0.082

Figure: Caption for histogram

Some History MCMC IRT Fitting Model Checking The Future

1 or 2 Parameter Model?

• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?

• Actually I think both models are incorrect.

• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.

• Problem is to detect clusters.

Some History MCMC IRT Fitting Model Checking The Future

1 or 2 Parameter Model?

• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?

• Actually I think both models are incorrect.

• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.

• Problem is to detect clusters.

Some History MCMC IRT Fitting Model Checking The Future

1 or 2 Parameter Model?

• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?

• Actually I think both models are incorrect.

• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.

• Problem is to detect clusters.

Some History MCMC IRT Fitting Model Checking The Future

1 or 2 Parameter Model?

• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?

• Actually I think both models are incorrect.

• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.

• Problem is to detect clusters.

Some History MCMC IRT Fitting Model Checking The Future

1 or 2 Parameter Model?

• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?

• Actually I think both models are incorrect.

• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.

• Problem is to detect clusters.

Some History MCMC IRT Fitting Model Checking The Future

Multilevel Model

• Assume discrimination parameters β1, ..., βk arerandom sample from a common distributiong(β).

• If g is normal, posterior estimates of βj willshrink towards a common value.

• If instead you choose a Cauchy density for g ,get more adaptive shrinkage. (Seems moreappropriate here.)

Some History MCMC IRT Fitting Model Checking The Future

Multilevel Model

• Assume discrimination parameters β1, ..., βk arerandom sample from a common distributiong(β).

• If g is normal, posterior estimates of βj willshrink towards a common value.

• If instead you choose a Cauchy density for g ,get more adaptive shrinkage. (Seems moreappropriate here.)

Some History MCMC IRT Fitting Model Checking The Future

Multilevel Model

• Assume discrimination parameters β1, ..., βk arerandom sample from a common distributiong(β).

• If g is normal, posterior estimates of βj willshrink towards a common value.

• If instead you choose a Cauchy density for g ,get more adaptive shrinkage. (Seems moreappropriate here.)

Some History MCMC IRT Fitting Model Checking The Future

Multilevel Model

• Assume discrimination parameters β1, ..., βk arerandom sample from a common distributiong(β).

• If g is normal, posterior estimates of βj willshrink towards a common value.

• If instead you choose a Cauchy density for g ,get more adaptive shrinkage. (Seems moreappropriate here.)

Some History MCMC IRT Fitting Model Checking The Future

Multilevel Modeling with a Cauchy Density

0.2 0.4 0.6 0.8 1.0

ESTIMATE

INDIVIDUAL

EXCHANGEABLE

Figure: Caption for cauchy.estimates

Some History MCMC IRT Fitting Model Checking The Future

Math Placement Data – Further Analysis

• Multilevel model where θi are modeled in termsof student covariates (gender, ACT math,GPA).

• Combine modeling across different groups ofstudents, where “groups” are defined in termsof entering course, form of test, year, etc.

• Test can be grouped by different sections(Arithmetic, Fractions, Graphing, Advanced,Solving) and this motives a “testlet” modelwhere responses to questions within eachtestlet are assumed positively correlated.

Some History MCMC IRT Fitting Model Checking The Future

Math Placement Data – Further Analysis

• Multilevel model where θi are modeled in termsof student covariates (gender, ACT math,GPA).

• Combine modeling across different groups ofstudents, where “groups” are defined in termsof entering course, form of test, year, etc.

• Test can be grouped by different sections(Arithmetic, Fractions, Graphing, Advanced,Solving) and this motives a “testlet” modelwhere responses to questions within eachtestlet are assumed positively correlated.

Some History MCMC IRT Fitting Model Checking The Future

Math Placement Data – Further Analysis

• Multilevel model where θi are modeled in termsof student covariates (gender, ACT math,GPA).

• Combine modeling across different groups ofstudents, where “groups” are defined in termsof entering course, form of test, year, etc.

• Test can be grouped by different sections(Arithmetic, Fractions, Graphing, Advanced,Solving) and this motives a “testlet” modelwhere responses to questions within eachtestlet are assumed positively correlated.

Some History MCMC IRT Fitting Model Checking The Future

Math Placement Data – Further Analysis

• Multilevel model where θi are modeled in termsof student covariates (gender, ACT math,GPA).

• Combine modeling across different groups ofstudents, where “groups” are defined in termsof entering course, form of test, year, etc.

• Test can be grouped by different sections(Arithmetic, Fractions, Graphing, Advanced,Solving) and this motives a “testlet” modelwhere responses to questions within eachtestlet are assumed positively correlated.

Some History MCMC IRT Fitting Model Checking The Future

The Future of Bayesian IRT Modeling

• Model checking and model comparison.

• Multilevel modeling. Effective way ofcombining similar sources of data.

• New book by Jean-Paul Fox on Bayesian IRT(with software).

• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.

Some History MCMC IRT Fitting Model Checking The Future

The Future of Bayesian IRT Modeling

• Model checking and model comparison.

• Multilevel modeling. Effective way ofcombining similar sources of data.

• New book by Jean-Paul Fox on Bayesian IRT(with software).

• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.

Some History MCMC IRT Fitting Model Checking The Future

The Future of Bayesian IRT Modeling

• Model checking and model comparison.

• Multilevel modeling. Effective way ofcombining similar sources of data.

• New book by Jean-Paul Fox on Bayesian IRT(with software).

• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.

Some History MCMC IRT Fitting Model Checking The Future

The Future of Bayesian IRT Modeling

• Model checking and model comparison.

• Multilevel modeling. Effective way ofcombining similar sources of data.

• New book by Jean-Paul Fox on Bayesian IRT(with software).

• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.

Some History MCMC IRT Fitting Model Checking The Future

The Future of Bayesian IRT Modeling

• Model checking and model comparison.

• Multilevel modeling. Effective way ofcombining similar sources of data.

• New book by Jean-Paul Fox on Bayesian IRT(with software).

• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.