bayesian irt modeling - universiteit twente 25ste irt workshop/albert.pdf · bayesian irt modeling...
TRANSCRIPT
Some History MCMC IRT Fitting Model Checking The Future
Bayesian IRT Modeling
Jim Albert
Bowling Green State University
October 12, 2009
Some History MCMC IRT Fitting Model Checking The Future
Outline
Some History
MCMC IRT Fitting
Model Checking
The Future
Some History MCMC IRT Fitting Model Checking The Future
18th IRT Workshop (7 years ago)
• Why Bayes?
• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?
• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
18th IRT Workshop (7 years ago)
• Why Bayes?
• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?
• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
18th IRT Workshop (7 years ago)
• Why Bayes?
• Ease of fitting IRT models by MCMC.
• Future of Bayes IRT?• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
18th IRT Workshop (7 years ago)
• Why Bayes?
• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?
• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
18th IRT Workshop (7 years ago)
• Why Bayes?
• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?
• Software to simply MCMC fitting
• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
18th IRT Workshop (7 years ago)
• Why Bayes?
• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?
• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling
• Development in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
18th IRT Workshop (7 years ago)
• Why Bayes?
• Ease of fitting IRT models by MCMC.• Future of Bayes IRT?
• Software to simply MCMC fitting• Developments in hierarchical/multilevel modeling• Development in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
What’s Happened in 7 Years?
• R – the standard for the development ofstatistical software
• R packages are available for fitting BayesianIRT models
• Developments in Bayesian IRT modeling
• Developments in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
What’s Happened in 7 Years?
• R – the standard for the development ofstatistical software
• R packages are available for fitting BayesianIRT models
• Developments in Bayesian IRT modeling
• Developments in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
What’s Happened in 7 Years?
• R – the standard for the development ofstatistical software
• R packages are available for fitting BayesianIRT models
• Developments in Bayesian IRT modeling
• Developments in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
What’s Happened in 7 Years?
• R – the standard for the development ofstatistical software
• R packages are available for fitting BayesianIRT models
• Developments in Bayesian IRT modeling
• Developments in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
What’s Happened in 7 Years?
• R – the standard for the development ofstatistical software
• R packages are available for fitting BayesianIRT models
• Developments in Bayesian IRT modeling
• Developments in Bayesian model checking
Some History MCMC IRT Fitting Model Checking The Future
Bayesian IRT Literature
• Bayesian modeling by MCMC
• Data augmentation and Gibbs sampling(Albert, Patz and Junker)
• Provides a framework for more sophisticatedBayesian modeling.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian IRT Literature
• Bayesian modeling by MCMC
• Data augmentation and Gibbs sampling(Albert, Patz and Junker)
• Provides a framework for more sophisticatedBayesian modeling.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian IRT Literature
• Bayesian modeling by MCMC
• Data augmentation and Gibbs sampling(Albert, Patz and Junker)
• Provides a framework for more sophisticatedBayesian modeling.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian IRT Literature
• Bayesian modeling by MCMC
• Data augmentation and Gibbs sampling(Albert, Patz and Junker)
• Provides a framework for more sophisticatedBayesian modeling.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Multilevel Modeling
• Modeling of ability parameters (Fox and Glas).
• Combining IRT models across groups.
• Modeling clustering patterns such as testscomposed of testlets.
• Natural to think of multilevel models from aBayesian perspective.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Multilevel Modeling
• Modeling of ability parameters (Fox and Glas).
• Combining IRT models across groups.
• Modeling clustering patterns such as testscomposed of testlets.
• Natural to think of multilevel models from aBayesian perspective.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Multilevel Modeling
• Modeling of ability parameters (Fox and Glas).
• Combining IRT models across groups.
• Modeling clustering patterns such as testscomposed of testlets.
• Natural to think of multilevel models from aBayesian perspective.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Multilevel Modeling
• Modeling of ability parameters (Fox and Glas).
• Combining IRT models across groups.
• Modeling clustering patterns such as testscomposed of testlets.
• Natural to think of multilevel models from aBayesian perspective.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Multilevel Modeling
• Modeling of ability parameters (Fox and Glas).
• Combining IRT models across groups.
• Modeling clustering patterns such as testscomposed of testlets.
• Natural to think of multilevel models from aBayesian perspective.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Modeling Checking
• Posterior predictive checking (Sinharray)
• Comparing IRT models by Bayes factors?
• Seems like there is a need for further work inthis area.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Modeling Checking
• Posterior predictive checking (Sinharray)
• Comparing IRT models by Bayes factors?
• Seems like there is a need for further work inthis area.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Modeling Checking
• Posterior predictive checking (Sinharray)
• Comparing IRT models by Bayes factors?
• Seems like there is a need for further work inthis area.
Some History MCMC IRT Fitting Model Checking The Future
Bayesian Modeling Checking
• Posterior predictive checking (Sinharray)
• Comparing IRT models by Bayes factors?
• Seems like there is a need for further work inthis area.
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Popular?
• Ayala, The Theory and Practice of ItemResponse Theory, 2009.
• Mentions MCMC as a footnote on page 97.
• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”
• “However, MCMC implementation software isnot as user-friendly”
• “Given this book’s orientation, MCMCmethods are not covered”
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Popular?
• Ayala, The Theory and Practice of ItemResponse Theory, 2009.
• Mentions MCMC as a footnote on page 97.
• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”
• “However, MCMC implementation software isnot as user-friendly”
• “Given this book’s orientation, MCMCmethods are not covered”
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Popular?
• Ayala, The Theory and Practice of ItemResponse Theory, 2009.
• Mentions MCMC as a footnote on page 97.
• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”
• “However, MCMC implementation software isnot as user-friendly”
• “Given this book’s orientation, MCMCmethods are not covered”
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Popular?
• Ayala, The Theory and Practice of ItemResponse Theory, 2009.
• Mentions MCMC as a footnote on page 97.
• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”
• “However, MCMC implementation software isnot as user-friendly”
• “Given this book’s orientation, MCMCmethods are not covered”
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Popular?
• Ayala, The Theory and Practice of ItemResponse Theory, 2009.
• Mentions MCMC as a footnote on page 97.
• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”
• “However, MCMC implementation software isnot as user-friendly”
• “Given this book’s orientation, MCMCmethods are not covered”
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Popular?
• Ayala, The Theory and Practice of ItemResponse Theory, 2009.
• Mentions MCMC as a footnote on page 97.
• “Estimation of models ... is comparativelyeasier with MCMC than with either JMLE ormarginal mle’s”
• “However, MCMC implementation software isnot as user-friendly”
• “Given this book’s orientation, MCMCmethods are not covered”
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Dangerous?
• WinBUGS software User Manual states in UserManual:
• “Beware: MCMC sampling can be dangerous!”
• Some truth to this.
• Have to think about starting values, monitorconvergence, etc.
• But MCMC fitting works well for manyapplications.
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Dangerous?
• WinBUGS software User Manual states in UserManual:
• “Beware: MCMC sampling can be dangerous!”
• Some truth to this.
• Have to think about starting values, monitorconvergence, etc.
• But MCMC fitting works well for manyapplications.
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Dangerous?
• WinBUGS software User Manual states in UserManual:
• “Beware: MCMC sampling can be dangerous!”
• Some truth to this.
• Have to think about starting values, monitorconvergence, etc.
• But MCMC fitting works well for manyapplications.
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Dangerous?
• WinBUGS software User Manual states in UserManual:
• “Beware: MCMC sampling can be dangerous!”
• Some truth to this.
• Have to think about starting values, monitorconvergence, etc.
• But MCMC fitting works well for manyapplications.
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Dangerous?
• WinBUGS software User Manual states in UserManual:
• “Beware: MCMC sampling can be dangerous!”
• Some truth to this.
• Have to think about starting values, monitorconvergence, etc.
• But MCMC fitting works well for manyapplications.
Some History MCMC IRT Fitting Model Checking The Future
Is Bayesian MCMC Dangerous?
• WinBUGS software User Manual states in UserManual:
• “Beware: MCMC sampling can be dangerous!”
• Some truth to this.
• Have to think about starting values, monitorconvergence, etc.
• But MCMC fitting works well for manyapplications.
Some History MCMC IRT Fitting Model Checking The Future
Example: BGSU Math Placement Test
• Most freshmen take a math placement test.
• Focus on Form B of the test (students havehad two years of high school algebra).
• Use test results together with other info (ACTmath score, high school GPA) to determineproper placement.
• Procedure seems to be generally effective inplacing students in the “right” math course.
Some History MCMC IRT Fitting Model Checking The Future
Example: BGSU Math Placement Test
• Most freshmen take a math placement test.
• Focus on Form B of the test (students havehad two years of high school algebra).
• Use test results together with other info (ACTmath score, high school GPA) to determineproper placement.
• Procedure seems to be generally effective inplacing students in the “right” math course.
Some History MCMC IRT Fitting Model Checking The Future
Example: BGSU Math Placement Test
• Most freshmen take a math placement test.
• Focus on Form B of the test (students havehad two years of high school algebra).
• Use test results together with other info (ACTmath score, high school GPA) to determineproper placement.
• Procedure seems to be generally effective inplacing students in the “right” math course.
Some History MCMC IRT Fitting Model Checking The Future
Example: BGSU Math Placement Test
• Most freshmen take a math placement test.
• Focus on Form B of the test (students havehad two years of high school algebra).
• Use test results together with other info (ACTmath score, high school GPA) to determineproper placement.
• Procedure seems to be generally effective inplacing students in the “right” math course.
Some History MCMC IRT Fitting Model Checking The Future
Questions
• What factors are most helpful in predictingstudent success in math courses?
• Can we improve the math placement test?
• Are there redundant questions that can beremoved?
• Are there particular questions that are helpfulin discriminating between good and poorstudents?
Some History MCMC IRT Fitting Model Checking The Future
Questions
• What factors are most helpful in predictingstudent success in math courses?
• Can we improve the math placement test?
• Are there redundant questions that can beremoved?
• Are there particular questions that are helpfulin discriminating between good and poorstudents?
Some History MCMC IRT Fitting Model Checking The Future
Questions
• What factors are most helpful in predictingstudent success in math courses?
• Can we improve the math placement test?
• Are there redundant questions that can beremoved?
• Are there particular questions that are helpfulin discriminating between good and poorstudents?
Some History MCMC IRT Fitting Model Checking The Future
Questions
• What factors are most helpful in predictingstudent success in math courses?
• Can we improve the math placement test?
• Are there redundant questions that can beremoved?
• Are there particular questions that are helpfulin discriminating between good and poorstudents?
Some History MCMC IRT Fitting Model Checking The Future
General Two-Parameter IRT Model
• Have n students and k items, observe datamatrix {yij}.
• The probability that ith student gets jthquestion correct is given by
pij = Φ (βjθi − αj) .
• Likelihood function of all unknowns is
L =∏
i
∏j
pyij
ij (1 − pij)1−yij .
Some History MCMC IRT Fitting Model Checking The Future
General Two-Parameter IRT Model
• Have n students and k items, observe datamatrix {yij}.
• The probability that ith student gets jthquestion correct is given by
pij = Φ (βjθi − αj) .
• Likelihood function of all unknowns is
L =∏
i
∏j
pyij
ij (1 − pij)1−yij .
Some History MCMC IRT Fitting Model Checking The Future
General Two-Parameter IRT Model
• Have n students and k items, observe datamatrix {yij}.
• The probability that ith student gets jthquestion correct is given by
pij = Φ (βjθi − αj) .
• Likelihood function of all unknowns is
L =∏
i
∏j
pyij
ij (1 − pij)1−yij .
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:
• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:• data matrix y
• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:• data matrix y• parameters of priors placed on item parameters
• aspects of MCMC fitting• starting values• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting
• starting values• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values
• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
MCMCpack package in R
• Performs MCMC fitting for a variety ofcommon Bayesian models.
• Function MCMCirt1d implements Gibbssampling for the 2-parameter model
• Inputs:• data matrix y• parameters of priors placed on item parameters• aspects of MCMC fitting• starting values• number of MCMC iterations
Some History MCMC IRT Fitting Model Checking The Future
What Priors Does it Allow?
• θ1, ..., θN N(0, sθ)
• (αj , βj) iid N(µ, Σ)
• Default values sθ = 1, µ = 0, Σ = 2I .
Some History MCMC IRT Fitting Model Checking The Future
What Priors Does it Allow?
• θ1, ..., θN N(0, sθ)
• (αj , βj) iid N(µ, Σ)
• Default values sθ = 1, µ = 0, Σ = 2I .
Some History MCMC IRT Fitting Model Checking The Future
What Priors Does it Allow?
• θ1, ..., θN N(0, sθ)
• (αj , βj) iid N(µ, Σ)
• Default values sθ = 1, µ = 0, Σ = 2I .
Some History MCMC IRT Fitting Model Checking The Future
What Priors Does it Allow?
• θ1, ..., θN N(0, sθ)
• (αj , βj) iid N(µ, Σ)
• Default values sθ = 1, µ = 0, Σ = 2I .
Some History MCMC IRT Fitting Model Checking The Future
Run of this R function
• s=apply(y,1,mean)
• theta.start=(s-mean(s))/sd(s)
• fit=MCMCirt1d(y, store.item = TRUE,store.ability = TRUE, mcmc=10000,burnin=0, theta.start=theta.start)
• (76 seconds on my laptop)
Some History MCMC IRT Fitting Model Checking The Future
Run of this R function
• s=apply(y,1,mean)
• theta.start=(s-mean(s))/sd(s)
• fit=MCMCirt1d(y, store.item = TRUE,store.ability = TRUE, mcmc=10000,burnin=0, theta.start=theta.start)
• (76 seconds on my laptop)
Some History MCMC IRT Fitting Model Checking The Future
Run of this R function
• s=apply(y,1,mean)
• theta.start=(s-mean(s))/sd(s)
• fit=MCMCirt1d(y, store.item = TRUE,store.ability = TRUE, mcmc=10000,burnin=0, theta.start=theta.start)
• (76 seconds on my laptop)
Some History MCMC IRT Fitting Model Checking The Future
Run of this R function
• s=apply(y,1,mean)
• theta.start=(s-mean(s))/sd(s)
• fit=MCMCirt1d(y, store.item = TRUE,store.ability = TRUE, mcmc=10000,burnin=0, theta.start=theta.start)
• (76 seconds on my laptop)
Some History MCMC IRT Fitting Model Checking The Future
MCMC Diagnostics
• Have the simulated draws converged to theposterior distribution?
• Use a R package coda that contains a numberof diagnostic functions.
• Typically, this particular algorithm has rapidconvergence.
Some History MCMC IRT Fitting Model Checking The Future
MCMC Diagnostics
• Have the simulated draws converged to theposterior distribution?
• Use a R package coda that contains a numberof diagnostic functions.
• Typically, this particular algorithm has rapidconvergence.
Some History MCMC IRT Fitting Model Checking The Future
MCMC Diagnostics
• Have the simulated draws converged to theposterior distribution?
• Use a R package coda that contains a numberof diagnostic functions.
• Typically, this particular algorithm has rapidconvergence.
Some History MCMC IRT Fitting Model Checking The Future
MCMC Diagnostics
• Have the simulated draws converged to theposterior distribution?
• Use a R package coda that contains a numberof diagnostic functions.
• Typically, this particular algorithm has rapidconvergence.
Some History MCMC IRT Fitting Model Checking The Future
Some coda Output
0 2000 6000 10000
0.1
0.3
0.5
0.7
Iterations
Trace of beta.Q51
0.1 0.3 0.5 0.7
01
23
45
N = 10000 Bandwidth = 0.01277
Density of beta.Q51
0 2000 6000 10000
0.5
0.7
0.9
1.1
Iterations
Trace of beta.Q52
0.4 0.6 0.8 1.0 1.2
01
23
4
N = 10000 Bandwidth = 0.01538
Density of beta.Q52
Figure: Caption for coda1
Some History MCMC IRT Fitting Model Checking The Future
Inferences?
• Have matrix of simulated draws of parameters.
• Say you have particular parameter h(α, β) ofinterest.
• Apply this function to the simulated matrix –get sample from marginal posterior of h.
• In particular, can estimate item response curveF (βjθ − αj).
Some History MCMC IRT Fitting Model Checking The Future
Inferences?
• Have matrix of simulated draws of parameters.
• Say you have particular parameter h(α, β) ofinterest.
• Apply this function to the simulated matrix –get sample from marginal posterior of h.
• In particular, can estimate item response curveF (βjθ − αj).
Some History MCMC IRT Fitting Model Checking The Future
Inferences?
• Have matrix of simulated draws of parameters.
• Say you have particular parameter h(α, β) ofinterest.
• Apply this function to the simulated matrix –get sample from marginal posterior of h.
• In particular, can estimate item response curveF (βjθ − αj).
Some History MCMC IRT Fitting Model Checking The Future
Inferences?
• Have matrix of simulated draws of parameters.
• Say you have particular parameter h(α, β) ofinterest.
• Apply this function to the simulated matrix –get sample from marginal posterior of h.
• In particular, can estimate item response curveF (βjθ − αj).
Some History MCMC IRT Fitting Model Checking The Future
Inferences?
• Have matrix of simulated draws of parameters.
• Say you have particular parameter h(α, β) ofinterest.
• Apply this function to the simulated matrix –get sample from marginal posterior of h.
• In particular, can estimate item response curveF (βjθ − αj).
Some History MCMC IRT Fitting Model Checking The Future
Posterior Means of Difficulty andDiscrimination
−1.5 −1.0 −0.5 0.0 0.5
0.2
0.4
0.6
0.8
1.0
DIFFICULTY
DIS
CR
IMIN
AT
ION
Q51
Q52
Q53
Q54
Q55Q56
Q57
Q58
Q59
Q60
Q61Q62 Q63
Q64
Q65
Q66
Q67
Q68
Q69
Q70
Q71
Q72
Q73
Q74
Q75
Q76
Q77
Q78Q79
Q80
Q81
Q82
Q83
Q84
Q85
Figure: Caption for difficulty.discrimination
Some History MCMC IRT Fitting Model Checking The Future
Some Fitted Item Response Curves
−3 −2 −1 0 1 2 3
−0.2
0.2
0.6
1.0
QUESTION 53
ABILITY
PR
OB
AB
ILIT
Y
−3 −2 −1 0 1 2 3
−0.2
0.2
0.6
1.0
QUESTION 79
ABILITY
PR
OB
AB
ILIT
Y
−3 −2 −1 0 1 2 3
−0.2
0.2
0.6
1.0
QUESTION 66
ABILITY
PR
OB
AB
ILIT
Y
−3 −2 −1 0 1 2 3
−0.2
0.2
0.6
1.0
QUESTION 69
ABILITY
PR
OB
AB
ILIT
Y
Figure: Caption for irtplots
Some History MCMC IRT Fitting Model Checking The Future
Two Questions
• Q 53, Low Difficulty, Low DiscriminationQUESTION: If n × n × n = 30, which of thefollowing best approximates n?
• Q 66, High Difficulty, Low DiscriminationQUESTION: If x < 2, then |2 − x | =
Some History MCMC IRT Fitting Model Checking The Future
Two Questions
• Q 69, Moderate Difficulty, High DiscriminationQUESTION: The length L of a spring is givenby L = 5
6F + 17, where F is the applied force.What force F will produce a length L of 31?
• Q 79, High Difficulty, Moderate DiscriminationQUESTION: In the system of equations{2x + 6y = 5, x − 3y = 8}, find a solution x .
Some History MCMC IRT Fitting Model Checking The Future
Checking the Model
• General idea: is the observed data consistentwith predictions from the model?
• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.
• Simulate draws from the posterior predictivedistribution T (y ∗).
• Compute the probability that T (y) is at leastas extreme as T (y obs).
• If this probability is small, casts doubt on themodel.
Some History MCMC IRT Fitting Model Checking The Future
Checking the Model
• General idea: is the observed data consistentwith predictions from the model?
• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.
• Simulate draws from the posterior predictivedistribution T (y ∗).
• Compute the probability that T (y) is at leastas extreme as T (y obs).
• If this probability is small, casts doubt on themodel.
Some History MCMC IRT Fitting Model Checking The Future
Checking the Model
• General idea: is the observed data consistentwith predictions from the model?
• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.
• Simulate draws from the posterior predictivedistribution T (y ∗).
• Compute the probability that T (y) is at leastas extreme as T (y obs).
• If this probability is small, casts doubt on themodel.
Some History MCMC IRT Fitting Model Checking The Future
Checking the Model
• General idea: is the observed data consistentwith predictions from the model?
• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.
• Simulate draws from the posterior predictivedistribution T (y ∗).
• Compute the probability that T (y) is at leastas extreme as T (y obs).
• If this probability is small, casts doubt on themodel.
Some History MCMC IRT Fitting Model Checking The Future
Checking the Model
• General idea: is the observed data consistentwith predictions from the model?
• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.
• Simulate draws from the posterior predictivedistribution T (y ∗).
• Compute the probability that T (y) is at leastas extreme as T (y obs).
• If this probability is small, casts doubt on themodel.
Some History MCMC IRT Fitting Model Checking The Future
Checking the Model
• General idea: is the observed data consistentwith predictions from the model?
• Construct a discrepancy function T (y obs) thatmeasures a particular aspect of the model fitthat you’d like to check.
• Simulate draws from the posterior predictivedistribution T (y ∗).
• Compute the probability that T (y) is at leastas extreme as T (y obs).
• If this probability is small, casts doubt on themodel.
Some History MCMC IRT Fitting Model Checking The Future
Posterior Predictive Checks on R
• Have matrix of simulated draws of item andability parameters from posterior.
• From a simulated draws of θ, α, β, simulateprobability matrix p.
• Simulate future data y ∗ from Bernoullidistributions with probabilities p.
• Compute discrepancy function T (y ∗).
• Repeat process – get sample {T (y ∗)}.
Some History MCMC IRT Fitting Model Checking The Future
Posterior Predictive Checks on R
• Have matrix of simulated draws of item andability parameters from posterior.
• From a simulated draws of θ, α, β, simulateprobability matrix p.
• Simulate future data y ∗ from Bernoullidistributions with probabilities p.
• Compute discrepancy function T (y ∗).
• Repeat process – get sample {T (y ∗)}.
Some History MCMC IRT Fitting Model Checking The Future
Posterior Predictive Checks on R
• Have matrix of simulated draws of item andability parameters from posterior.
• From a simulated draws of θ, α, β, simulateprobability matrix p.
• Simulate future data y ∗ from Bernoullidistributions with probabilities p.
• Compute discrepancy function T (y ∗).
• Repeat process – get sample {T (y ∗)}.
Some History MCMC IRT Fitting Model Checking The Future
Posterior Predictive Checks on R
• Have matrix of simulated draws of item andability parameters from posterior.
• From a simulated draws of θ, α, β, simulateprobability matrix p.
• Simulate future data y ∗ from Bernoullidistributions with probabilities p.
• Compute discrepancy function T (y ∗).
• Repeat process – get sample {T (y ∗)}.
Some History MCMC IRT Fitting Model Checking The Future
Posterior Predictive Checks on R
• Have matrix of simulated draws of item andability parameters from posterior.
• From a simulated draws of θ, α, β, simulateprobability matrix p.
• Simulate future data y ∗ from Bernoullidistributions with probabilities p.
• Compute discrepancy function T (y ∗).
• Repeat process – get sample {T (y ∗)}.
Some History MCMC IRT Fitting Model Checking The Future
Posterior Predictive Checks on R
• Have matrix of simulated draws of item andability parameters from posterior.
• From a simulated draws of θ, α, β, simulateprobability matrix p.
• Simulate future data y ∗ from Bernoullidistributions with probabilities p.
• Compute discrepancy function T (y ∗).
• Repeat process – get sample {T (y ∗)}.
Some History MCMC IRT Fitting Model Checking The Future
Check Conditional IndependenceAssumption
• Given θ, assume responses are independent.
• Suggests using item correlation matrix as adiscrepancy function.
• Use our method to single out pairs of itemsthat have unusually small or large correlations.
• Use criterion Tail Probability < 0.02.
Some History MCMC IRT Fitting Model Checking The Future
Check Conditional IndependenceAssumption
• Given θ, assume responses are independent.
• Suggests using item correlation matrix as adiscrepancy function.
• Use our method to single out pairs of itemsthat have unusually small or large correlations.
• Use criterion Tail Probability < 0.02.
Some History MCMC IRT Fitting Model Checking The Future
Check Conditional IndependenceAssumption
• Given θ, assume responses are independent.
• Suggests using item correlation matrix as adiscrepancy function.
• Use our method to single out pairs of itemsthat have unusually small or large correlations.
• Use criterion Tail Probability < 0.02.
Some History MCMC IRT Fitting Model Checking The Future
Check Conditional IndependenceAssumption
• Given θ, assume responses are independent.
• Suggests using item correlation matrix as adiscrepancy function.
• Use our method to single out pairs of itemsthat have unusually small or large correlations.
• Use criterion Tail Probability < 0.02.
Some History MCMC IRT Fitting Model Checking The Future
Check Conditional IndependenceAssumption
• Given θ, assume responses are independent.
• Suggests using item correlation matrix as adiscrepancy function.
• Use our method to single out pairs of itemsthat have unusually small or large correlations.
• Use criterion Tail Probability < 0.02.
Some History MCMC IRT Fitting Model Checking The Future
Find Questions Pairs that are Similar orDissimilar
• Question Q83 that asks for the value of k whenf (x) = x2 − kx − 3, f (2) = 0, similar toquestion Q82 that asks for one of the solutionsof the equation x2 + 4x = −13.
• Question Q83 that solves a equation isdifferent from question Q53 that has thestudent choose the best approximation to nwhen n × n × n = 30.
Some History MCMC IRT Fitting Model Checking The Future
Find Questions Pairs that are Similar orDissimilar
• Question Q83 that asks for the value of k whenf (x) = x2 − kx − 3, f (2) = 0, similar toquestion Q82 that asks for one of the solutionsof the equation x2 + 4x = −13.
• Question Q83 that solves a equation isdifferent from question Q53 that has thestudent choose the best approximation to nwhen n × n × n = 30.
Some History MCMC IRT Fitting Model Checking The Future
Find Questions Pairs that are Similar orDissimilar
• Question Q83 that asks for the value of k whenf (x) = x2 − kx − 3, f (2) = 0, similar toquestion Q82 that asks for one of the solutionsof the equation x2 + 4x = −13.
• Question Q83 that solves a equation isdifferent from question Q53 that has thestudent choose the best approximation to nwhen n × n × n = 30.
Some History MCMC IRT Fitting Model Checking The Future
Are Discrimination Parameter EstimatesSuitable?
• An estimate of a item’s discrimination ability isby the item total correlation.
• Measure of variability of item total correlationis the standard deviation of these correlations.
• Use this standard deviation as a discrepancyfunction.
• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.
• This suggests our discrimination parameterestimates are too disperse.
Some History MCMC IRT Fitting Model Checking The Future
Are Discrimination Parameter EstimatesSuitable?
• An estimate of a item’s discrimination ability isby the item total correlation.
• Measure of variability of item total correlationis the standard deviation of these correlations.
• Use this standard deviation as a discrepancyfunction.
• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.
• This suggests our discrimination parameterestimates are too disperse.
Some History MCMC IRT Fitting Model Checking The Future
Are Discrimination Parameter EstimatesSuitable?
• An estimate of a item’s discrimination ability isby the item total correlation.
• Measure of variability of item total correlationis the standard deviation of these correlations.
• Use this standard deviation as a discrepancyfunction.
• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.
• This suggests our discrimination parameterestimates are too disperse.
Some History MCMC IRT Fitting Model Checking The Future
Are Discrimination Parameter EstimatesSuitable?
• An estimate of a item’s discrimination ability isby the item total correlation.
• Measure of variability of item total correlationis the standard deviation of these correlations.
• Use this standard deviation as a discrepancyfunction.
• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.
• This suggests our discrimination parameterestimates are too disperse.
Some History MCMC IRT Fitting Model Checking The Future
Are Discrimination Parameter EstimatesSuitable?
• An estimate of a item’s discrimination ability isby the item total correlation.
• Measure of variability of item total correlationis the standard deviation of these correlations.
• Use this standard deviation as a discrepancyfunction.
• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.
• This suggests our discrimination parameterestimates are too disperse.
Some History MCMC IRT Fitting Model Checking The Future
Are Discrimination Parameter EstimatesSuitable?
• An estimate of a item’s discrimination ability isby the item total correlation.
• Measure of variability of item total correlationis the standard deviation of these correlations.
• Use this standard deviation as a discrepancyfunction.
• We find that the item total correlationspredicted from the model are more dispersethan those from the observed data.
• This suggests our discrimination parameterestimates are too disperse.
Some History MCMC IRT Fitting Model Checking The Future
Histogram of the Posterior PredictiveDistribution of the Standard Deviation of
the Item Total Correlations
Standard Deviation of Item Score Correlations
Fre
quency
0.08 0.09 0.10 0.11 0.12 0.13
050
100
150
200
Observed
tail−prob = 0.082
Figure: Caption for histogram
Some History MCMC IRT Fitting Model Checking The Future
1 or 2 Parameter Model?
• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?
• Actually I think both models are incorrect.
• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.
• Problem is to detect clusters.
Some History MCMC IRT Fitting Model Checking The Future
1 or 2 Parameter Model?
• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?
• Actually I think both models are incorrect.
• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.
• Problem is to detect clusters.
Some History MCMC IRT Fitting Model Checking The Future
1 or 2 Parameter Model?
• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?
• Actually I think both models are incorrect.
• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.
• Problem is to detect clusters.
Some History MCMC IRT Fitting Model Checking The Future
1 or 2 Parameter Model?
• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?
• Actually I think both models are incorrect.
• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.
• Problem is to detect clusters.
Some History MCMC IRT Fitting Model Checking The Future
1 or 2 Parameter Model?
• Do items have equal discrimination ability (oneparameter model) or different discriminations(two parameter model)?
• Actually I think both models are incorrect.
• I believe there are several clusters of questionsand questions within each clusters have samediscrimination.
• Problem is to detect clusters.
Some History MCMC IRT Fitting Model Checking The Future
Multilevel Model
• Assume discrimination parameters β1, ..., βk arerandom sample from a common distributiong(β).
• If g is normal, posterior estimates of βj willshrink towards a common value.
• If instead you choose a Cauchy density for g ,get more adaptive shrinkage. (Seems moreappropriate here.)
Some History MCMC IRT Fitting Model Checking The Future
Multilevel Model
• Assume discrimination parameters β1, ..., βk arerandom sample from a common distributiong(β).
• If g is normal, posterior estimates of βj willshrink towards a common value.
• If instead you choose a Cauchy density for g ,get more adaptive shrinkage. (Seems moreappropriate here.)
Some History MCMC IRT Fitting Model Checking The Future
Multilevel Model
• Assume discrimination parameters β1, ..., βk arerandom sample from a common distributiong(β).
• If g is normal, posterior estimates of βj willshrink towards a common value.
• If instead you choose a Cauchy density for g ,get more adaptive shrinkage. (Seems moreappropriate here.)
Some History MCMC IRT Fitting Model Checking The Future
Multilevel Model
• Assume discrimination parameters β1, ..., βk arerandom sample from a common distributiong(β).
• If g is normal, posterior estimates of βj willshrink towards a common value.
• If instead you choose a Cauchy density for g ,get more adaptive shrinkage. (Seems moreappropriate here.)
Some History MCMC IRT Fitting Model Checking The Future
Multilevel Modeling with a Cauchy Density
0.2 0.4 0.6 0.8 1.0
ESTIMATE
INDIVIDUAL
EXCHANGEABLE
Figure: Caption for cauchy.estimates
Some History MCMC IRT Fitting Model Checking The Future
Math Placement Data – Further Analysis
• Multilevel model where θi are modeled in termsof student covariates (gender, ACT math,GPA).
• Combine modeling across different groups ofstudents, where “groups” are defined in termsof entering course, form of test, year, etc.
• Test can be grouped by different sections(Arithmetic, Fractions, Graphing, Advanced,Solving) and this motives a “testlet” modelwhere responses to questions within eachtestlet are assumed positively correlated.
Some History MCMC IRT Fitting Model Checking The Future
Math Placement Data – Further Analysis
• Multilevel model where θi are modeled in termsof student covariates (gender, ACT math,GPA).
• Combine modeling across different groups ofstudents, where “groups” are defined in termsof entering course, form of test, year, etc.
• Test can be grouped by different sections(Arithmetic, Fractions, Graphing, Advanced,Solving) and this motives a “testlet” modelwhere responses to questions within eachtestlet are assumed positively correlated.
Some History MCMC IRT Fitting Model Checking The Future
Math Placement Data – Further Analysis
• Multilevel model where θi are modeled in termsof student covariates (gender, ACT math,GPA).
• Combine modeling across different groups ofstudents, where “groups” are defined in termsof entering course, form of test, year, etc.
• Test can be grouped by different sections(Arithmetic, Fractions, Graphing, Advanced,Solving) and this motives a “testlet” modelwhere responses to questions within eachtestlet are assumed positively correlated.
Some History MCMC IRT Fitting Model Checking The Future
Math Placement Data – Further Analysis
• Multilevel model where θi are modeled in termsof student covariates (gender, ACT math,GPA).
• Combine modeling across different groups ofstudents, where “groups” are defined in termsof entering course, form of test, year, etc.
• Test can be grouped by different sections(Arithmetic, Fractions, Graphing, Advanced,Solving) and this motives a “testlet” modelwhere responses to questions within eachtestlet are assumed positively correlated.
Some History MCMC IRT Fitting Model Checking The Future
The Future of Bayesian IRT Modeling
• Model checking and model comparison.
• Multilevel modeling. Effective way ofcombining similar sources of data.
• New book by Jean-Paul Fox on Bayesian IRT(with software).
• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.
Some History MCMC IRT Fitting Model Checking The Future
The Future of Bayesian IRT Modeling
• Model checking and model comparison.
• Multilevel modeling. Effective way ofcombining similar sources of data.
• New book by Jean-Paul Fox on Bayesian IRT(with software).
• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.
Some History MCMC IRT Fitting Model Checking The Future
The Future of Bayesian IRT Modeling
• Model checking and model comparison.
• Multilevel modeling. Effective way ofcombining similar sources of data.
• New book by Jean-Paul Fox on Bayesian IRT(with software).
• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.
Some History MCMC IRT Fitting Model Checking The Future
The Future of Bayesian IRT Modeling
• Model checking and model comparison.
• Multilevel modeling. Effective way ofcombining similar sources of data.
• New book by Jean-Paul Fox on Bayesian IRT(with software).
• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.
Some History MCMC IRT Fitting Model Checking The Future
The Future of Bayesian IRT Modeling
• Model checking and model comparison.
• Multilevel modeling. Effective way ofcombining similar sources of data.
• New book by Jean-Paul Fox on Bayesian IRT(with software).
• Break down the “wall of resistance” in usingBayesian methods in fitting and checking IRTmodels.