bengt muthen & tihomir asparouhov´ mplus

30
BSEM 2.0 Bengt Muth´ en & Tihomir Asparouhov Mplus www.statmodel.com Presentation at the Mplus Users’ Meeting Utrecht, January 13, 2016 Bengt Muth´ en & Tihomir Asparouhov Mplus Modeling 1/ 30

Upload: others

Post on 01-Nov-2021

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Bengt Muthen & Tihomir Asparouhov´ Mplus

BSEM 2.0

Bengt Muthen & Tihomir Asparouhov

Mpluswww.statmodel.com

Presentation at the Mplus Users’ MeetingUtrecht, January 13, 2016

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 1/ 30

Page 2: Bengt Muthen & Tihomir Asparouhov´ Mplus

Overview

How to make the case for Bayes:

Non-informative priorsRegression analysis with missing on x’sMediation analysis with missing on mediator and x’s

Informative priorsBSEMTime-series factor analysis

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 2/ 30

Page 3: Bengt Muthen & Tihomir Asparouhov´ Mplus

1. Bayes’ Advantage Over ML: Non-Informative Priors

Using Bayes with non-informative priors as a computational device toobtain results that are essentially the same as ML if ML could havebeen used:

The example of missing data on covariates

Regression analysis

Mediation analysis

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 3/ 30

Page 4: Bengt Muthen & Tihomir Asparouhov´ Mplus

Regressing y On x: Bringing x’s Into The Model

ML estimation maximizes the log likelihood for the bivariatedistribution of y and x expressed as,

logL = ∑i

log[yi,xi] =n1

∑i=1

log[yi | xi]+n1+n2

∑i=1

log[xi]+n2+n3

∑i=n2+1

log[yi].

(1)

Figure : Missing data patterns. White areas represent missing data

x y

n

n

n

1

2

3

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 4/ 30

Page 5: Bengt Muthen & Tihomir Asparouhov´ Mplus

Example: Monte Carlo Simulation Study

Linear regression with 40% missing on x1 - x4; no missing on yx3 and x4 s are binary split 86/16MAR holds as a function of the covariate z with no missingn = 200Comparison of Bayes and ML

z

x1

x2

x3

x4

y

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 5/ 30

Page 6: Bengt Muthen & Tihomir Asparouhov´ Mplus

Bayes Treating Binary X’s As Binary

DATA: FILE = MARn200replist.dat;TYPE = MONTECARLO;

VARIABLE: NAMES = y x1-x4 z;USEVARIABLES = y x1-z;CATEGORICAL = x3-x4;

DEFINE: IF(z gt .25)THEN x1= MISSING;IF(z gt .25)THEN x2= MISSING;IF(-z gt .25)THEN x3= MISSING;IF(-z gt .25)THEN x4= MISSING;

ANALYSIS: ESTIMATOR = BAYES;PROCESSORS = 2;BITERATIONS = (10000);MEDIATOR = OBSERVED;

MODEL: y ON x1-z*.5;y*1;x1-z WITH x1-z;

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 6/ 30

Page 7: Bengt Muthen & Tihomir Asparouhov´ Mplus

ML Versus Bayes Treating Binary X’s As Binary

Attempting to estimate the same model using ML leads to muchheavier computations due to the need for numerical integrationover several dimensions

Already in this simple model ML requires three dimensions ofintegration, two for the x3, x4 covariates and one for a factorcapturing the association between x3 and x4.

Bayes uses a multivariate probit model that generates correlatedlatent response variables underlying the binary x’s - no need fornumerical integration

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 7/ 30

Page 8: Bengt Muthen & Tihomir Asparouhov´ Mplus

Monte Carlo Simulation Results (500 replications)

S.E. M.S.E. 95% % SigPopulation Average Std. Dev. Average Cover Coeff

MLR with binary x’s treated as normalx1 0.500 0.5087 0.1474 0.1358 0.0218 0.928 0.912x2 0.500 0.5016 0.1453 0.1380 0.0211 0.932 0.910x3 0.500 0.4684 0.4193 0.3440 0.1764 0.904 0.366x4 0.500 0.4970 0.4255 0.3508 0.1807 0.888 0.380

Bayes with binary x’s treated as normalx1 0.500 0.5024 0.1351 0.1366 0.0182 0.968 0.926x2 0.500 0.4966 0.1327 0.1380 0.0176 0.960 0.912x3 0.500 0.4874 0.3518 0.3516 0.1237 0.958 0.300x4 0.500 0.5066 0.3549 0.3519 0.1257 0.962 0.318

Bayes with binary x’s treated as binaryx1 0.500 0.5106 0.1227 0.1214 0.0151 0.946 0.984x2 0.500 0.5069 0.1212 0.1221 0.0147 0.948 0.972x3 0.500 0.4765 0.3363 0.3345 0.1134 0.960 0.300x4 0.500 0.5016 0.3459 0.3354 0.1194 0.950 0.332

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 8/ 30

Page 9: Bengt Muthen & Tihomir Asparouhov´ Mplus

Binary yWith Missing Data On Covariates

Treating all covariates as normal, ML needs 4 dimensions ofintegration for the 4 covariates with missing data

Treating all covariates as normal, Bayes takes 30% of the MLcomputational time

Treating x3,x4 as binary ML needs 5 dimensions of integration

With a categorical y and many covariates with missing data thatare brought into the model Bayes is the only practical alternative

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 9/ 30

Page 10: Bengt Muthen & Tihomir Asparouhov´ Mplus

Example: Mediation Analysis With Missing DataOn The Mediator And The Covariates

Figure : Mediation model for a binary outcome of dropping out of highschool (n=2898)

femalemothedhomeresexpectlunchexpelarrest

droptht7hispblackmath7

math10

hsdrop

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 10/ 30

Page 11: Bengt Muthen & Tihomir Asparouhov´ Mplus

Bayes With Missing Data On The Mediator

CATEGORICAL = hsdrop;ANALYSIS: ESTIMATOR = BAYES;

PROCESSORS = 2;BITERATIONS = (20000);

MODEL: hsdrop ON math10 female-math7;math10 ON female-math7;

MODEL INDIRECT:hsdrop IND math10 math7(61.01 50.88);

OUTPUT: SAMPSTAT PATTERNS TECH1 TECH8 CINTERVAL;PLOT: TYPE = PLOT3;

Indirect and direct effects computed in probability scale usingcounterfactually-based causal effects:Muthen, B. & Asparouhov, T. (2015). Causal effects in mediationmodeling: An introduction with applications to latent variables.Structural Equation Modeling: A Multidisciplinary Journal.

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 11/ 30

Page 12: Bengt Muthen & Tihomir Asparouhov´ Mplus

Bayesian Posterior Distribution Of Indirect EffectFor High School Dropout

-0.0

455

-0.0

435

-0.0

415

-0.0

395

-0.0

375

-0.0

355

-0.0

335

-0.0

315

-0.0

295

-0.0

275

-0.0

255

-0.0

235

-0.0

215

-0.0

195

-0.0

175

-0.0

155

-0.0

135

-0.0

115

-0.0

095

-0.0

075

-0.0

055

-0.0

035

-0.0

015

0.0

005

0.0

025

0.0

045

Indirect effect

0

50

100

150

200

250

300

350

400

450

500

550

600

650

700

750

800

850

900

Count

Mean = -0.01339, Std Dev = 0.00482

Median = -0.01303

Mode = -0.01254

95% Lower CI = -0.02376

95% Upper CI = -0.00503

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 12/ 30

Page 13: Bengt Muthen & Tihomir Asparouhov´ Mplus

Missing On The Mediator: ML Versus Bayes

ML estimates are almost identical to Bayes, but:

ML needs Monte Carlo integration with 250 points because themediator is a partially latent variable due to missing data

ML needs bootstrapping (1,000 draws) to capture CIs for thenon-normal indirect effect

ML takes 21 minutes

Bayes takes 21 seconds

Bayes posterior distribution for the indirect effect is based on20,000 draws as compared to 1,000 bootstraps for ML

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 13/ 30

Page 14: Bengt Muthen & Tihomir Asparouhov´ Mplus

Missing On The Mediator And The CovariatesTreating All Covariates As Normal: ML Versus Bayes

ML requires integration over 10 dimensions

ML needs 2,500 Monte Carlo integration points for sufficientprecision

ML takes 6 hours with 1,000 bootstraps

Bayes takes less than a minute

Bayes posterior based on 20,000 draws as compared to 1,000bootstraps for ML

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 14/ 30

Page 15: Bengt Muthen & Tihomir Asparouhov´ Mplus

Missing On The Mediator And The CovariatesTreating Binary Covariates As Binary: ML Versus Bayes

6 covariates are binary.

ML requires 10 + 15 = 35 dimensions of integration: intractable

Bayes takes 3 minutes for 20,000 draws

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 15/ 30

Page 16: Bengt Muthen & Tihomir Asparouhov´ Mplus

Speed Of Bayes In Mplus

Wang & Preacher (2014). Moderated mediation analysis usingBayesian methods. Structural Equation Modeling.

Comparison of ML (with bootstrap) and Bayes: Similarstatistical performance

Comparison of Bayes using BUGS versus Mplus: Mplus is 15times faster

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 16/ 30

Page 17: Bengt Muthen & Tihomir Asparouhov´ Mplus

2. Bayes’ Advantage Over ML: Informative Priors

Frequentists often object to Bayes using informative priors

But they already do use such priors in many cases in unrealisticways

Bayes can let informative priors reflect prior studies

Bayes can let informative priors identify models that areunidentified by ML which is useful for model modification

Example: CFA

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 17/ 30

Page 18: Bengt Muthen & Tihomir Asparouhov´ Mplus

ML Versus BESEM: CFA Cross-Loadings

ML uses a very strict zero-mean, zero-variance prior

BSEM uses a zero-mean, small-variance prior for the parameter:

EFA <BSEM <CFA

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 18/ 30

Page 19: Bengt Muthen & Tihomir Asparouhov´ Mplus

The Several Uses Of BSEM

Non-identified models in ML made identified in Bayes usingzero-mean, small-variance priors. Produces a Bayes version of“modification indices”.

Single-group analysis (2012 Muthen-Asparouhov article inPsychological Methods):

Cross-loadings in CFADirect effects in MIMICResidual covariances in CFA (2015 Asparouhov-Muthen-Morinarticle in Journal of Management)

Multiple-group analysis:Configural and scalar analysis with cross-loadings and/or residualcovariancesApproximate measurement invariance (Web Note 17)BSEM-based alignment optimization (Web Note 18):

Residual covariancesApproximate measurement invariance

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 19/ 30

Page 20: Bengt Muthen & Tihomir Asparouhov´ Mplus

Multilevel Time-Series Factor Analysis

f ft-1 t

fb

Within

Between

φ

φ

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 20/ 30

Page 21: Bengt Muthen & Tihomir Asparouhov´ Mplus

Time-Series Factor Analysis:DAFS and WNFS Models

f ft-1 t

f ft-1 t

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 21/ 30

Page 22: Bengt Muthen & Tihomir Asparouhov´ Mplus

Time-Series Factor Analysis:Combined DAFS And WNFS Model

f ft-1 t

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 22/ 30

Page 23: Bengt Muthen & Tihomir Asparouhov´ Mplus

Example: Affective InstabilityIn Ecological Momentary Assessment

Jahng S., Wood, P. K.,& Trull, T. J., (2008). Analysis ofAffective Instability in Ecological Momentary Assessment:Indices Using Successive Difference and Group Comparison viaMultilevel Modeling. Psychological Methods, 13, 354-375

An example of the growing amount of EMA data

84 outpatient subjects: 46 meeting borderline personalitydisorder (BPD) and 38 meeting MDD or DYS

Each individual is measured several times a day for 4 weeks fortotal of about 100 assessments

A mood factor for each individual is measured with 21 self-ratedcontinuous items

The research question is if the BPD group demonstrates moretemporal negative mood instability than the MDD/DYS group

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 23/ 30

Page 24: Bengt Muthen & Tihomir Asparouhov´ Mplus

BSEM For The Combined DAFS And WNFS Model

f ft-1 t

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 24/ 30

Page 25: Bengt Muthen & Tihomir Asparouhov´ Mplus

Input for BSEM Of The Combined DAFS-WNFS Model

USEVARIABLES = jittery-scornful group;BETWEEN = group;CLUSTER = id;

DEFINE: group = group-1;ANALYSIS: TYPE = TWOLEVEL;

ESTIMATOR = BAYES;PROCESSORS = 2; THIN = 5; BITERATIONS = (2000);

MODEL: %WITHIN%f BY jittery-scornful*(& 1);f@1;f ON f&1;jittery-scornful ON f&1 (p1-p21);%BETWEEN%fb BY jittery-scornful*;fb ON group; fb@1;

MODELPRIORS: p1-p21∼N(0,0.01);

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 25/ 30

Page 26: Bengt Muthen & Tihomir Asparouhov´ Mplus

BSEM Direct Effects From ft−1 To yt

Items with the largest direct effects:

Upset

Distressed

Angry

Irritable

Effects are negative, indicating that these items have lowerauto-correlation than the rest. The factor auto-correlation thereforegoes up. These direct effects can be freed, but...

Might these items measure a separate factor?

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 26/ 30

Page 27: Bengt Muthen & Tihomir Asparouhov´ Mplus

3-Factor EFA/CFA DAFS

Although time-series ESEM is needed, crude EFA suggests 3 factors:

Angry: Upset, Distressed, Angry, Irritable

Sad: Downhearted, Sad, Blue, Lonely

Afraid: Afraid, Frightened, Scared

3-factor EFA/CFA DAFS factor autocorrelation (single-factorauto-corr = 0.596):

0.536 (Angry), 0.578 (Sad), 0.623 (Afraid).

- to which you could add random effects for the factorauto-correlations to see if they have different variability acrosssubjects.

BSEM can be used again to search for direct effects from ft−1 to yt.

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 27/ 30

Page 28: Bengt Muthen & Tihomir Asparouhov´ Mplus

Extended DAFS Model: Direct Effects From yt−1 To yt

f ft-1 t

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 28/ 30

Page 29: Bengt Muthen & Tihomir Asparouhov´ Mplus

Factor Auto-Correlations With And Without Direct EffectsFrom yt−1 To yt

All but one of the 21 direct effects are significant and positive. Directeffects vary in size.

Table : Factor auto-correlations

Angry Sad Afraid

Without direct effects 0.536 0.587 0.623With direct effects 0.479 0.543 0.597

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 29/ 30

Page 30: Bengt Muthen & Tihomir Asparouhov´ Mplus

How to Learn More About Bayesian Analysis In Mplus:www.statmodel.com

Topic 9 handout and video from the 6/1/11 Mplus session atJohns Hopkins

Part 1 - Part 3 handouts and video from the August 2012 MplusVersion 7 training session at Utrecht University

Bengt Muthen & Tihomir Asparouhov Mplus Modeling 30/ 30