shinyitemanalysis forpsychometrictraining ...shinyitemanalysis.org/docs/slides_rmeetup2018.pdf ·...

38
ShinyItemAnalysis for Psychometric Training and to Enforce Routine Analysis of Educational Tests Patrícia Martinková Dept. of Statistical Modelling, Institute of Computer Science, Czech Academy of Sciences College of Education, Charles University in Prague R meetup Warsaw, May 24, 2018 R meetup Warsaw, 2018 1/35

Upload: others

Post on 19-Apr-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

ShinyItemAnalysis for Psychometric Trainingand to Enforce Routine Analysis of Educational Tests

Patrícia Martinková

Dept. of Statistical Modelling, Institute of Computer Science, Czech Academy of SciencesCollege of Education, Charles University in Prague

R meetup Warsaw, May 24, 2018

R meetup Warsaw, 2018 1/35

Page 2: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Announcement 1: Save the date for Psychoco 2019!

International Workshop on Psychometric Computing

Psychoco 2019

February 21 - 22, 2019Charles University & Czech Academy of Sciences, Prague

www.psychoco.org

Since 2008, the international Psychoco workshops aim at bringing together researchers working on moderntechniques for the analysis of data from psychology and the social sciences (especially in R).

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 2/35

Page 3: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Announcement 2: Job offers

Job offers at Institute of Computer Science:CAS-ICS Postdoctoral position (deadline: August 30)ICS Doctoral position (deadline: June 30)ICS Fellowship for junior researchers (deadline: June 30)... further possibilities to participate on grants

E-mail at [email protected] if interested in position in the area of

Computational psychometricsInterdisciplinary statisticsOther related disciplines

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 3/35

Page 4: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Outline

1. Introduction

2. ShinyItemAnalysis

3. Teaching psychometrics

4. Routine analysis of tests

5. Discussion

Page 5: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Motivation

To teach psychometric concepts and methodsGraduate courses "IRT models", "Selected topics in psychometrics"Workshops for admission test developersActive learning approach w/ hands-on examples

To enforce routine analyses of educational testsAdmission tests to Czech UniversitiesPhysiology concept inventories... tests of various purposes across the world

Promotion of own psychometrics researchDetection of Differential Item Functioning (DIF)

Need for user-friendly and freely available tool

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 4/35

Page 6: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

ShinyItemAnalysis Application

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 5/35

Page 7: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

ShinyItemAnalysis

Interactive (and step by step) analysis of educational tests and their items

Available as:R package

Version 1.2.7 now on CRAN

Newest version on GitHub

startShinyItemAnalysis()

Online shiny applicationICS server in Prague, CZ:

https://shiny.cs.cas.cz/ShinyItemAnalysis/

shinyapps.io:

https://cemp.shinyapps.io/ShinyItemAnalysis/

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 6/35

Page 8: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Authors and contributors

Patrícia Martinková1,2 Adéla Drabinová1,3 Jakub Houdek1,4 Ondřej Leder3 Lubomír Štěpánek4,5

1Department of Statistical Modelling, Institute of Computer Science, Czech Academy of Sciences2College of Education, Charles University, Prague3Department of Probability and Mathematical Statistics, Charles University, Prague4Faculty of Informatics and Statistics, University of Economics, Prague5First Faculty of Medicine, Charles University, Prague

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 7/35

Page 9: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

ShinyItemAnalysis application

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 8/35

Page 10: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

R package ShinyItemAnalysis downloads from CRAN

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 9/35

Page 11: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

ShinyItemAnalysis online app is used worldwide!

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 10/35

Page 12: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

ShinyItemAnalysis for teaching psychometrics

Who do we teach:Graduate students of different fields (Psychometrics NMST570 )

Faculties, university stakeholders

Some helpful features:Toy datasets, allows to upload own data

Building models in a step-by-step way

Models, estimates, interactive interpretation of results

Interactive training and exercises

Provides sample R code

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 11/35

Page 13: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Datasets

Five toy datasets are availableAllows to upload and preview one’s own dataset

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 12/35

Page 14: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Summary of Total Scores

Summary statisticsInteractive histogram

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 13/35

Page 15: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Criterion validity

Ony when criterion variable is available (study success, GPA, etc.)Available for total score as well as for items

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 14/35

Page 16: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Correlation structure

Correlations between itemsItem clusters

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 15/35

Page 17: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Traditional Item Analysis

Difficulty, discriminationCronbach’s alpha w/o item, index RIT, RIR, etc.

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 16/35

Page 18: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Distractor Analysis

Displays option selection percentage by total score groupNumber of groups can be changed

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 17/35

Page 19: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Logistic Regression

Displays probability of correct answer by total scoreParameterization can be changed (Z scores, IRT parameterization)

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 18/35

Page 20: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Nonlinear Regression

Allows for guessing (and inattention)

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 19/35

Page 21: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Multinomial Regression

Allows for joint modeling of distractors

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 20/35

Page 22: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

IRT Models

Conceptualized as nonlinear mixed effect modelsMore precise ability estimation

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 21/35

Page 23: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Dichotomous IRT Models - interactive training

Plots Item Characteristic and Information Curves (ICC and IIC)based on selected parameters

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 22/35

Page 24: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Polytomous IRT Models - interactve training

Plots Category Response Curves and Expected Item Scorebased on selected parameters

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 23/35

Page 25: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Dichotomous IRT Models - check your understanding

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 24/35

Page 26: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Selected R Code

Sample R code may be run and modified in separate R session

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 25/35

Page 27: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

ShinyItemAnalysis to promote our research

App promotes methods and research of our team:

Detection of Differential Item Functioning (DIF)

Detection of Differential Distractor Functioning (DDF)

Why DIF/DDF analysis should be routine part of test development

etc.

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 26/35

Page 28: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Differential Item Functioning (DIF)

DIF: Students from two groups and with the same underlying latentability have different probability of answering the item correctly.

Drabinová & Martinková (2017): Detection of DIF with Non-Linear Regression:Non-IRT Approach Accounting for Guessing. Journal of Educational Measurement,54(4), pp. 498-517. doi 10.1111/jedm.12158

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 27/35

Page 29: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Differential Distractor Functioning (DDF)

DDF: Students from two groups and with the same underlying latentability have different probability of selecting given options.

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 28/35

Page 30: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Why DIF Analysis Should Be Analyzed Routinely?

Dataset HCI: significant difference in total score between males andfemales, yet no DIF item!Simulated GMAT data: total scores may have exactly the samedistribution, yet there may be DIF present in some items!

Martinková, Drabinová, Liaw, Sanders, McFarland & Price (2017): Checking Equity:Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments.CBE-Life Sciences Education, 16(2), rm2. doi 10.1187/cbe.16-10-0307

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 29/35

Page 31: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Routine validation of educational tests

Supporting tool for routine validation of educational tests:Upload your own dataGenerate PDF/HTML reportLocal or online version

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 30/35

Page 32: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Report generation - settings

Chose methods, customize settingsChose report format (PDF/HTML)

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 31/35

Page 33: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Report generation

Generate report (run analyses)Download report (compile text into HTML/PDF)

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 32/35

Page 34: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Sample PDF report

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 33/35

Page 35: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Report generation workflow

shinyenvironment

rmarkdown knitr

LATEXPDFreport

HTMLreport

shiny provides a user interfacermarkdown for creating templates for PDF/HTML report generationknitr for compiling R markdown syntax into HTML/PDFTEX for creating PDF reports (latest distribution of LATEX is needed)

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 34/35

Page 36: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Introduction ShinyItemAnalysis Teaching psychometrics Routine analysis of tests Discussion

Conclusion and Discussion

ShinyItemAnalysis is an R package and online application forinteractive and step-by-step analysis of educational tests. It is useful for:

TEACHING of psychometrics and educational measurementoffers example datasets, upload of new datasetsvisualization, interpretation of resultssample R Code

ROUTINE VALIDATION OF EDUCATIONAL TESTSgenerates extensive reports for supplied data

ShinyItemAnalysis also promotes our RESEARCH in DIF/DDFdetection

https://shiny.cs.cas.cz/ShinyItemAnalysis/

[email protected]

Patrícia Martinková ShinyItemAnalysis for Psychometric Training and Test Validation R meetup Warsaw, 2018 35/35

Page 37: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

Thank you for your attention!www.cs.cas.cz/martinkova

[email protected]

This research was supported by Czech Science Foundation under grant GJ15-15856Y

Page 38: ShinyItemAnalysis forPsychometricTraining ...shinyitemanalysis.org/docs/slides_RMeetUp2018.pdf · Why DIF Analysis should be a Routine Part of Developing Conceptual Assessments. CBE-Life

References

Martinková, Drabinová, Leder & Houdek (2017). ShinyItemAnalysis:Test and Item Analysis with Shiny.https://shiny.cs.cas.cz/ShinyItemAnalysis/https://CRAN.R-project.org/package=ShinyItemAnalysis

Martinková, Drabinová & Houdek (2017). ShinyItemAnalysis: Analýzapřijímacích a jiných znalostních či psychologických testů. TESTFÓRUM,č.9, str. 16-35. doi 10.5817/TF2017-9-129

McFarland, Price, Wenderoth, Martinková, et al. (2017). Development andValidation of the Homeostasis Concept Inventory. CBE Life SciencesEducation, 16(2), ar35. doi 10.1187/cbe.16-10-0305

Martinková, Drabinová, Liaw, Sanders, McFarland & Price (2017).Checking Equity: Why DIF Analysis should be a Routine Part ofDeveloping Conceptual Assessments. CBE-Life Sciences Education, 16(2),rm2. doi 10.1187/cbe.16-10-0307

Drabinová & Martinková (2017). Detection of DIF with Non-LinearRegression: Non-IRT Approach Accounting for Guessing. Journal ofEducational Measurement, 54(4), pp. 498-517. doi 10.1111/jedm.12158