data integration - national-academies.org/media/files/activity files/healthservices...data...
TRANSCRIPT
Data Integration
MAJ Paul B. Lester, Ph.D. Director, Research Facilitation Team
Office of the Deputy Under Secretary of the Army [email protected]
Unclassified
• Background • Why is data integration worth the effort? • What do you want to do with the data? • Requirements for data integration • Questions
Agenda
Unclassified 2
• Academic expertise: Leadership science
• Practical expertise: Leading research teams
• Work with Comprehensive Soldier & Family Fitness (CSF2) • PM for the Global Assessment Tool (GAT) • PM for the Soldier Fitness Tracker (SFT) • PM for the data analysis / program evaluation team
• Work with Army Analytics Group (AAG) • Director of the Research Facilitation Team (RFT) • Bridge between researchers and data integrators • Partnerships with CSF2 and other organizations to provide
data analysis, interpretation, reports, etc..
Background
Unclassified 3
It isn’t easy, but… • Data reuse often cheaper than data creation (from scratch) • Allows for the creation of knowledge • Allows for granular analysis • Allows for a Return on Investment (ROI) calculation • Allows leadership to “see” the organization
• Analysis identifies intervention / opportunity points • Analysis should inform policy / business decisions • If done well, analysis avoids BOGSAT
Why is data integration worth the effort?
Unclassified 4
• Provide self-awareness for employees?
• Do basic organizational surveillance?
• Drive business decisions?
• Use predictive analytics?
• Tackle “low base-rate” problems?
As the number of “yes” responses increase, so does the amount and kind of data the organization probably needs
What do you want to do with the data?
Unclassified 5
ONE EXAMPLE: Data Integrated for the Suicide Study
Additional files added after original requirements for DMDC Cohort
Armed Forces Medical Examiner Tracking System (AFMETS)
Army Central Registry (ACR) Aug 2010 snapshot
Army Central Registry (ACR) July 2010 snapshot
Army Court Martial Information System (ACMIS)
Army Equal Opportunity Reporting System (EORS)
Army Safety Management Information System - Revised (ASMIS-R)
Army Training Requirements And Resource System (ATRRS)
Army Training Requirements And Resource System (ATRRS) Nov 2010 Redo
Army Waiver Data (AWD)
Automated Neuropsychological Assessment Metrics (ANAM)
Centralized Operations Police Suite (COPS) - Vehicle Registration System (COPS - VRS)
CID Information Management System (CIMS)
CID Information Management System (CIMS) Redo
CID Information Management System (CIMS) Sanction Redo
CIMS - Automated System Crime Records Center (ASCRC)
CIMS - Automated System Crime Records Center (ASCRC) Redo
Clinical Data Mart (CDM)
COPS - Army Correctional Information System (COPS - ACIS)
COPS - Military Police Reporting System (COPS - MPRS)
Defense Casualty Information Processing System (DCIPS) Casualty and NOK files
Defense Manpower Data Center (DMDC) - Master Personnel (ADRES)
Defense Manpower Data Center (DMDC) - Master Personnel Res Ethnic 2000_2002
Defense Manpower Data Center (DMDC) - Transaction Personnel (ADRES)
Defense Manpower Data Center (DMDC)- ADRES 201001 Snapshot
Defense Manpower Data Center (DMDC)- ADRES Non-Army snapshots
Defense Manpower Data Center (DMDC)- ADRES Strength Acct Code Snapshots
Digital Training Management System (DTMS)
DMDC - Casualty
DMDC - Contingency Tracking System (CTS)
DMDC - Contingency Tracking System (CTS) Redo
DMDC - Defense Enrollment and Eligibility Reporting System (DEERS)
DMDC - MEPCOM
DMDC - Payroll Active Duty 2000_2006
DMDC - Payroll Active Duty 2007_2009
DMDC - PERSTEMPO
DMDC Cohort June 2010
DMDC- Payroll Reserves
DoD Suicide Event Report (DODSER – Army) DoD Suicide Event Report (DODSER – Army) Data Refresh abbreviated
DoD Suicide Event Report (DODSER – Army) Data Refresh to include other cases
Drug & Alcohol Management Information System (DAMIS) HQDA G1 Unit name and location lookup table for Survey studies
Integrated Total Army Personnel Database (ITAPDB) Medical Data Repository (MDR) Partial Resend
Medical Protection System (MEDPROS)
MEDPROS - Deployment Health Assessments (MEDPROS - DHA)
MEDPROS - Periodic Health Assessments (MEDPROS - PHA)
Military Health System Data Repository (MDR)
Physical Disability Case Processing System (PDCAPS)
Risk Reduction Program System (RRPS)
Sexual Assault Data Management System (SADMS) Soldier Fitness Tracking (SFT)
Special Death Event Register File CID
Special Death Event Register File CID (Revised Dec 2010) Theater Medical Data Store (TMDS)
Theater Medical Data Store (TMDS) Patient table redo
TRANSCOM Regulating and Command & Control Evacuation System (TRAC2ES)
Wounded Warrior Accountability System (WWAS)
Takeaway: Depending on the
desired outcome, data integration
can be a “Herculean” effort. Unclassified
6
#1. Measurement “common to all” #2a. A plan for integration of data across the organization(s) #2b. Senior leadership emphasis & participation #3. Data collection / integration / staging / analysis platforms #4. Personnel to do the integration #5. Data analysis team(s) #6. Governance
What are the requirements?
Unclassified 7
•Optimism
•Work engagement
•Individual strengths
•+/- Coping strategies
•Spirituality (not religiosity)
•Strength of familial relationships
•How well the Army supports families
•Family support for serving in Army
•Trust in unit, leadership, peers
•+/- Affectivity (emotions)
•Strength of friendships
•Catastrophic thinking
•Depression
Life Orientation Scale Scheier, Carver, & Bridges (1994)
Patient Health Questionnaire - 9 Kroenke, Spitzer & Williams (2001)
Military Family Fitness Scale Directorate of Basic Combat Training
Experimentation & Analysis Element
Ft. Jackson, SC Military Family Fitness Scale Directorate of Basic Combat Training
Experimentation & Analysis Element
Ft. Jackson, SC Organizational Trust Scales Mayer, Davis, & Schoorman (1995)
Mayer & Davis (1999)
Sweeney, Thompson, & Blanton (2009)
Work as a Calling Scale Wrzesniewski et al. (1997)
Peterson, Park, & Seligman (2005)
Coping Strategy Scales Carver, Scheier, & Weutraub (1989)
Peterson & Park (In Press)
Pessimistic-Optimistic
Explanatory Style Peterson et al (2001)
PANAS Watson, Clark, & Tellegen (1989)
UCLA Loneliness Scale + Original Items Russell, Peplau, & Furguson (1978)
Russell, Peplau, & Cutrona (1980)
Peterson & Park (In Press)
Original Items Peterson & Park (In Press)
Brief Strengths Inventory Peterson & Seligman (2004)
Brief Multidimensional
Measure of Spirituality Fetzer Institute (1999)
Measurement common to all (1 of 2)
Unclassified 8
• The GAT is a 105 question survey administered online that measures a host of variables related to psychological resilience
• Must be taken annually
• Takes < 15 minutes to complete
• Most questions are not new / original 90% were already published in peer-reviewed scientific journals before the GAT was created
• Taken over 2.8 million times (Oct ‘09 – Jan ‘13) About once every 33 seconds
• Feedback: Narrative + bar chart scores + comparison to others (demographically)
• Results are confidential Not shared with command, investigators, doctors, etc.
THE GAT ONLY WORKS IF SOLDIERS ANSWER HONESTLY
CONFIDENTIALITY TRUST HONEST ANSWERS
Measurement common to all (2 of 2)
Unclassified 9
#1. Measurement “common to all” #2a. A plan for integration of data across the organization(s) #2b. Senior leadership emphasis & participation #3. Data collection / integration / staging / analysis platforms #4. Personnel to do the integration #5. Data analysis team(s) #6. Governance
What are the requirements?
Unclassified 10
ONE EXAMPLE: Army Data Framework
Data
Provider
Sources
Data Warehousing
Data Management & Quality Practices
User Access Data
Capture
Analysis
&
Reporting
Data
Warehouses
BI Tools
Source A
Source B
Master Data
ERP
ETL Tools
Data
Marts
ODS
&
Meta Data
Registry & Discovery
Other Services
………. Security Repository
Information Products
Enterprise Infrastructure Support Services
Data Governance
SENSORS
HUMAN INPUT
Ex: Condition Based
Maintenance
Situation
Awareness
Project
Area
Most Decision Support Business Intelligence Tools today have the
following features:
- Navigation Strategies (Menus, Drilldowns)
- Tabular Data, Graphic Displays
- Historic Trend Data (Regression)
- Gages
- Integrated Map Displays
What they all are missing is out-of-the-box clean coherent data.
Study
Area
Reporting Tools
Modeling Tools
NOTE #2b: This works best with senior leadership emphasis & participation
Unclassified 11
#1. Measurement “common to all” #2a. A plan for integration of data across the organization(s) #2b. Senior leadership emphasis & participation #3. Data collection / integration / staging / analysis platforms #4. Personnel to do the integration #5. Data analysis team(s) #6. Governance
What are the requirements?
Unclassified 12
13
• What if we had a single secure web-based platform where researchers could explore
complex research questions that require significant external data resources?
• What if this environment brought together over 60+ personnel-related data feeds from
across DoD…and didn’t require new Data Use Agreements?
• What if this environment had an approved governance structure (HIPAA & Privacy Act
compliant, Institutional Review Boards, etc.)?
• What if this environment gave researchers access to advanced / expensive statistical
tools…for free?
• What if this environment could be leveraged for collaboration across the DoD behavioral
science community?
• What if this environment hosted organic “research facilitators” who had meta data about the
environment to streamline the work?
• What if this environment also provided access for non-governmental researchers in order to
bring the “best” minds in science to bear on a problem set?
• What if this capability existed today?
It does…and it’s called the Person-Event Data Environment
IOC: TODAY FOC: 3rd Qtr FY14
ONE EXAMPLE: The Person-Event Data Environment (PDE)
Unclassified 13
14
ONE EXAMPLE: The Person-Event Data Environment (PDE)
PDE is designed for processing Controlled Unclassified
Information (CUI): personnel and medical encounter
data. The processing environment consists of separate
enclaves / environments:
Staging, Analysis (de-identified data), Analysis (encoded
data), and Reporting (Governance Portal,
Results/Reports, Results/Applications)
All data is initially loaded into the Staging enclave via the
SSH-Interface Server to be transformed and reviewed
prior to release into the Analysis enclaves.
Unclassified
#1. Measurement “common to all” #2a. A plan for integration of data across the organization(s) #2b. Senior leadership emphasis & participation #3. Data collection / integration / staging / analysis platforms #4. Personnel to do the integration #5. Data analysis team(s) #6. Governance
Data integration can be highly technical and labor-intensive. So, hire the right people to do it!
What are the requirements?
Unclassified 15
#1. Measurement “common to all” #2a. A plan for integration of data across the organization(s) #2b. Senior leadership emphasis & participation #3. Data collection / integration / staging / analysis platforms #4. Personnel to do the integration #5. Data analysis team(s) #6. Governance
What are the requirements?
Unclassified 16
17
Aplus Team
Knowledge Manager #2
IT DBA / Developer #2
IT DBA / Developer #3
IT DBA / Developer #1
Knowledge Manager #1
Strategic IT Manager
PDH Team (Nebraska)
Project Manager
Post Doc #2
Post-Doc #1
Senior Scientist
Drasgow Consulting Group (DCG)
Psychologist #1
Psychologist #3
IT Staffer
Psychologist #2
Senior Scientist
TRAC-MTRY
ORSA #1
Research Assistant #1
Research Assistant #2
ORSA #2
Project Manager
NPS GSBPP
Economist #1
Economist #2
Senior Scientist
RFT Organic Staff
Project Manager
Psychologist #1
Psychologist #2
Economist
Director
Research Coordinator
Research Fellow #1
Research Fellow #2
IT DBA / Developer
RWJF Initiative
Harvard
University of Michigan
Temple University
Northwestern
Univ. of Pennsylvania
University of Illinois
Post Production Staffer
ONE EXAMPLE: The Research Facilitation Team (RFT)
NOTE: Multi-disciplinary is GOOD. Unclassified
#1. Measurement “common to all” #2a. A plan for integration of data across the organization(s) #2b. Senior leadership emphasis & participation #3. Data collection / integration / staging / analysis platforms #4. Personnel to do the integration #5. Data analysis team(s) #6. Governance
What are the requirements?
Unclassified 18
19
ONE EXAMPLE: Governance
Unclassified
Questions?
MAJ Paul B. Lester, Ph.D. Director, Research Facilitation Team
Office of the Deputy Under Secretary of the Army [email protected]
831-583-2886
20 Ryan Ranch Road Suite 290
Monterey, CA 93940
Unclassified 20