analyzing child health data sets: how ucsf's celdac initiative helps to move your research...
DESCRIPTION
Overview of UCSF-CTSI Comparative Effectiveness Large Dataset Analysis Core and large, public datasets for studying the health of children and the health care they receive.TRANSCRIPT
UCSF’s Comparative Effectiveness
Large Dataset Analytic Core: Focus on Child Health Data Sets
Janet Coffman, PhD
Philip R. Lee Institute for Health Policy Studies
University of California, San Francisco
November 30, 2011
Outline
• Overview of CELDAC• Examples of major data sets for studying
child health• Online tools for simple data analyses• Discussion
2
Overview of CELDAC
3
CELDAC Partners
CELDAC is a partnership at UCSF among the – Philip R Lee Institute for Health Policy Studies– Academic Research Systems– Department of Orthopedic Surgery– Clinical and Translational Science Institute
Funding– Administrative supplement to the NCRR grant for UCSF’s Clinical & Translational Science Institute–California HealthCare Foundation
4
CELDAC Personnel
Faculty
• Janet Coffman• Jim G. Kahn• Claire Brindis• Steve Takemoto• Adams Dudley• Kirsten Johansen
IHPS Staff
• Leon Traister• Claire Will
5
ARS Staff• Rob Wynden• Ketty Mobed• Hari Rekapalli• Prakash Lakshminarayanan
CELDAC Mission
The mission of CELDAC is to enhance UCSF's capacity for analysis of large local, state, and national health datasets to conduct comparative effectiveness research and other types of health services and health policy research.
6
CELDAC Goals• Accelerate access to and use of local, state, and national
health datasets, as a model for other CTSAs and health research organizations.
• Enhance UCSF researchers’ ability to compete for funding to use large data sets to conduct CER.
• Develop procedures and infrastructure by conducting pilot studies.
• Support additional studies on the comparative effectiveness of clinical interventions.
• Provide consultation to researchers currently working with or interested in working with large data sets
7
Find Large Datasetshttp://ctsi.ucsf.edu/research/celdac
A guided search tool to find the best datasets for a project. Builds on previous efforts by Andy Bindman, Nancy Adler, Claire Brindis, Charlie Irwin and others.
8
Search Results –Search for administrative data on infants’ use of health care services
http://ctsi.ucsf.edu/research/celdac
9
Analyze Large Data Sets• CELDAC has created a repository of select large,
public data sets that are available to UCSF faculty at no cost.
• These data sets include– HCUP Kids Inpatient Databases – HCUP National Emergency Department Sample– HCUP National Inpatient Sample– HCUP State Emergency Department and Inpatient
Databases (select states)– American Hospital Association Annual Survey– Area Resource File
10
Provide Consultation
• Study design/conceptualization • Identification of relevant datasets• Assistance with data set acquisition• Cohort selection• Data cleaning• Linking data sets• Strategies to deal with common methodological
issues in analysis of observational data• Programming support for preliminary analyses
11
12
Test New Methods for Working with Large Data Sets
• Conventional methods for managing large data sets have important limitations, especially for studies that draw data from multiple data sets– Requires programmers with expertise in managing
and querying large data sets– Source data tables continue as individual entities– Manipulations and linkages between tables require
awareness of each table’s architecture and customized “One-Off” programming
Test New Methods for Working with Large Data Sets
• Pilot Projects– Integrated repository of data on spine
surgery procedures and outcomes from five data sources
– Graphical user interface for browsing California Office of Statewide Health Planning and Development data on hospital discharges
13
Examples of Major Child Health Data Sets
14
Major Types of Large Datasets Used in Health Services ResearchType of Data Set Description ExamplesSurvey Collects information from
individuals, families, or organizations
• National Survey of Children’s Health
• National Survey of Children with Special Health Care Needs
Administrative claims
Information from records of health professionals and health care facilities, usually from billing records
• HCUP Kid’s Inpatient Databases
• HCUP State Inpatient Databases
Registries Information from datasets that incorporate all persons with a particular condition(s)
• California Cancer Registry• San Francisco
Mammography Registry
15
Major Types of Designs for Surveys
Type of Survey Description Examples
Cross-sectional Data collected from a single sample at a single point in time
• National Health and Nutrition Examination Survey
• National School-based Youth Behavior Survey
• National Survey of Children’s HealthPanel Data collected from a
single sample at multiple points in time
• Medical Expenditure Panel Survey• National Longitudinal Study of
Adolescent Health• National Longitudinal Survey of
Youth
16
Major Types of Units of Observation
17
Unit of Observation Examples
Individual • National Health and Nutrition Examination Survey• National Survey of Children’s Health
Household • Medical Expenditure Panel Survey• National Health Interview Survey
Visit or discharge • HCUP Kid’s Inpatient Databases• National Ambulatory Medical Care Survey
Physician • American Medical Association Masterfile• HSC Health Tracking Physician Survey
Facility (e.g., hospital, clinic) •American Hospital Association Annual Survey•California OSHPD Hospital Annual Financial Data
Geographic area (e.g., county, state)
•US Census•Area Resource File
18
Major National Data Sets Focused on Child Health
• National Survey of Children’s Health• National Survey of Children with Special
Health Care Needs• National Immunization Survey• National School-based Youth Risk
Behavior Survey• National Longitudinal Study of
Adolescent Health • Kids’ Inpatient Database
National Survey of Children’s Health
• Nationally representative sample (90,000+ children in 2007-2008
• Cross-sectional design, independent samples• Administered by telephone to parent or guardian
• Historically landlines only; adding cell phones
• Questions about• Child’s physical and emotional health• Parents’ health• Family interactions• School and community
19http://www.cdc.gov/nchs/slaits/nsch.htm
Other National Datasets Containing Data on Child Health
• National Ambulatory Medical Care Survey• National Hospital Ambulatory Medical
Care Survey• National Health and Nutrition Examination
Survey• Medical Expenditures Panel Survey• HCUP State Emergency Department and
Inpatient Databases20
Medical Expenditure Panel Survey• Nationally representative sample of 22,000 to
37,000 persons• Overlapping panel design• 2 years of data collected through 5 rounds of
interviews• Three major components
• Household survey• Data on cost and utilization from providers caring for
household survey participants• Survey of employers regarding employer-sponsored
health insurance benefits
http://www.meps.ahrq.gov/mepsweb/21
Online Tools for Simple Data Analyses
22
23
Approaches to Obtaining Information from Large Data Sets
• Analyze the data set or find a programmer to do the analysis for you
• Use an interactive data analysis tool provided for the data set
• Use a web site that aggregates data from multiple sources
2424
2525
2626
2727
2828
2929
30
Questions for Discussion
• What services relating to large data set analysis would be most useful to you?
• What data sets are of greatest interest to you?
• How could CELDAC partner effectively with researchers in your school/department/division?
30
Contact CELDAC
• Jim G. Kahn: [email protected] • Janet Coffman: [email protected]
/415-476-2435• Claire Will: [email protected]/415-476-
6009
• http://ctsi.ucsf.edu/research/large-datasets
31