Multivariate Time Series Analysis of Clinical and Physiological Data Patricia Ordóñez Rozo PhD Candidate University of Maryland, Baltimore County

Post on 13-Dec-2015


Multivariate Time Series Analysis of Clinical and Physiological Data

Patricia Ordóñez Rozo

PhD Candidate, University of Maryland, Baltimore County

Overview

• Motivation
• Hypothesis
• Visualization Work
• Related Work
• Proposed Similarity Metric
• Evaluation Plan

Motivation

• Technical advances in medicine
  – 15 to 350 vital signs and lab results per patient (physiological and clinical data)

• Need for personalized medicine
  – Individual differences among humans
  – Preset ‘general’ thresholds are misleading

• Methods of data analysis are not multivariate

The Hypothesis

We hypothesize that it may be possible to:

1. Create a visualization that will assist providers in examining multivariate patient data over time more accurately and efficiently than current tabular visualizations,

2. Identify hidden patterns in medical data that would signal significant medical events (such as organ failure) hours in advance, and

The Hypothesis (continued)

3. Develop a measure of similarity for multivariate time series representations of physiological and clinical electronic data allowing physicians to identify patients with similar events and/or phenotypes for the purpose of predicting patient outcomes.

The Visualization

Pilot Study

• Asked 14 residents at St. Agnes Hospital to predict whether each of 10 patients went into an episode of acute hypotension
  – Each resident used tables and the visualization for five patients each

Results

• Accuracy with Tables: 57.5%
• Accuracy with Visualization: 52.2%

PhysioNet Challenge 2009: 28 submissions
• 13 had 100% accuracy
• 9 had 80% accuracy
• 5 had 60% accuracy
• 1 had 20% accuracy
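To put the pilot accuracies next to the challenge figures, the submission counts above can be collapsed into a single mean accuracy. The short sketch below does only that arithmetic; the variable names are illustrative, and the comparison is rough, since the slide does not say whether the residents and the challenge entrants were scored on the same cases.

```python
# Accuracy (%) -> number of challenge submissions, as listed on the slide.
submissions = {100: 13, 80: 9, 60: 5, 20: 1}

total_submissions = sum(submissions.values())

# Submission-weighted mean accuracy across all entries.
mean_accuracy = (
    sum(accuracy * count for accuracy, count in submissions.items())
    / total_submissions
)

print(f"{total_submissions} submissions, mean accuracy {mean_accuracy:.1f}%")
```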

Publications on Visualization

• Patricia Ordóñez, Marie desJardins, Michael Lombardi, Christoph U. Lehmann, Jim Fackler. An Animated Multivariate Visualization for Physiological and Clinical Data in the ICU. In Proceedings of the First ACM International Health Informatics Symposium (IHI), Arlington, VA, November 11-12, 2010, to appear.

• Christoph U. Lehmann, Patricia Ordóñez, Jim Fackler, Kathryn Holmes. Practical Visualization of Multivariate Time Series Data in a Neonatal ICU. In Proceedings of the Visual Analytics of Health Care (VAHC) Workshop at VisWeek 2010, Salt Lake City, UT, October 24, 2010, to appear.

• Patricia Ordóñez, Marie desJardins, Carolyn Feltes, Christoph U. Lehmann, James Fackler. Visualizing Multivariate Time Series Data to Detect Specific Medical Conditions. In Proceedings of the AMIA (American Medical Informatics Association) 2008 Annual Symposium, 6:530-534 (2008). Paper nominated for Student Paper Competition.

Finding Hidden Patterns

• Develop a symbolic representation of multivariate time series based on SAX (Symbolic Aggregate approXimation) and BOP (Bag-of-Patterns) for univariate time series and on other work on multivariate time series data

• Create a similarity metric for the representation

Related Work

• SAX by Jessica Lin and Eamonn Keogh at UC Riverside

• BOP by Jessica Lin at George Mason University

• Novel similarity metric for a multivariate time series representation based on wavelets by Mohammed Saeed and Roger Mark at the MIT Laboratory for Computational Physiology

Symbolic Aggregate ApproXimation

[Figure: example time series over time steps 0 to 120, with segments labeled by the symbols a, b, and c]

First convert the time series to its Piecewise Aggregate Approximation (PAA) representation.

Then convert the PAA to SAX symbols. In this example the resulting word is baabccbc.

Thanks to Eamonn Keogh and Jessica Lin for use of slide
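The two steps on this slide (PAA, then symbol assignment) can be sketched in a few lines. This is a minimal illustration, not the slide authors' code: it assumes a three-letter alphabet to match the a, b, c symbols in the figure, uses the usual Gaussian breakpoints of about ±0.43 for three equiprobable regions, and requires the series length to be divisible by the number of segments. The function names are hypothetical.

```python
from bisect import bisect_left
from statistics import mean, pstdev

# Breakpoints splitting a standard normal into 3 equiprobable regions
# (assumed alphabet size 3, matching the a/b/c symbols on the slide).
BREAKPOINTS = [-0.43, 0.43]
ALPHABET = "abc"


def znormalize(series):
    """Rescale a series to zero mean and unit variance."""
    mu, sigma = mean(series), pstdev(series)
    if sigma == 0:
        return [0.0 for _ in series]  # a flat series maps to all zeros
    return [(x - mu) / sigma for x in series]


def paa(series, n_segments):
    """Piecewise Aggregate Approximation: the mean of each equal-width segment.

    Simplifying assumption: len(series) is divisible by n_segments.
    """
    width = len(series) // n_segments
    return [mean(series[i * width:(i + 1) * width]) for i in range(n_segments)]


def sax(series, n_segments):
    """Convert a numeric series into a SAX word of n_segments symbols."""
    segments = paa(znormalize(series), n_segments)
    # bisect_left finds which breakpoint interval each segment mean falls in.
    return "".join(ALPHABET[bisect_left(BREAKPOINTS, v)] for v in segments)


# A low, a middle, and a high plateau map to one symbol each.
example = [0, 0, 0, 0, 5, 5, 5, 5, 10, 10, 10, 10]
print(sax(example, 3))
```

Normalizing before discretizing is what lets one fixed set of breakpoints serve series with different units and ranges.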

Bag-of-Patterns Representation

• Lin and Li
• SSDBM 2009

Thanks to Jessica Lin for use of these images
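Since the images for this slide did not survive, the following sketch shows the general bag-of-patterns idea for a single series: slide a window along the series, turn each window into a SAX word, and count the words. It is an illustration under assumptions, not Lin and Li's implementation: the alphabet size (3), the breakpoints, the window parameters, and the function names are all chosen here for the example, and the window size is assumed to be divisible by the word length.

```python
from bisect import bisect_left
from collections import Counter
from statistics import mean, pstdev

BREAKPOINTS = [-0.43, 0.43]  # assumed 3-letter alphabet
ALPHABET = "abc"


def sax_word(window, word_length):
    """Z-normalize one window, reduce it with PAA, and map it to symbols."""
    mu, sigma = mean(window), pstdev(window)
    z = [0.0 if sigma == 0 else (x - mu) / sigma for x in window]
    width = len(z) // word_length
    segments = [mean(z[i * width:(i + 1) * width]) for i in range(word_length)]
    return "".join(ALPHABET[bisect_left(BREAKPOINTS, v)] for v in segments)


def bag_of_patterns(series, window_size, word_length):
    """Histogram of SAX words over all sliding windows of a series."""
    bag = Counter()
    previous = None
    for start in range(len(series) - window_size + 1):
        word = sax_word(series[start:start + window_size], word_length)
        # Count a run of identical consecutive words only once, so slowly
        # changing stretches do not dominate the histogram.
        if word != previous:
            bag[word] += 1
        previous = word
    return bag


# An alternating series yields two words that take turns.
print(bag_of_patterns([0, 1, 0, 1, 0, 1], window_size=2, word_length=2))
```

The resulting histogram discards where each word occurred, which is why two series can be compared by their word counts alone.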

Novel Similarity Metric

• Saeed and Mark
• AMIA 2006
• Retrieves similar multi-parameter physiological time series using a wavelet-based symbolic representation at different levels of granularity

• Used heart rate (HR), systolic blood pressure (SBP) and cardiac output to predict hemodynamic deterioration

Novel Similarity Metric (cont.)

• Used modified information retrieval methods for finding similar time series
  – Term Frequency Vector (TFV)
  – Inverse Document Frequency (IDF)

• Ignored temporal patterns

• Emphasized multi-scale analysis
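To make the information retrieval analogy concrete, the sketch below treats each time series as a "document" whose "terms" are symbolic words, weights the term frequencies by inverse document frequency, and compares two series by cosine similarity. This is the textbook TF-IDF scheme, shown only for orientation; the slide says Saeed and Mark used modified methods, and those modifications are not reproduced here. The function names and the example bags are invented for the illustration.

```python
import math
from collections import Counter


def tfidf_vectors(bags):
    """Turn word-count bags (one per time series) into TF-IDF weight dicts."""
    n_docs = len(bags)
    doc_freq = Counter()
    for bag in bags:
        doc_freq.update(bag.keys())  # number of series containing each word
    vectors = []
    for bag in bags:
        total = sum(bag.values())
        vectors.append({
            # term frequency times log inverse document frequency
            word: (count / total) * math.log(n_docs / doc_freq[word])
            for word, count in bag.items()
        })
    return vectors


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse weight vectors."""
    dot = sum(weight * v.get(word, 0.0) for word, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical word counts for three patients' series.
bags = [
    Counter({"ab": 2, "bc": 1}),
    Counter({"ab": 1, "bc": 2}),
    Counter({"ca": 3}),
]
vectors = tfidf_vectors(bags)
print(cosine_similarity(vectors[0], vectors[1]))
print(cosine_similarity(vectors[0], vectors[2]))
```

Because the vectors hold only counts, the order in which words occurred plays no role, which is the "ignored temporal patterns" limitation noted on the slide.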

Proposed Similarity Metric

• Multivariate BOP that crosses the time series

HR       BCCBBACB
RR       AABAABAB
DeltaBP  CCBACCBA
MAP      BBBBBBBB

Words read across the series at each time step: BACB, CACB, …

Histogram of word frequencies using a modified form of TF/IDF incorporating personalized and standardized representations
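The example on this slide suggests that each word is formed by taking one symbol from every variable at the same time step: the first symbols of HR, RR, DeltaBP, and MAP give BACB, and the second give CACB. The sketch below builds those words and their histogram under that reading. It is an interpretation of the slide, not the proposed metric itself; the function name is hypothetical, and the modified TF/IDF weighting is not shown because the slide does not specify it.

```python
from collections import Counter


def crossing_words(symbol_series):
    """Read one symbol per variable at each time step to form a word.

    symbol_series maps a variable name to its symbol string; all strings
    must be time-aligned and of equal length. Symbol order within a word
    follows the insertion order of the dict.
    """
    rows = list(symbol_series.values())
    if len({len(row) for row in rows}) != 1:
        raise ValueError("all symbol strings must have the same length")
    return ["".join(column) for column in zip(*rows)]


# Symbol strings copied from the slide.
patient = {
    "HR": "BCCBBACB",
    "RR": "AABAABAB",
    "DeltaBP": "CCBACCBA",
    "MAP": "BBBBBBBB",
}

words = crossing_words(patient)
histogram = Counter(words)
print(words)
print(histogram.most_common(1))
```

Each word captures the joint state of all four variables at one moment, so the histogram describes how often each combination occurs rather than how any single variable behaves.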

Evaluation Plan

• Study on Patent Ductus Arteriosus (PDA) in neonatal patients to evaluate the final visualization and the similarity and information retrieval methods, using data from NICU patients and surveys of residents

• Convert the 4000+ annotated ICU patients of the MIMIC II database to our representation and evaluate the modified IR methods on a larger medical database

Good Research Requires Good Support

• Advisors
  – Drs. Marie desJardins and Tim Oates (Computer Science)
  – Dr. Jim Fackler (Medicine)

• Committee
  – Drs. Jessica Lin and Penny Rheingans

• Advocates/Mentors
  – Drs. Wendy Carter, Michael Grasso, Anupam Joshi, Christoph U. Lehmann, Roger Mark, Daniel J. Scott, Janet Rutledge, Renetta Tull, Jorge H. Ordóñez-Smith

• Maple, Coral and eBiquity lab mates and classmates
• National Science Foundation