predicting neonatal acidemia by computer analysis: costa et al

2
Predicting neonatal acidemia by computer analysis: Costa et al Christian M. Pettker, MD; George A. Macones, MD, MSCE, Associate Editor The article below summarizes a roundtable discussion of a study published in this issue of the Journal in light of its methodology, relevance to practice, and implications for future research. Article discussed: Costa A, Ayres-de-Campos D, Costa F, et al. Prediction of neonatal acidemia by computer analysis of fetal heart rate and ST event signals. Am J Obstet Gynecol 2009;201:464.e1-6. The full discussion appears at www.AJOG.org, pages e1-4. DISCUSSION QUESTIONS What is the specific question being asked? What do the inclusion and exclusion criteria tell us about the study? What was the primary outcome? Why were tracings from the last hour of labor used? What difference does lack of traditional fetal heart rate interpretation make? Would you categorize this tool as a diagnostic test or a screening test? Which statistical assessments were most important? What evidence supports higher accuracy of combined computer analysis? I ntrapartum fetal assessment, a critical and ubiquitous tool in obstetrical management, is also a limited and vexing part of practice. In 1957, Yale University researchers Hess and Hon introduced electronic fetal heart rate monitoring (EFM) as a window into fetal status dur- ing labor. Since then, EFM has become highly controversial, a point exemplified by its universal adoption despite failure to demonstrate a beneficial impact on cesarean rates or long-term fetal/neona- tal injury. The test has a very poor posi- tive predictive value (PPV) and a high false-positive rate; the American College of Obstetricians and Gynecologists has concluded that only 1-2/1000 abnormal tracings leads to cerebral palsy. In the 50 years since EFM became available— cer- tainly, adequate time to assess and de- velop the technology—it has shown somewhat limited effectiveness in im- proving perinatal outcomes. The arti- cle discussed in this month’s meeting of the Journal Club introduced a new technology for fetal monitoring: the Omniview-SisPorto 3.5 (Speculum, Lisbon, Portugal). CURRENT S TATE Classic EFM, appropriate for both high- and low-risk patients, uses intervals be- tween heart beats (for external or Dopp- ler monitoring) or between R waves (for internal or electrocardiogram—ECG— monitoring) to calculate heart rates. This calculated rate is then mapped to the fe- tal heart rate (FHR) strip to demonstrate characteristics that can be read by a pro- vider. As a result, the technology is de- pendent on the knowledge and experi- ence of the interpreter. Over the last 10-15 years, progress has been made into the standardization of FHR characteristics to facilitate interpre- tation. The rationale behind such efforts is that guidelines formalizing the assess- ment of FHR patterns may improve the ability of providers to recognize and if necessary, act on strips that have been shown to correlate with poor outcomes. Recent technological advances, such as the STAN S31 ECG FHR monitor (Neoventa Medical AB, Mölndal, Swe- den), show promise in the monitoring of higher-risk patients. An FDA-approved device, the STAN uses internal fetal monitoring to examine specific ECG changes, such as T-wave amplitude and ST-interval duration; much like an adult ECG. The device alerts the provider to events associated with fetal hypoxemia and/or acidemia. Study results indicate that STAN might reduce the incidence of fetal scalp pH sampling (relative risk [RR], 0.65; 95% confidence interval [CI], 0.59 – 0.72) and operative vaginal delivery (RR, 0.88; 95% CI, 0.80 – 0.97). S CREENING OR DIAGNOSIS ? One issue Journal Club members dis- cussed was whether EFM is considered a screening test or a diagnostic test. The question is important when weighing the statistical measures of test performance: sensitivity, specificity, PPV, negative predictive value (NPV), and positive and negative likelihood ratios. No test per- forms perfectly, and typically, improve- ments in sensitivity or NPV lead to a re- duction in specificity or PPV. Diagnostic tests usually demand strong specificity and a robust PPV, while screening tests should emphasize sensitivity and NPV. Determining what kind of test is needed From Yale University School of Medicine, New Haven, CT: Moderator Christian M. Pettker, MD Assistant professor Discussants Stephen F. Thung, MD Assistant professor Antonette Dulay, MD Third-year fellow Christina Han, MD First-year fellow Unzila Ali, MD Chief resident 0002-9378/free © 2009 Published by Mosby, Inc. doi: 10.1016/j.ajog.2009.08.006 See related article, page 464 Journal Club www. AJOG.org NOVEMBER 2009 American Journal of Obstetrics & Gynecology 543

Upload: christian-m-pettker

Post on 02-Sep-2016

217 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Predicting neonatal acidemia by computer analysis: Costa et al

PC

T

i

CG

T

FN

MCA

DSA

AT

CF

UC

0©d

Journal Club www.AJOG.org

redicting neonatal acidemia by computer analysis: Costa et alhristian M. Pettker, MD; George A. Macones, MD, MSCE, Associate Editor

he article below summarizes a roundtable discussion of a study published in this issue of the Journal in light of its methodology, relevance to practice, and

mplications for future research. Article discussed:

osta A, Ayres-de-Campos D, Costa F, et al. Prediction of neonatal acidemia by computer analysis of fetal heart rate and ST event signals. Am J Obstetynecol 2009;201:464.e1-6.

he full discussion appears at www.AJOG.org, pages e1-4.

Impre(ihbtcttfoctytvspcotOL

CCatlimctcvp

bFtimans

a(dhdmcSEeatf[[d

SOcsqsspnfmdtas

DISCUSSION QUESTIONS

What is the specific question beingasked?

What do the inclusion and exclusioncriteria tell us about the study?

What was the primary outcome?

Why were tracings from the last hour oflabor used?

What difference does lack of traditionalfetal heart rate interpretation make?

Would you categorize this tool as adiagnostic test or a screening test?

Which statistical assessments weremost important?

What evidence supports higheraccuracy of combined computeranalysis?

rom Yale University School of Medicine,ew Haven, CT:

oderatorhristian M. Pettker, MDssistant professor

iscussantstephen F. Thung, MDssistant professor

ntonette Dulay, MDhird-year fellow

hristina Han, MDirst-year fellow

nzila Ali, MDhief resident

002-9378/free2009 Published by Mosby, Inc.

oi: 10.1016/j.ajog.2009.08.006

eSee related article, page 464

ntrapartum fetal assessment, a criticaland ubiquitous tool in obstetrical

anagement, is also a limited and vexingart of practice. In 1957, Yale Universityesearchers Hess and Hon introducedlectronic fetal heart rate monitoringEFM) as a window into fetal status dur-ng labor. Since then, EFM has becomeighly controversial, a point exemplifiedy its universal adoption despite failureo demonstrate a beneficial impact onesarean rates or long-term fetal/neona-al injury. The test has a very poor posi-ive predictive value (PPV) and a highalse-positive rate; the American Collegef Obstetricians and Gynecologists hasoncluded that only 1-2/1000 abnormalracings leads to cerebral palsy. In the 50ears since EFM became available— cer-ainly, adequate time to assess and de-elop the technology—it has shownomewhat limited effectiveness in im-roving perinatal outcomes. The arti-le discussed in this month’s meetingf the Journal Club introduced a newechnology for fetal monitoring: themniview-SisPorto 3.5 (Speculum,isbon, Portugal).

URRENT STATE

lassic EFM, appropriate for both high-nd low-risk patients, uses intervals be-ween heart beats (for external or Dopp-er monitoring) or between R waves (fornternal or electrocardiogram—ECG—

onitoring) to calculate heart rates. Thisalculated rate is then mapped to the fe-al heart rate (FHR) strip to demonstrateharacteristics that can be read by a pro-ider. As a result, the technology is de-endent on the knowledge and experi-

nce of the interpreter. D

NOVEMBER 2009 Am

Over the last 10-15 years, progress haseen made into the standardization ofHR characteristics to facilitate interpre-ation. The rationale behind such effortss that guidelines formalizing the assess-

ent of FHR patterns may improve thebility of providers to recognize and ifecessary, act on strips that have beenhown to correlate with poor outcomes.

Recent technological advances, suchs the STAN S31 ECG FHR monitorNeoventa Medical AB, Mölndal, Swe-en), show promise in the monitoring ofigher-risk patients. An FDA-approvedevice, the STAN uses internal fetalonitoring to examine specific ECG

hanges, such as T-wave amplitude andT-interval duration; much like an adultCG. The device alerts the provider tovents associated with fetal hypoxemiand/or acidemia. Study results indicatehat STAN might reduce the incidence ofetal scalp pH sampling (relative riskRR], 0.65; 95% confidence intervalCI], 0.59 – 0.72) and operative vaginalelivery (RR, 0.88; 95% CI, 0.80 – 0.97).

CREENING OR DIAGNOSIS?ne issue Journal Club members dis-

ussed was whether EFM is considered acreening test or a diagnostic test. Theuestion is important when weighing thetatistical measures of test performance:ensitivity, specificity, PPV, negativeredictive value (NPV), and positive andegative likelihood ratios. No test per-

orms perfectly, and typically, improve-ents in sensitivity or NPV lead to a re-

uction in specificity or PPV. Diagnosticests usually demand strong specificitynd a robust PPV, while screening testshould emphasize sensitivity and NPV.

etermining what kind of test is needed

erican Journal of Obstetrics & Gynecology 543

Page 2: Predicting neonatal acidemia by computer analysis: Costa et al

ce

sitcufbaEteEaacE

fiblthfemptscs

SIr

PwfaOtlotEs

tSothsiPcaewctlwiwaup

apd

ms

FTffpresnapptb

nhfssiEioCwcjputea

Journal Club www.AJOG.org

5

an help us concentrate on the most rel-vant statistics.

EFM and its associated technologieseem to combine components of screen-ng and diagnostic tests, depending onhe evaluator’s perspective. In someases, abnormal FHR patterns can besed to screen for cases in which use of

etal scalp pH sampling or lactate testing,oth diagnostic for acidemia, would beppropriate. In other cases, however,FM is used as a type of diagnostic test

hat results in action; specifically, deliv-ry. Because of its different functions,FM appears to demand high sensitivitynd specificity, along with a reliable NPVnd PPV. These requirements likelyomplicate efforts to define the value ofFM.This quandary can probably be simpli-

ed. As noted, traditional EFM is limitedy a high false-positive rate, and thus,

ow specificity. All too often, an ominousracing leads to operative delivery of aealthy, vigorous infant who is unaf-

ected by hypoxemia or acidemia. As wevaluate new tests for our fetal assess-ent armamentarium, we should em-

hasize those that reduce the false-posi-ive rate. We still need to maintain highensitivity but in order to indicate a test islinically valuable, improvements inpecificity and PPV are required.

OMETHING NEW

n addition to incorporating STAN ECG

eadings and alerts, the Omniview-Sis- l

44 American Journal of Obstetrics & Gynecology

orto 3.5 (Speculum, Lisbon, Portugal)arns of FHR changes associated with

etal hypoxemia or acidemia; examplesre reduced variability or tachycardia.ne aim is to reduce human error in in-

erpretation of tracings and possible de-ays in taking appropriate action. An-ther goal is to supplement theechnology already in use—traditionalFM and STAN ECG—to improve thepecificity of the tests.

Costa and colleagues used a retrospec-ive analysis to compare the Omniview-isPorto 3.5 to a system that, like STAN,nly analyzes ST changes. The popula-ion included patients who were atigher risk for adverse outcomes. Thistudy showed dramatic improvementsn sensitivity with the Omniview-Sis-orto 3.5; more infants with acidemiaould be identified in the study group ex-mined with the new technology. How-ver, specificity was shown to be highith both systems, and there was little

hange in the rather modest PPV withhe new test. Regardless, the positiveikelihood ratio for predicting acidemiaas high (17.6; 95% CI, 9.0 –34.5); a pos-

tive likelihood ratio of this magnitudeould give a provider comfort in makingdecision for operative delivery, partic-larly in a patient with a high pretestrobability of an adverse event.Importantly, the test was not evalu-

ted against traditional visual FHR inter-retation, which is generally the stan-ard of care at most centers. This fact

imits our ability to judge the perfor- p

NOVEMBER 2009

ance of the Omniview-SisPorto 3.5ystem in different settings.

UTURE DIRECTIONS

he study by Costa et al is only 1 steporward in introducing a new technologyor EFM. Although important as an ex-loratory study, it is limited by its size,etrospective nature, and inclusion andxclusion criteria. Small exploratorytudies of promising new technologieseed to be followed by larger clinical tri-ls. Fetal pulse oximetry is a good exam-le of an intrapartum test that was ap-ropriately analyzed with large, rigorousrials that demonstrated little additionalenefit overall.Because both the STAN ECG and Om-

iview-SisPorto 3.5 are reserved forigh-risk patients and require internal

etal monitoring, neither will replacetandard external EFM. As a result, wetill need to seek ways to perfect EFM usen the low-risk patient. Improvements inFM interpretation may be seen with the

ntroduction of the new category systemutlined in the 2008 National Institute ofhild Health and Human Developmentorkshop report, but the clinical appli-

ation of these recommendations hasust begun, and no studies exist atresent to demonstrate their validity ortility. Evaluation and refinement of

hese recommendations may be our bestffort at remedying the problematicpplication of EFM in our daily

ractice. f