data cleaning using odm cdisc esug meeting andrew newbigging vice president, integrations...

19
Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary - Medidata and Authorized Clients Only. This document contains proprietary information that shall be distributed, routed or made available only within Medidata, except with written permission of Medidata.

Upload: leonard-mcdowell

Post on 06-Jan-2018

221 views

Category:

Documents


1 download

DESCRIPTION

© 2010 Medidata Solutions Worldwide Medidata uses ODM to integrate eClinical systems Metadata driven integrations support “define-once” strategy, or DRY (“Don’t Repeat Yourself”) ODM provides good support for form and question definitions How much of the data validation specification can be included in ODM? Introduction DesignAnalyzeCapture Transform & Extract Datasets Export (S)AE Case Data Import Clinical Data Import / Export Metadata Metadata Library eDiary IVRS LIMS SafetySAS Analysis Warehouse CTMS Payment Export Operational Data E2BODMODM CDASH SDTM ODM 3

TRANSCRIPT

Page 1: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

Data Cleaning Using ODMCDISC ESUG Meeting

Andrew NewbiggingVice President, Integrations Development13th July 2010

Medidata Solutions, Inc. Proprietary - Medidata and Authorized Clients Only. This document contains proprietary information that shall be distributed, routed or made available only within Medidata, except with written permission of Medidata.

Page 2: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

IntroductionODM 1.3 - FeaturesODM 1.3 - LimitationsVendor ExtensionsFuture PlansQuestions

Agenda

Page 3: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

• Medidata uses ODM to integrate eClinical systems

• Metadata driven integrations support “define-once” strategy, or DRY (“Don’t Repeat Yourself”)

• ODM provides good support for form and question definitions

• How much of the data validation specification can be included in ODM?

IntroductionDesign AnalyzeCapture

Transform &Extract

Datasets

Export(S)AE Case

Data

ImportClinical Data

Import /Export

Metadata

Metadata Library

eDiary

IVRS

LIMS

Safety SAS

AnalysisWarehouse

CTMS

Payment

ExportOperational

Data

E2BE2BODMODMODMODM

CDASHCDASH SDTMSDTMODMODM

3

Page 4: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

IntroductionODM 1.3 - FeaturesODM 1.3 - LimitationsVendor ExtensionsFuture PlansQuestions

Agenda

Page 5: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

ODM 1.3 – Features - DataType• ODM 1.3 introduced typed data definitions:

• So we know that this data is not valid:

• Data values can also be checked against DataType attribute (ODM 1.2 and 1.3):

5

Page 6: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

ODM 1.3 – Features - LengthConstrain acceptable size of text, string, integer

and float data items:

6

Page 7: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

ODM 1.3 – Features - CodeListConstrains data values to a specified list:

7

Page 8: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

ODM 1.3 – Features - RangeCheckSpecification of simple uni-variate checks

8

Page 9: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

IntroductionODM 1.3 - FeaturesODM 1.3 - LimitationsVendor ExtensionsFuture PlansQuestions

Agenda

Page 10: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

ODM 1.3 - Limitations

• FormalExpression contains free text – no specification of content

• Different systems use widely varying implementations (SQL, Javascript, VBScript, C#, SAS procedures, etc)

• Difficult to create complex, multi-variate checks that are system-independent and transferrable

10

Page 11: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

IntroductionODM 1.3 - FeaturesODM 1.3 - LimitationsVendor ExtensionsFuture PlansQuestions

Agenda

Page 12: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

Vendor Extensions• ‘Recommended for … information that cannot

be expressed conveniently in the ODM model’• Medidata have created vendor extensions to

completely represent an EDC study in ODM• Edit checks and derivations are expressed in an

XML vendor extension structure

12

Page 13: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

Vendor Extensions - Example

13

Page 14: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

IntroductionODM 1.3 - FeaturesODM 1.3 - LimitationsVendor ExtensionsFuture PlansQuestions

Agenda

Page 15: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

Future PlansMedidata already makes extensive use of

automated application software testing using Cucumber:

‘Cucumber is a tool that can execute plain-text functional descriptions as automated tests. The language that Cucumber understands is called Gherkin. ‘

http://cukes.info/15

Page 16: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

Using Gherkin to define edit checks

16

Page 17: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

Multi-variate checks

17

Page 18: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

© 2010 Medidata Solutions Worldwide

Cross form checks

18

Page 19: Data Cleaning Using ODM CDISC ESUG Meeting Andrew Newbigging Vice President, Integrations Development 13 th July 2010 Medidata Solutions, Inc. Proprietary

IntroductionODM 1.3 - FeaturesODM 1.3 - LimitationsVendor ExtensionsFuture PlansQuestions

Agenda