from print to the cloud and beyond: the story of a century old company and its resiliency to...

29
From Print to the Cloud and Beyond The Story of a Century Old Company and its Resiliency to Ever-Evolve

Upload: prolifics

Post on 16-Jul-2015

106 views

Category:

Technology


0 download

TRANSCRIPT

From Print to the Cloud and Beyond

The Story of a Century Old Company and its Resiliency to Ever-Evolve

Agenda

CAS Overview

CAS - In the Beginning… There was Print

CAS - The Age of Silos

CAS - IBM Integration. To the Cloud… and Beyond

Future Considerations

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.2

Agenda

CAS Overview

CAS - In the Beginning… There was Print

CAS - The Age of Silos

CAS - IBM Integration. To the Cloud… and Beyond

Future Considerations

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.3

CAS helps scientists around the world benefit from the published

work of their colleagues by monitoring, abstracting and indexing the

world's chemistry-related literature

CAS has been supporting scientists for more than 100 years

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.4

Since 1907, CAS’s objective

has been to find, collect, and

organize all publicly disclosed

chemistry substance

information

CAS helps scientists around the world benefit from the published work of their colleagues

CAplusSM

CAS REGISTRYSM

CHEMLIST®

CIN®

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.5

Markush

Indexing

Authority

Processing

Source

Selection

Document

Indexing

Reaction

Indexing

MARPAT®

CHEMCATS®

CAS scientists monitor, abstract and index the world's chemistry-

related literature

Proprietary, standardized indexing in CAS databases ensures

consistent, comprehensive search results.

CASREACT®

CAS products and services make it faster and easier for scientist to find the information they need for their research

CAS Registry Numbers® uniquely identify each

chemical substance without the ambiguity of multiple

naming conventions

STN® combines industry-leading search and retrieval

with unique and comprehensive content

SciFinder® offers a one-stop shop experience with

flexible search and discover options based on user

input and workflow

Science IP®, the CAS information search service

provides fast, comprehensive and accurate searches

of the world’s scientific and technical literatureCAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.6

CAS Registry Number 58-08-2

CAFFEINE!

Agenda

CAS Overview

CAS - In the Beginning… There was Print

CAS - The Age of Silos

CAS - IBM Integration. To the Cloud… and Beyond

Future Considerations

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.7

CAS Timeline108 Years of Progress (and Counting)

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.8

“CAS Knows Jack”Jack and Friends Beside Printed Chemical Abstracts

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.10

Agenda

CAS Overview

CAS - In the Beginning… There was Print

CAS - The Age of Silos

CAS - IBM Integration. To the Cloud… and Beyond

Future Considerations

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.11

Data Ingestion Data Transformation Data Validation Data Normalization Data Persistence

CAS End-To-End Architecture“The Age of Silos”

Data Ingestion Data Transformation Data Validation Data Curation Data Integration Data Persistence

Data Transformation Data Validation Data Integration Data Presentation

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.12

Silo Challenges

Multiple Data Ingestion Points– In some cases, the same data is being ingested twice

Multiple Views of the Data– Each silo must perform complex transformations to its specific view

– Editorial manufactures normalized data based on a print model

– Product Development wants de-normalized, complete data

– Content Delivery has a mixed view of the data

Multiple Vocabulary Conventions– Differing data definitions causes confusion across silos

No Unified, Authority Data Store– Each silo has their own copy of the data in its own specific vocabulary

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.13

Editorial Legacy Systems

Many disparate databases used to store relational data– Becomes difficult to maintain and support

Multiple database technologies used– No unified platform

Challenges to support legacy systems– Some legacy technologies are no longer supported

– Succession planning difficult to support legacy systems

– Special IT used so that legacy code would not need to be touched

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.14

Content Delivery Systems

Data was transformed into one common data model to bridge

gap between Editorial and Product View– One common schema model was complex and unwieldy

– Common model contained “unnecessary” complexities

– Common model did not align with Product Development’s specifications

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.15

Product Development Systems

Product Development must code for “unnecessary” complexities

Data not completely de-normalized– Additional development necessary to compile data

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.16

Silo Challenges

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.17

By the Numbers

Thousands of journals ingested per day– Approximately 1 TB of data per week

Over 100 other data feeds ingested per day

Over 1.2 million messages processed per day– Synced up with product data daily in less than 10 minutes

Over 6 TB of compiled data created per day

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.18

What is an Architect to Do?

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.19

Unify…Integrate…Simplify

Unify Data; Processes; Transformations; Data Ingestion

Integrate Disparate Systems; Services; Applications; and

Data Consumers

Simplify the Architecture!!!

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.20

• Run proof-of-concept and/or proof-of-technology and/or pilot project as needed

• Negotiate contract

• Adjust as needed

• Selection team members score vendor solutions

• Aggregate scores

• Select vendor with best aggregate score (judgement required)

• Bake-off if winner is too close to call

• Send RFP document to prospective vendors

• Hold clarification meetings with vendor teams

• Vendors send RFP response documents

• Vendors present their solutions and answer questions

• Create technology selection team

• Identify key requirements (based on architecture and tech stack governance)

• Assign weights

• Create RFP document and scorecard spreadsheet

Request For Proposal

Create RFP

Engage vendors

Score-driven selection

Validate selection

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.

Requirements

Data Integration

Durable Message Bus with Guaranteed Delivery

Any-to-Any Connectivity

Architectural Flexibility

Excellent Support

A Proven Solution

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.22

Agenda

CAS Overview

CAS - In the Beginning… There was Print

CAS - The Age of Silos

CAS - IBM Integration. To the Cloud… and Beyond

Future Considerations

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.23

Unify…Integrate…Simplify

Data Curation

Data Ingestion Data Transformation Data Validation Data Normalization Data Integration

Data Transformation Data Validation Data Integration Data Presentation

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.24

Data Persistence Data Flow Orchestration

Agenda

Overview

CAS - In the Beginning… There was Print

CAS - The Age of Silos

CAS - IBM Integration. To the Cloud… and Beyond

Future Considerations

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.25

To the Cloud… and Beyond!

Off-Prem Processing Bursting Capabilities Data Center Relief Co-Location Capabilities

New Mobile Applications

Service Unification Service Management Service Integration

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.26

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.27

Questions

CAS is a division of the American Chemical Society. Copyright 2015 American Chemical Society. All rights reserved.28

Connect with CAS:

Joseph Sapp

Lead Enterprise Application Architect

[email protected]

www.linkedin.com/in/joesapp