khemeia tdw live presentation nov16

14
1 Generating value from your legacy content – you are not alone TDW Live – Nov 16 2016 Content Transformation Software: Khemeia™ Maria Shiao VP Business Development Stelae Technologies

Upload: maria-shiao

Post on 17-Jan-2017

13 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Khemeia TDW Live Presentation NOV16

1

Generating value from your legacy content – you are not aloneTDW Live – Nov 16 2016

Content Transformation Software: Khemeia™

Maria ShiaoVP Business DevelopmentStelae Technologies

Page 2: Khemeia TDW Live Presentation NOV16

2

Big Data and its myths…

21/11/2016 Stelae Technologies – TDW Live 2

Page 3: Khemeia TDW Live Presentation NOV16

3

Paper = unstructuredDigital ≠ structured

Also … unstructured ≠ old

• Reusable• Searchable• Compliant

• Industry/RegulatoryStandards

• Internal data and meta data standard – enables big data analytics21/11/2016 Stelae Technologies – TDW Live 3

Page 4: Khemeia TDW Live Presentation NOV16

4

What’s happening in other industries…

LegalLegislationRegulation

FinancialAccountingRiskCompliance

Publishing

Scale and automationStandardizationSkills optimizationEnterprise-wide insights (faster, better)Managing compliance and risk

21/11/2016 Stelae Technologies – TDW Live 4

Page 5: Khemeia TDW Live Presentation NOV16

5

What makes unstructured data valuable?

21/11/2016 Stelae Technologies – TDW Live 5

Page 6: Khemeia TDW Live Presentation NOV16

6

The value chain of (unstructured) data

StructuredContent(20%)

UnstructuredContent (80%)

Unified Standardised

Structured Content

Management Search

Analytics

ManagementEnd Users(devices,

mobile, AR)

21/11/2016 Stelae Technologies – TDW Live 6

Page 7: Khemeia TDW Live Presentation NOV16

7

What are the options today?

Input: PDF Word Text

OCR Scans

Output: XML DITA S1000D XBRL/iXBRL html

Outsourcing or semi-automated scripts• Cost• Speed• Quality• Bias

Text and semantic meta-tagging• Size of dictionaries• Visual structures• Language• Bias

21/11/2016 Stelae Technologies – TDW Live 7

Page 8: Khemeia TDW Live Presentation NOV16

8

Typical Editor-based Transformation Workflow

Source Documents: Multi Format searchable PDF/ Word

Create DM Ref

OCR images

Start Workflow

Glossary of s1000D tags

Manual copy paste from text into s1000D editor

Extract images from PDF –TIFF/JPEG

Bring it together into s1000D editor

Draft and Quality Check/ Approval

Publish Final Copy

End

21/11/2016 Stelae Technologies – TDW Live 8

Page 9: Khemeia TDW Live Presentation NOV16

9

Challenges

TIME

TRAINING ON EDITOR

TRAINING ON TAGS – PROCEDURES, DESCRIPTIVE, IPC …

COPY PASTE OPERATION

SKILL

KNOWLEDGE OF S1000D

HIGH COST DUE TO

TRAINED RESOURCES

IN-ABILITY TO SCALE

PRODUCTIVITY CONSTRAINTS

TIME FOR RESOURCE RAMP-UP

21/11/2016 Stelae Technologies – TDW Live 9

Page 10: Khemeia TDW Live Presentation NOV16

10

Khemeia based Transformation Workflow

Source Documents: Multi Format searchable PDF/ Word

Create DM Ref

OCR images

Start Workflow

Glossary of s1000D tags

Manual copy paste from text into s1000D editor

Extract images from PDF –TIFF/JPEG

Bring it together into s1000D editor

Draft and Quality Check/ Approval

Publisher of Editor

End

Import to CSDB/ Publisher

21/11/2016 Stelae Technologies – TDW Live 10

Page 11: Khemeia TDW Live Presentation NOV16

11

Easy Conversion with Khemeia™

Minimal Change Management

Drag file

Drop file

Hot Folder Configurations

identify Document Type and Layout

Identify tables and content layout

Uses Glossary to Tag words

Automatable to directly drop into the folders

Feedback to existing process

21/11/2016 Stelae Technologies – TDW Live 11

Page 12: Khemeia TDW Live Presentation NOV16

12

Khemeia overcomes Challenges

TIME

TRAINING ON EDITOR

TRAINING ON TAGS – PROCEDURES, DESCRIPTIVE, IPC …

COPY PASTE OPERATION

SKILL

EVERY OPERATOR DOES

NOT NEED KNOWLEDGE OF

S1000D

HIGH COST DUE TO

TRAINED RESOURCES

IN-ABILITY TO SCALE

PRODUCTIVITY CONSTRAINTS

TIME FOR RESOURCE

RAMP-UP

21/11/2016 Stelae Technologies – TDW Live 12

Page 13: Khemeia TDW Live Presentation NOV16

13

Conclusions

Unstructured legacy data has value Why

Which data/meta-data sets have the most relevance and impact downstream

This value has to be extracted upfront Direct cost

Operational, lifecycle cost

Time

Technology is available to largely automate the process Proven in other industries

Significant early adopters in the A&D sector

21/11/2016 Stelae Technologies – TDW Live 13

Page 14: Khemeia TDW Live Presentation NOV16

14

Q&AThank You!

Contact:Maria Shiao, VP Business Development+44 7779 77 89 [email protected]@mariashiao

www.stelae-technologies.com@stelaetech