xml work flow


Upload: heyday-software-solutions

Post on 01-Jul-2015


TRANSCRIPT

Page 1: Xml Work Flow
Page 2: Xml Work Flow

Current Projects

E-Publishing
- IMF
- Wiley UK
- VST

Page 3: Xml Work Flow

PDF to XML Work Flow

Data Capture -> Coding -> Validation -> E-Deployment

Page 4: Xml Work Flow

Data Capture

Capture Text, Box-Text and Box-Footnotes from Source PDF (Chapters only)

Capture Chapter/Article Footnotes from Source PDF (Chapters only)

Capture Images & Tables as JPG and Image-Related Text as HTML from Source PDF

Capture Table Content from the Source PDF as Text and add IMF Tags

Page 5: Xml Work Flow

Coding

Merging of all the Data Capture tasks as per IMF specification

Creating Front Matter from source PDF parts (TOC, Preface, Abbreviations, Main Messages)

Creating Back Matter from source PDF parts (Appendixes, Glossary, References)

Image Editing as per IMF specification

Merging of all the above tasks as per IMF specification

Page 6: Xml Work Flow

Validation

QC With Epsilon

QC With Browsers for desired View

QC With Oxygen

Page 7: Xml Work Flow

E-Deployment

Deploy in Customer Desired Format

Page 8: Xml Work Flow

[Workflow diagram: INPUT (PDF) -> TASK 1 to TASK 13, with the QC department -> DELIVERABLE (XML)]

Page 9: Xml Work Flow

TASK 1: Capture Text, Box-Text and Box-Footnotes from Source PDF (Chapters)

TASK 2: Capture Footnotes of Chapter/Article from Source PDF

TASK 3: Capture Images & Tables as JPG from Source PDF (Chapters)

TASK 4: Capture Table Data as Text from Source PDF and add IMF Table Tags

TASK 5: Merge the output of Tasks 1 to 4 into one and add required IMF Tags

TASK 6: Capture Front Matter from Source PDF (TOC, Preface, Abbreviations, Main Messages)

TASK 7: Capture Back Matter from Source PDF (Appendixes, Glossaries and References)

TASK 8: Edit all Images to set required resolution and size

TASK 9: Merge Tasks 5 to 8 to get final output

Validation Through Epsilon

Validation Through Browser for Desired View

Validation against IMF-DTD using Oxygen
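The task list implies an order of execution: the merge tasks cannot start until the capture tasks that feed them are done. A minimal sketch of that ordering, using Python's standard `graphlib` module (Python 3.9+). The inputs of Tasks 5, 9 and 10 follow the slides; the chain from Task 10 to Task 13 is an assumption based on the three validation levels, and the `DEPENDENCIES` table and `execution_order` function are illustrative names, not part of the original workflow.

```python
from graphlib import TopologicalSorter

# Each task maps to the set of tasks whose output it consumes.
# Tasks 5, 9 and 10 follow the inputs stated on the slides; the chain
# 10 -> 11 -> 12 -> 13 is an assumption based on the validation levels.
DEPENDENCIES = {
    5: {1, 2, 3, 4},
    9: {5, 6, 7, 8},
    10: {9},
    11: {10},
    12: {11},
    13: {12},
}


def execution_order(dependencies):
    """Return one valid order in which the tasks can be run."""
    return list(TopologicalSorter(dependencies).static_order())


order = execution_order(DEPENDENCIES)
print(order)
```

Tasks 1 to 4 and 6 to 8 have no prerequisites in this table, so they can be handed out in parallel; only the merge and validation tasks are forced into sequence.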

Page 10: Xml Work Flow

Tasks Distribution and Methodology

Team Members: Do Task 1, Task 2, Task 3, Task 4 Using Abbyy FineReader (Capturing Various Types of Data)

Team Leaders: Do Task 5, Task 6, Task 7, Task 8, Task 9 Using Epsilon Editor (Code around the Data)

Quality Analyst: Do Task 10, Task 11, Task 12, Task 13 Using Epsilon, DTD, XSL, Oxygen (Validate the Code and Data)

Page 11: Xml Work Flow

TASK 1 : SAMPLE

Description : Capture Text from Source PDF (Only Chapters) Using OCR Tool

Input : Source PDF

Page 12: Xml Work Flow

TASK 1 : SAMPLE

Output : One HTML file for each Chapter/Article

Page 13: Xml Work Flow

TASK 2 : SAMPLE

Description : Capture Chapter/Article Footnotes from Source PDF (Chapters only)

Input : Source PDF

Page 14: Xml Work Flow

TASK 2 : SAMPLE

Output : One HTML file for each Chapter/Article, or multiple HTML files when a footnote ID repeats
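The rule of one file unless a footnote ID repeats can be sketched as a simple grouping step. This is an illustration under assumptions: footnotes are taken to be `(id, text)` pairs in reading order, and `split_on_repeated_ids` is a hypothetical helper, not a tool named in the slides. Each returned group would become one HTML file.

```python
def split_on_repeated_ids(footnotes):
    """Group (footnote_id, text) pairs so that no ID repeats inside a group.

    A new group starts whenever an ID that is already in the current
    group shows up again, for example when numbering restarts at 1.
    """
    groups = []
    current = []
    seen = set()
    for footnote_id, text in footnotes:
        if footnote_id in seen:
            # The ID repeats: close the current group and start a new one.
            groups.append(current)
            current, seen = [], set()
        current.append((footnote_id, text))
        seen.add(footnote_id)
    if current:
        groups.append(current)
    return groups


# Hypothetical sample: numbering restarts after the second footnote.
sample = [("1", "First note"), ("2", "Second note"), ("1", "Restarted note")]
groups = split_on_repeated_ids(sample)
print(len(groups))
```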

Page 15: Xml Work Flow

TASK 3 : SAMPLE

Description : Capture Images & Tables as JPG and Image-Related Text as HTML from Source PDF

Input : Source PDF

Page 16: Xml Work Flow

TASK 3 : SAMPLE

Output : Multiple JPGs & One HTML

Page 17: Xml Work Flow

TASK 4 : SAMPLE

Description : Capture Table Content from the Source PDF as Text and add IMF TAGS

Input : Source PDF

Page 18: Xml Work Flow

TASK 4 : SAMPLE

Output : HTML

Page 19: Xml Work Flow

TASK 5 : SAMPLE

Description : Merging of all the above Tasks (1 to 4) as per IMF specification

Input : Task 1 to Task 4

Output: HTML
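A rough sketch of what such a merge step might look like for one chapter. The IMF specification itself is not reproduced in these slides, so the fragment labels in `MERGE_ORDER`, the `div` wrappers and the `merge_chapter` function are all hypothetical stand-ins; a real merge would apply the IMF tags instead.

```python
from html import escape

# Order in which the captured pieces are stitched together. The keys are
# hypothetical labels for the outputs of Tasks 1 to 4, not IMF tag names.
MERGE_ORDER = ["text", "footnotes", "images", "tables"]


def merge_chapter(title, fragments):
    """Combine the per-task HTML fragments of one chapter into one page."""
    missing = [name for name in MERGE_ORDER if name not in fragments]
    if missing:
        raise ValueError("missing task output: " + ", ".join(missing))
    sections = [
        '<div class="' + name + '">\n' + fragments[name] + "\n</div>"
        for name in MERGE_ORDER
    ]
    body = "\n".join(sections)
    return (
        "<html>\n<head><title>" + escape(title) + "</title></head>\n"
        "<body>\n" + body + "\n</body>\n</html>\n"
    )
```

Refusing to merge when a task output is missing keeps an incomplete chapter from slipping through to the later validation tasks.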

Page 20: Xml Work Flow

TASK 6 : SAMPLE

Description : Capture Front Matter from source PDF parts (TOC, Preface, Abbreviations, Main Messages)

Input : Source PDF

Page 21: Xml Work Flow

TASK 6 : SAMPLE

Output : HTML

Page 22: Xml Work Flow

TASK 7 : SAMPLE

Description : Capture Back Matter from source PDF parts (Appendixes, Glossary, References)

Input : Source PDF

Page 23: Xml Work Flow

TASK 7 : SAMPLE

Output : HTML

Page 24: Xml Work Flow

TASK 8 : SAMPLE

Description : Image Editing as per IMF specification

Input : Source PDF

Output : Final JPGs

Page 25: Xml Work Flow

TASK 9 : SAMPLE

Description : Merging of all the above tasks (5, 6, 7, 8) as per IMF specification

Input : Task 5 to Task 8

Output : Final XML without Validation

Page 26: Xml Work Flow

TASK 10 : SAMPLE

Description : First Level Validation With Epsilon

Output : XML

Input : Task 9 - XML

Page 27: Xml Work Flow

TASK 11 : SAMPLE

Description : Validation With Browsers for desired View

Output : Final XML Validation- Second Level

Page 28: Xml Work Flow

TASK 12 : SAMPLE

Description : Validation With Oxygen against the IMF DTD

Output : Final XML Validation- Third Level
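A file must be well-formed XML before it can be checked against any DTD. The sketch below covers only that pre-check, using Python's standard `xml.etree.ElementTree`, which does not validate against a DTD; the DTD check itself stays with a validating tool such as Oxygen. The function name and the sample element names are illustrative assumptions, not taken from the IMF DTD.

```python
import xml.etree.ElementTree as ET


def well_formedness_error(xml_text):
    """Return None if xml_text is well-formed XML, else the parser message."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError as exc:
        return str(exc)
    return None


# Hypothetical samples: the second one closes the wrong element.
print(well_formedness_error("<chapter><title>Overview</title></chapter>"))
print(well_formedness_error("<chapter><title>Overview</chapter>"))
```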

Page 29: Xml Work Flow

TASK 13 : SAMPLE

Description : Packing Process in Desired Manner

Output : Deliverable Product
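The slides do not say what the desired packing looks like, so the following is only one possible shape: a ZIP archive built with Python's standard `zipfile` module. The layout (XML at the top level, images under `images/`) and the `pack_deliverable` function are hypothetical.

```python
import zipfile
from pathlib import Path


def pack_deliverable(xml_path, image_paths, archive_path):
    """Write the XML and its images into one ZIP archive.

    The layout (XML at the top level, images under "images/") is a
    hypothetical choice; the customer's packing rules decide the real one.
    """
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.write(xml_path, arcname=Path(xml_path).name)
        for image_path in image_paths:
            archive.write(image_path, arcname="images/" + Path(image_path).name)
    return archive_path
```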

Page 30: Xml Work Flow