Semantic Technology in Document Management

Applications of Semantic Technology in Document Management. Washington DC, November 2011. George Roth, Adonis Damian. www.recognos.com

Upload: george-roth

Post on 16-May-2015

DESCRIPTION

This is Recognos's vision of the future of Semantic Technology in Document Management. The presentation was created for the SemTech Conference in Washington DC in November 2011.

TRANSCRIPT

Page 1: Semantic Technology in Document Management

Applications of Semantic Technology in Document Management

Washington DC, November 2011
George Roth, Adonis Damian
www.recognos.com

Page 2: Semantic Technology in Document Management

What is Document Management?

A document management system (DMS) is a computer system (or set of computer programs) used to track and store electronic documents and/or images of paper documents. It is usually also capable of keeping track of the different versions created by different users (history tracking). The term has some overlap with the concepts of content management systems. It is often viewed as a component of enterprise content management (ECM) systems and related to digital asset management, document imaging, workflow systems and records management systems.

Goal: make non-formatted content equivalent to formatted data!

November 2011

Page 3: Semantic Technology in Document Management

DMS is changing!

CLASSICAL

- Metadata
- Integration
- Capture
- Indexing
- Storage
- Retrieval
- Distribution
- Security
- Workflow
- Collaboration
- Versioning
- Search
- Publishing
- …

NEW

- Compliance
- Accessibility
- Interactivity
- Augmentation
- Translation
- Linking – Relationships
- Sentiment Analysis
- New Search (Semantic Tagging, Deep Search, NL Questions)

Page 4: Semantic Technology in Document Management

Processing documents is getting harder and harder!

- Volume
- Labor intensive
- The “research project”: 40%–60% data gathering
- Metadata independent of content
- Shallow Search
- Hard to understand by non-experts

Page 5: Semantic Technology in Document Management

New Tools: Semantic Technologies

- NLP (Natural Language Processing) – understand the meaning of documents (statistical, machine learning, hybrid, graph-based)
- Semantic Search – tagging
- Data Integration
- Sentiment Analysis
- Linked Open Data – Linked Data
- Inference – Reasoning

Page 6: Semantic Technology in Document Management

Semantic Technologies – Outside and Inside the Enterprise

- Inside – Controlled Environment – TRUST
- Inside – Security issues
- Same techniques as outside the enterprise
- Integrates non-formatted with formatted data
- Easy to measure the effects – ROI
- Add-on to the existing KM models
- Emerging area – Semantic technologies started on the WWW

Page 7: Semantic Technology in Document Management

Document Management is changing!

New features will become commodities in 2-3 years:

- Compliance
- Data Extraction, Comparison, Change Analysis
- Interactivity
- Augmentation
- Translation
- Linking – Relationships
- Sentiment Analysis
- New Search (Semantic Tagging, Deep Search, NL Questions)

Page 8: Semantic Technology in Document Management

Biggest Acquisitions

- Microsoft: Powerset (Bing), Fast Search, Jinni
- Google: Freebase, Needlebase
- Apple: SIRI
- Etc…

Page 9: Semantic Technology in Document Management

New Document Management

Embedded Compliance Rules

Page 10: Semantic Technology in Document Management

Compliance Rules

- Example: there is a rule for email, Rule 0134C: “Not allowed to mention a percentage as a profit promise when investing with the firm”
- In an email: “Dear John, Our company has an amazing method to invest, so that you will make at least 10% profit in 3 months !!!!”
- The email was stopped and sent to Compliance with the message: “Violation of the Rule 0134C”
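
The check described on this slide could be approximated as follows. This is only a minimal sketch under stated assumptions: the regular expressions, the `check_rule_0134c` and `route_email` names, and the list of profit-related words are all hypothetical illustrations, not the rule engine used in the presentation, which presumably relies on NLP rather than pattern matching.

```python
import re
from dataclasses import dataclass
from typing import List

RULE_ID = "0134C"  # rule number taken from the slide

# Assumption: a "percentage" is a number followed by a percent sign.
PERCENT = re.compile(r"\d+(?:\.\d+)?\s?%")
# Assumption: these words signal a profit promise; a real system would use NLP.
PROFIT_TERMS = re.compile(r"\b(?:profit|return|gain)s?\b", re.IGNORECASE)


@dataclass
class Violation:
    rule_id: str
    sentence: str


def check_rule_0134c(email_body: str) -> List[Violation]:
    """Flag every sentence that mentions both a percentage and a profit term."""
    violations = []
    # Naive sentence split on whitespace that follows ., ! or ?
    for sentence in re.split(r"(?<=[.!?])\s+", email_body):
        if PERCENT.search(sentence) and PROFIT_TERMS.search(sentence):
            violations.append(Violation(RULE_ID, sentence.strip()))
    return violations


def route_email(email_body: str) -> str:
    """Stop the email and report the rule if it is violated, else deliver it."""
    if check_rule_0134c(email_body):
        return f"Violation of the Rule {RULE_ID}"
    return "delivered"
```

Calling `route_email` on the example email from the slide returns the violation message, while an email that quotes a percentage without any profit wording (for instance a fee) is delivered. Keyword matching like this will miss paraphrases, which is the gap semantic analysis is meant to close.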

Page 11: Semantic Technology in Document Management

Data Compliance

- MFIP data extraction
- Link to the original document

Page 12: Semantic Technology in Document Management

New Document Management

Data Extraction, Comparison, Change Analysis

Page 13: Semantic Technology in Document Management

Data Extraction – Semantic Rules

Page 14: Semantic Technology in Document Management

Data Comparison – Semantic Rules

Page 15: Semantic Technology in Document Management

Change Analysis - Alarms - Semantic

- Create an alarm when the Trading Policy changes
- Create an alarm when Commissions change (fields)
- Create an alarm when a member of the Board changes
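
One way to picture these alarms is as a comparison between the fields extracted from two versions of a document. The sketch below is a simplified illustration, not the presenters' implementation: the field names (`trading_policy`, `commissions`, `board_members`), the `Alarm` class and the `detect_changes` function are hypothetical, and it assumes the semantic extraction step has already produced a dictionary of fields for each version.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Sequence

# Hypothetical field names standing in for the three alarms on the slide.
WATCHED_FIELDS = ("trading_policy", "commissions", "board_members")


@dataclass
class Alarm:
    field: str
    old_value: Any
    new_value: Any


def detect_changes(
    old: Dict[str, Any],
    new: Dict[str, Any],
    watched: Sequence[str] = WATCHED_FIELDS,
) -> List[Alarm]:
    """Return one alarm per watched field whose extracted value changed."""
    alarms = []
    for name in watched:
        before, after = old.get(name), new.get(name)
        if before != after:
            alarms.append(Alarm(name, before, after))
    return alarms


# Usage: two versions of the same (made-up) extracted document data.
previous = {
    "trading_policy": "Orders are executed at market close.",
    "commissions": {"equity": 0.010},
    "board_members": frozenset({"A. Smith", "B. Jones"}),
}
current = {
    "trading_policy": "Orders are executed at market close.",
    "commissions": {"equity": 0.015},
    "board_members": frozenset({"A. Smith", "C. Lee"}),
}
for alarm in detect_changes(previous, current):
    print(f"ALARM: {alarm.field} changed")
```

Board members are stored as a `frozenset` so that a mere reordering of names does not raise an alarm; only an actual change in membership does.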

Page 16: Semantic Technology in Document Management

New Document Management

Interactivity

Page 17: Semantic Technology in Document Management

Interactivity

Page 18: Semantic Technology in Document Management

New Document Management

Augmentation

Page 19: Semantic Technology in Document Management

Augmentation

Page 20: Semantic Technology in Document Management

New Document Management

Automated Translation

Page 21: Semantic Technology in Document Management

Translation

- Google Translate
  - Great for simple translation: emails, non-technical documents
- Language Weaver
  - Specialized translation through machine learning
  - Train the system per domain

Page 22: Semantic Technology in Document Management

New Document Management

Sentiment Analysis

Page 23: Semantic Technology in Document Management

Sentiment Analysis

- Media Sentry
- Open Amplify, Expert System, Lymbix
- NLP and machine learning

Page 24: Semantic Technology in Document Management

Sentiment Analysis

Page 25: Semantic Technology in Document Management

New Document Management

Search

Page 26: Semantic Technology in Document Management

Shallow Search vs. Deep Search

Page 27: Semantic Technology in Document Management

Deep Document Search

Page 28: Semantic Technology in Document Management

Faceted Search

Page 29: Semantic Technology in Document Management

Faceted Search

Page 30: Semantic Technology in Document Management

Ask Questions – Document Adviser

Page 31: Semantic Technology in Document Management

NLP Search

Page 32: Semantic Technology in Document Management

New Document Management

Complex App Samples

Page 33: Semantic Technology in Document Management

Complex Apps: Media Monitoring – Media Sentry

Page 34: Semantic Technology in Document Management

Media Sentry – Under the Hood

Architecture diagram components:

- Sources: WWW, Websites, Google Alerts, Meltwater Alerts, Twitter, Facebook, Forums / Blogs
- External Data Pull: Twitter Adapter, Facebook Adapter, 80legs Adapter, Diffbot Adapter
- Internal Message Storage: Exchange Adapter, Exchange Server, File Server
- Natural Language Processing: Uploaded Taxonomy, ESSEX
- Data Storage: MS SQL Server
- Web User Interface
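
The adapter-based layout in this diagram can be sketched roughly as below. This is an illustrative assumption about how such a pipeline might be wired, not Media Sentry's actual code: `SourceAdapter`, `StaticAdapter`, `tag_with_taxonomy` and `run_pipeline` are hypothetical names, the adapters return canned text instead of calling Twitter or Exchange, simple keyword matching stands in for the NLP step, and a Python list stands in for the MS SQL Server store.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass, field
from typing import Iterable, List, Set


@dataclass
class Message:
    source: str
    text: str
    tags: List[str] = field(default_factory=list)


class SourceAdapter(ABC):
    """Common interface for every external or internal data source."""

    @abstractmethod
    def pull(self) -> Iterable[Message]:
        ...


class StaticAdapter(SourceAdapter):
    """Stand-in adapter that yields canned texts instead of calling a real API."""

    def __init__(self, name: str, texts: Iterable[str]) -> None:
        self.name = name
        self._texts = list(texts)

    def pull(self) -> Iterable[Message]:
        for text in self._texts:
            yield Message(source=self.name, text=text)


def tag_with_taxonomy(message: Message, taxonomy: Set[str]) -> Message:
    """Placeholder for the NLP step: tag by case-insensitive term lookup."""
    lowered = message.text.lower()
    message.tags = sorted(term for term in taxonomy if term.lower() in lowered)
    return message


def run_pipeline(
    adapters: Iterable[SourceAdapter], taxonomy: Set[str], store: List[Message]
) -> List[Message]:
    """Pull from each adapter, tag each message, and append it to the store."""
    for adapter in adapters:
        for message in adapter.pull():
            store.append(tag_with_taxonomy(message, taxonomy))
    return store
```

Putting every source behind the same `pull` interface means a new feed only needs a new adapter, while the tagging and storage steps stay unchanged.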

Page 35: Semantic Technology in Document Management

CRM – Intelligent Call Center

- Amdocs AIDA (Amdocs Intelligent Decision Automation)

Page 36: Semantic Technology in Document Management

CRM - Event Extraction

Page 37: Semantic Technology in Document Management

Interactive Book

- Display Linked Data
- Ask a question – semantic search
- Entity Lookup

Page 38: Semantic Technology in Document Management

Interactive Books - sirBook

Page 39: Semantic Technology in Document Management

Counterparty Risk

Page 40: Semantic Technology in Document Management

Counterparty Risk

Page 41: Semantic Technology in Document Management

Counterparty Risk

Page 42: Semantic Technology in Document Management

Discovery through Inference (1)

Page 43: Semantic Technology in Document Management

Discovery through Inference (2)

Page 44: Semantic Technology in Document Management

Ultimate Goal – The Smart Document

- Interactive – Exists
- Search – Semantic Search, Q&A
- Semantic Tagging – Summarization
- LOD with domains
- Linked: People, Companies, Locations, Specific Terms
- Example: a travel book

Page 45: Semantic Technology in Document Management

Used technologies

The following technologies were used:

- iQser – GIN
- Clark & Parsia – Spanner, StarDog
- Expert System – NLP
- GATE
- Smart Logic – Enterprise Query Platform – Fast Search – Microsoft Sharepoint 11
- Revelytix
- Cognition
- Franz Systems
- DiffBot
- Ontotext

Page 46: Semantic Technology in Document Management

Contact

George Roth
President and CEO, Recognos Inc.
San [email protected]

Drew Warren
CEO, Recognos Financial
New [email protected]