watson api use case demos for the nittany watson challenge

29
Watson Use Case Demos for the Nittany Watson Challenge January 2017 Mike Pointer, Watson Sr. Solution Architect

Upload: penn-state-edtech-network

Post on 07-Feb-2017

63 views

Category:

Education


3 download

TRANSCRIPT

Page 1: Watson API Use Case Demos for the Nittany Watson Challenge

Watson Use Case Demosfor the Nittany Watson Challenge

January 2017Mike Pointer, Watson Sr. Solution Architect

Page 2: Watson API Use Case Demos for the Nittany Watson Challenge

IBM WATSON CAPABILITIES

2

Watson Microservices

Language Services

Speech Services

Vision Services

Data Services

Embodied Cognition

Watson Knowledge Studio

25+ Services

Page 3: Watson API Use Case Demos for the Nittany Watson Challenge

Discovery

25+ WATSON MICROSERVICES

3

Language ServicesAlchemyLanguageConversationDocument ConversionLanguage TranslatorNatural Language ClassifierPersonality InsightsRetrieve and RankTone AnalyzerEntity ExtractionSentiment AnalysisEmotion AnalysisKeyword ExtractionConcept TaggingRelation ExtractionTaxonomy ClassificationAuthor Extraction

Language DetectionText ExtractionMicroformats ParsingFeed DetectionLinked Data SupportSpeech ServicesSpeech to TextText to SpeechVision ServicesVisual RecognitionSimilarity SearchData Insight ServicesAlchemyData NewsDiscoveryTradeoff AnalyticsEmbodied Cognition ServicesIntu

Watson’s APIs are the cognitive building blocks that harness our data.

Vision Recognition

Conversation

Page 4: Watson API Use Case Demos for the Nittany Watson Challenge

Use Cases and Demos

Speech to Text – Multiple SpeakersChatbot – School NavigatorWatson Conversation ServiceWatson Discovery ServiceWatson Knowledge Studio

Page 5: Watson API Use Case Demos for the Nittany Watson Challenge

Speech to Text

5

Page 6: Watson API Use Case Demos for the Nittany Watson Challenge

SPEECH TO TEXT WITH DIARIZATION – MULTIPLE SPEAKERS

6© Copyright IBM Corporation 2016 https://speech-to-text-demo.mybluemix.net/

Page 7: Watson API Use Case Demos for the Nittany Watson Challenge

Speech to SpeechSpeech to Text Language Translation Text to Speech

7

Page 8: Watson API Use Case Demos for the Nittany Watson Challenge

SPEECH TO SPEECH – WITH LANGUAGE TRANSLATION

8© Copyright IBM Corporation 2016

https://speech-to-speech-app.mybluemix.net/?cm_mc_uid=75059567007614843110528&cm_mc_sid_50200000=

Page 9: Watson API Use Case Demos for the Nittany Watson Challenge

Watson Virtual AgentEngagement Your Customers, Students, Faculty, Citizens

9

Page 10: Watson API Use Case Demos for the Nittany Watson Challenge

© Copyright IBM Corporation 2016 10

Page 11: Watson API Use Case Demos for the Nittany Watson Challenge

NYC School FinderMatching Personality with Assisted Decision Making (Tradeoff Analytics)

11

Page 12: Watson API Use Case Demos for the Nittany Watson Challenge

NYC SCHOOL FINDER

12

https://nyc-school-finder.mybluemix.net/?cm_mc_uid=75059567007614843110528&cm_mc_sid_50200000=

Page 13: Watson API Use Case Demos for the Nittany Watson Challenge

School NavigatorWatson Conversation Service Chatbot

13

Page 14: Watson API Use Case Demos for the Nittany Watson Challenge

©IBM 2016

Choosing a graduate school is BIG decision, let Watson help…

14

Many students are overwhelmed when deciding which graduate school to attend. Many do not know who to ask, and turn to costly advisors who do not know what is best for their specific needs and goals.

THE OPPORTUNITY: Watson can help aspiring graduate students find the best graduate school for them by understanding their preferences, and previous academic scores to build a candidate profile to match to relevant schools. Watson will also educate the user on everything from the admissions process to the best practices for the GMAT exam. The School Navigator uses an interactive interface to engage and advise aspiring students in one of their most important decisions.

Page 15: Watson API Use Case Demos for the Nittany Watson Challenge

©IBM 2016

Link: http://schoolnavigator.mybluemix.net/#/username: watsonpassword: p@ssw0rd

SchoolNavigator Demo

15

Page 16: Watson API Use Case Demos for the Nittany Watson Challenge

Expertise Finder

16

Page 17: Watson API Use Case Demos for the Nittany Watson Challenge

EXPERTISE FINDER

17

ChallengeOrganizations often staff projects with available resources without certainty that the resource is the best fit. That’s because it can be extremely time consuming to align relevant personnel and expertise to projects and tasks. This can lead to under-utilization of resources, delivery challenges, cost overruns and missed opportunities for organic growth. Especially relevant for Financial Services, Law, Research & Development, Consulting & Engineering organizations.

Watson inspired solution to the problem Watson can enable enterprises to efficiently locate and identify expertise across the firm. By matching needed expertise with relevant employee experience, Watson streamlines a company’s internal discovery process for resource matching to projects.

Expertise Finder - Legal

Username: watson

Password: w@ts0n

Page 18: Watson API Use Case Demos for the Nittany Watson Challenge

Multimedia EnrichmentGetting concepts from video and audio

18

Page 19: Watson API Use Case Demos for the Nittany Watson Challenge

©IBM 2015 IBM Confidential IBM Internal ONLY

Live Demo

1/24/2017 19

http://cnn-media.mybluemix.net/#/Username – cnnPassword – cnn123

Page 20: Watson API Use Case Demos for the Nittany Watson Challenge

©IBM 2015 IBM Confidential IBM Internal ONLY

The Accelerator provides processing of multimedia content to acquire, ingest and enrich it

• Input can be any type of multimedia content (images, social media text, video transcripts, audio etc.) and Output is the enriched metadata based on processing them using relevant Watson API’s.

• Leverages relevant Watson APIs

• Index the enriched content in an efficient storage based on an canoncial schema.

1/24/2017 20

Page 21: Watson API Use Case Demos for the Nittany Watson Challenge

©IBM 2015 IBM Confidential IBM Internal ONLY

Multimedia Enrichment Pipeline for Audio enrichment

1/24/2017 21

Audio Ingestion (optional)• Extract Audio from the

videos• Preserve metadata

Transcribe Audio (optional)• Transcribe Audio• Maintain start & end times• Capture word alternatives• Speech to Text

Entity Extraction• Invoke AlchemyLanguage

and/or Custom WKS AlchemyLanguage Model

Keywords• Highly relevant terms &

phrases• AlchemyLanguage

Expand Keywords• Word2Vec as a API• Trained on Wikipedia and

other generally available corpus

Tone Analysis• Extract the top emotion,

social tone and writing tone• Tone Analysis

Sources

Pull / Poll

Taxonomy & Relationships• Classify or categorize content• Different types of relations

between detected entities• AlchemyLanguage

Visual Recognition• Use random frames / samples• Class & Face Detection• Visual Recognition

Store in Enrichment DB’s(Cloudant, IBM Graph)

Closed Caption Transcripts

Page 22: Watson API Use Case Demos for the Nittany Watson Challenge

©IBM 2015 IBM Confidential IBM Internal ONLY

Multimedia Enrichment Pipeline for Video files

1/24/2017 22

Enrichment Pipeline Service• Pass through all text related

pipelines

Store in Enrichment DB’s(Cloudant, IBM Graph)

VideoProcessorService• Transcribe• ImageCapture/Visual

Recognition

Video Workload Manager • UI to select file/URI and have

‘processed’, indicate progress, See Results

Video URI

Status/Complete events

Start Event w/ json ID in DB.

Insert JSON/Update JSON

Read JSON/Update JSON

A workload manager would drive workload. We can horizontally scale the VideoProcessor and Enrichment Pipeline.

VideoProcessorService• Transcribe• ImageCapture/Visual

Recognition

Enrichment Pipeline Service• Pass through all text related

pipelines

Page 23: Watson API Use Case Demos for the Nittany Watson Challenge

©IBM 2015 IBM Confidential IBM Internal ONLY

Multimedia Enrichment Pipeline - Video Processor

1/24/2017 23

Video Ingestion• Preserve metadata• Extract Audio from the videos• Generate Sample Frames

Transcribe/Normalize Audio • Transcribe Audio• Maintain start & end times• Capture word alternatives• Speech to Text

Visual Recognition• Use random frames / samples• Class & Face Detection• Visual Recognition

Store in Enrichment DB’s(Cloudant, IBM Graph)

jsonInitial

metadata

A URI is passed to the Video Ingestion and we use ffmpeg to ingest the video:1. We instantiate a metadata

object and save any existing metadata in the file.

2. a) As the video is read, we pass audio to STT (if we don’t have a transcript)b) Every TIME_INTERVAL ffpmeg saves a Screen capture of the file.

3. We save Transcript, the filename and time taken to the Metadata.

4. Send image URI to VR5. Update JSON Metadata in

DB w/ VR Results.

URI

PNGimagePNG

imagePNGimagePNG

image

1

2a

2b

3

4

5

The main idea here is that we must process the file 1 time (at least) and it will take as long as it takes to play the file (for the most part) This process will extract all data possible during the initial processing PRIOR to handing it off to the TranscriptionAnalyzer. One possibility is if we already have a transcription and can skip 2a, we may be able to process video faster for just images.

Page 24: Watson API Use Case Demos for the Nittany Watson Challenge

©IBM 2015 IBM Confidential IBM Internal ONLY

Multimedia Enrichment Pipeline - Text Enrichment

1/24/2017 24

Entity Extraction• Invoke AlchemyLanguage

and/or Custom WKS AlchemyLanguage Model

Keywords• Highly relevant terms & phrases• AlchemyLanguage

Expand Keywords• Word2Vec as a API

• Trained on Wikipedia and other generally available corpus

Tone Analysis• Extract the top emotion,

social tone and writing tone

• Tone Analysis

Taxonomy & Relationships• Classify or categorize content• Different types of relations between

detected entities• AlchemyLanguage

Store in Enrichment DB’s(Cloudant, IBM Graph)

Retrieve JSON Document for processing, Update when complete

Enrichment Pipeline• Lookup document and retrieve results

from Enrichers

Each stage will pass on its enriched JSON file to the next stage which will work on the data as it sees fit. At the end of each stage an even to the Enrichment Pipeline indicating the stage is finished will be generated.

A set of node.js Streams

event: ‘start’Id: docID

event: ‘’finished’Id: docID

Page 25: Watson API Use Case Demos for the Nittany Watson Challenge

Watson Knowledge Studio

25

Page 26: Watson API Use Case Demos for the Nittany Watson Challenge

FACT EXTRACTION WITH WATSON KNOWLEDGE STUDIO

26© Copyright IBM Corporation 2016 http://laser1.watson.ibm.com/sire/ie2.php

Page 27: Watson API Use Case Demos for the Nittany Watson Challenge

WKS RULES

27

Page 28: Watson API Use Case Demos for the Nittany Watson Challenge

Thank [email protected]

January 2017

Page 29: Watson API Use Case Demos for the Nittany Watson Challenge

Backup Slides

January 2017