speech descriptors generation software utilized for classification and recognition purposes lukasz...

12
SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko ([email protected]) Department of Biomedical Engineering, Faculty of Electronics, Telecommunications and Informatics, Gdansk University of Technology

Upload: marshall-may

Post on 05-Jan-2016

217 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

SPEECH DESCRIPTORS GENERATION SOFTWAREUTILIZED FOR CLASSIFICATION AND RECOGNITION

PURPOSES

Lukasz Laszko ([email protected])

Department of Biomedical Engineering,Faculty of Electronics, Telecommunications and Informatics,

Gdansk University of Technology

Page 2: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

General architecture

SDR client SDR server

ASR server

SDR DB

Service Oriented Architecture

External systems

Page 3: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

Spoken content retrivial – implementation

ASR Server architecture

Status : implemented

ASR Engine

Metadata datbase

ORM mapper

Multithread execution pool

Web service

Java Concurrency Framework

JAX-WS 2.1 withWSIT

onApache Jetty 6

CMU Sphinx-4

Network

SOAP + MTOMover HTTPS

Page 4: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

Spoken content retrivial – implementation

SDR Server architecture

External services agents

NetworkStatus :

under development

ASR Connector

Services FrontEnd

Diagnostic portal

Data Access LogicWorkflow Runtime

Data ConnectorIndexing

Service

Search Service

Indexing Workflow

SearchWorkflow

SCDDatabase

Page 5: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

SDR database model

- Holds ASR temporary results- Holds extraction metadata- Supports task queuing- Performance measurement

Page 6: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

SDR database model

Page 7: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

SDR architecture – Microsoft P&P architecture approach retail model with mappings

SDR database

ASR web service

IoC with policy injection Web portal

+ SOAP interfaces

Page 8: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

SDR Component - technology5 Layer architecture (3 layer architecture extension):-Data management layer – PL/SQL stored procedures hosted on

Oracle Database 10g-Data access layer – data accessors for stored procedures and ORM mapping

(Apache iBatis + Oracle Data Provider for .NET)-Business Logic Layer – business rules encapsulation-Presentation layer – ASP .NET web application + Flash communication server -Client presentation layer – client JavaScript code and Adobe Flex Flash forms

communicating with presentation layer via Flash Remoting gateway interface

Additionally IoC container for model views load is used – Spring .NET with Policy Injection Aspect programing interface for validation and error handling Routines at Business Logic / Presentation boundry.

Security:-Windows Integrated Security – integrated authentication in MS Windows

domains, background authorization in components supporting WIS

-Standalone, buildin security – custom membership and role providers for ASP.NET providing authentication and authorization according to credentials stored in the database

Page 9: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

Indexing and retreval methods

Indexing methods

Word indexing Sub-word indexing

Page 10: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

Indexing and retreval methods

Spoken document indexing and retrivial methods = extension for full-text indexing methods in textual databases

Retrieval Status Value (RSV) – a relevance score calculated for each document stored in the database according to specified Information Retrevial (IR) query. This value reflects how much a given document satisfies requirements

defined in the query.

IR models adaptedfor SDR purposes

Similarity based models Probabilistic models

Page 11: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

Similarity based models

In this models RSV is defined as a measure of similarity reflecting the degree of resemblance between the query and the document descriptions. The most popular similarity based models are based on the vector space model (VSM)

1. Boolean matching searching

2. Best matching-searching (Salton and Backley method)

Weighting Methods

Weighting methods

fd(t) is the frequency of term t in document description D

fq(t) is the frequency of term t in query Q

Nc is the total number of documents in thecollection and nct the number of documents containing term t

Weighting methods

Page 12: SPEECH DESCRIPTORS GENERATION SOFTWARE UTILIZED FOR CLASSIFICATION AND RECOGNITION PURPOSES Lukasz Laszko (lukaszlaszko@gmail.com) Department of Biomedical

Current status

Implementation:

Documentation:

ASR components – implemented and validatedASR connector for SDR system – partially implementedSDR Client portal – partially implementedSDR Documents indexing – implemented, with partial user interfaceSDR Documents search – implemented with unit tests

Requirements specification – compilant with IEEE-830 standard and Volere template

Functional specification – compilant with IEEE-1470.1 standardDesign specification for both ASR and SDR systemsTest cases and validation scenarios

Solution descriptions