speech descriptors generation software utilized for classification and recognition purposes lukasz...

Post on 05-Jan-2016

218 Views

Category:

Documents

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

SPEECH DESCRIPTORS GENERATION SOFTWAREUTILIZED FOR CLASSIFICATION AND RECOGNITION

PURPOSES

Lukasz Laszko (lukaszlaszko@gmail.com)

Department of Biomedical Engineering,Faculty of Electronics, Telecommunications and Informatics,

Gdansk University of Technology

General architecture

SDR client SDR server

ASR server

SDR DB

Service Oriented Architecture

External systems

Spoken content retrivial – implementation

ASR Server architecture

Status : implemented

ASR Engine

Metadata datbase

ORM mapper

Multithread execution pool

Web service

Java Concurrency Framework

JAX-WS 2.1 withWSIT

onApache Jetty 6

CMU Sphinx-4

Network

SOAP + MTOMover HTTPS

Spoken content retrivial – implementation

SDR Server architecture

External services agents

NetworkStatus :

under development

ASR Connector

Services FrontEnd

Diagnostic portal

Data Access LogicWorkflow Runtime

Data ConnectorIndexing

Service

Search Service

Indexing Workflow

SearchWorkflow

SCDDatabase

SDR database model

- Holds ASR temporary results- Holds extraction metadata- Supports task queuing- Performance measurement

SDR database model

SDR architecture – Microsoft P&P architecture approach retail model with mappings

SDR database

ASR web service

IoC with policy injection Web portal

+ SOAP interfaces

SDR Component - technology5 Layer architecture (3 layer architecture extension):-Data management layer – PL/SQL stored procedures hosted on

Oracle Database 10g-Data access layer – data accessors for stored procedures and ORM mapping

(Apache iBatis + Oracle Data Provider for .NET)-Business Logic Layer – business rules encapsulation-Presentation layer – ASP .NET web application + Flash communication server -Client presentation layer – client JavaScript code and Adobe Flex Flash forms

communicating with presentation layer via Flash Remoting gateway interface

Additionally IoC container for model views load is used – Spring .NET with Policy Injection Aspect programing interface for validation and error handling Routines at Business Logic / Presentation boundry.

Security:-Windows Integrated Security – integrated authentication in MS Windows

domains, background authorization in components supporting WIS

-Standalone, buildin security – custom membership and role providers for ASP.NET providing authentication and authorization according to credentials stored in the database

Indexing and retreval methods

Indexing methods

Word indexing Sub-word indexing

Indexing and retreval methods

Spoken document indexing and retrivial methods = extension for full-text indexing methods in textual databases

Retrieval Status Value (RSV) – a relevance score calculated for each document stored in the database according to specified Information Retrevial (IR) query. This value reflects how much a given document satisfies requirements

defined in the query.

IR models adaptedfor SDR purposes

Similarity based models Probabilistic models

Similarity based models

In this models RSV is defined as a measure of similarity reflecting the degree of resemblance between the query and the document descriptions. The most popular similarity based models are based on the vector space model (VSM)

1. Boolean matching searching

2. Best matching-searching (Salton and Backley method)

Weighting Methods

Weighting methods

fd(t) is the frequency of term t in document description D

fq(t) is the frequency of term t in query Q

Nc is the total number of documents in thecollection and nct the number of documents containing term t

Weighting methods

Current status

Implementation:

Documentation:

ASR components – implemented and validatedASR connector for SDR system – partially implementedSDR Client portal – partially implementedSDR Documents indexing – implemented, with partial user interfaceSDR Documents search – implemented with unit tests

Requirements specification – compilant with IEEE-830 standard and Volere template

Functional specification – compilant with IEEE-1470.1 standardDesign specification for both ASR and SDR systemsTest cases and validation scenarios

Solution descriptions

top related