cassi speech recognition

Upload: praveen-lvv

Post on 09-Apr-2018

221 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/8/2019 CASSI Speech Recognition

    1/14

    CASSI Speech Recognition:

    Adding Speech Recognition to Embedded Devices

    by

    Praveen lvv

  • 8/8/2019 CASSI Speech Recognition

    2/14

    INTRODUCTION

    What is CASSI ?

    Conversay Advanced Symbolic Speech Interface

    It can be used in a variety of embedded systems.

    It runs on either single or dual-processor hardware designs

    > CASSI provides continuous, speaker-independent

    speech recognition

    Conversay developers and customers write application

    code that uses the CASSI API to integrate speechrecognition and text-to-speech (TTS) capability into

    embedded products.

  • 8/8/2019 CASSI Speech Recognition

    3/14

    What is TTS ?

    Text-To-Speech (TTS):

    CASSI contains two modules for performing TTS:Rosetta and a TTS synthesis module.

    Rosetta, the text-to-phonetics unit, accepts

    arbitrary written text as input and outputs a string ofphonemes forCASSI to synthesize

    process

    of incorporating speech technology

    1. Definition of capabilities

    2. Analysis of hardware resources

    3. User interface design

    4. Development

  • 8/8/2019 CASSI Speech Recognition

    4/14

    HARDWARE ENVIRONMENT:

    Modular nature.

    Suitable for a variety of systems.

    Used with single processor designs where one

    processor handles all component execution.

    Feature extraction and TTS synthesis may be

    separated onto theirown DSP (orother front-end signal

    processor)

    Front-End Block:The front-end block is used for recognition and TTS functions

    ProcessorBlock (Back-End):

    The processor block performs all other code functions, includingtopic management and search

  • 8/8/2019 CASSI Speech Recognition

    5/14

    AUTOMATIC SPEECH RECOGNISATION

    What does speaker

    dependent / adaptive / independent mean?

  • 8/8/2019 CASSI Speech Recognition

    6/14

    What does continuous speech and isolated-word mean?

    A continuous speech system operates on speech inwhich words are connected together, i.e. not separated

    by pauses.

    An isolated-word system operates on single words at a

    time - requiring a pause between saying each word.

    This is the simplest form of recognition

    Continuous speech is more difficult to handle because of a variety

    of effects.

  • 8/8/2019 CASSI Speech Recognition

    7/14

    The Process of Speech Recognition

    Acoustic-Phonetic

    Pattern Recognition

    Artificial Intelligence

    INTERFACE

  • 8/8/2019 CASSI Speech Recognition

    8/14

    The Experiment

    Yes spoken by first person

    Yes spoken by the second

    person

  • 8/8/2019 CASSI Speech Recognition

    9/14

    Divide the sound wave into evenly spaced blocks.

    Process each block for important characteristics .

    Attempt to associate each block with a

    Phone, which is the most basic unit of speech,

    producing a string of phones.

    Find the word whose model is the most likely match

    The Basic Steps

  • 8/8/2019 CASSI Speech Recognition

    10/14

    speech recognition systems use the basic three-stage

    Architecture:

    Feature detection in which the

    raw acoustic waveform is

    represented in a more useful

    space

    Probabilistic classification of

    the feature vectors, in which the

    frames are scored as looking

    more or less likely as versions

    Search forbest word-

    sequence hypothesis in which

    a word sequence is found that is

    consistent with the constraints of

    lexicon and grammar

  • 8/8/2019 CASSI Speech Recognition

    11/14

    ADVANTAGES OF SPEECH RECOGNISATION

    Easy search and index recorded audio and video data.

    Speech recognition is also useful as a form of input.

    people working in active environment such as hospitals to use computers.

    people with handicaps to use computers.

  • 8/8/2019 CASSI Speech Recognition

    12/14

    CONCLUSION !!!

    Visual cues to help computers decipher speech sounds that

    are obscured by environmental noise.

    Speech-to-speech translation project for spontaneous speech

    Multi-engine Spanish-to-English machine translation system

    Building synthetic voices

  • 8/8/2019 CASSI Speech Recognition

    13/14

  • 8/8/2019 CASSI Speech Recognition

    14/14

    Thank YouThank You