speech recognition using hidden markov model_mee_03_19 (1)

Upload: sundaram-natarajan

Post on 03-Jun-2018

226 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    1/79

    Speech Recognition using Hidden Markov Model

    An implementation of the theory on a

    DSK ADSP-!"## $%-K&' (&'$ R$) *+"

    ,ick ardici

    .rn Skarin

    ////////////////////////////////////////////////////

    Degree of Master of Science in $lectrical $ngineering

    M$$-0#-*1

    Supervisor2 Mikael ,ilsson

    School of $ngineering

    Department of 'elecommunications and Signal Processing

    lekinge &nstitute of 'echnology

    March3 4005

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    2/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    A6stract

    'his master degree proect is ho7 to implement a speech recognition system on a DSK

    ADSP-!"## $%-K&' (&'$ R$) *+" 6ased on the theory of the Hidden Markov Model

    8HMM9+ 'he implementation is 6ased on the theory in the master degree proect Speech

    Recognition using Hidden Markov Model 6y Mikael ,ilsson and Marcus $narsson3 M$$-

    0*-4:+ 'he 7ork accomplished in the proect is 6y reference to the theory3 implementing a

    M!;;3 Mel !rer ut p> att implementera en r.stigenk?nningssystem p> en DSK

    ADSP-!"## $%-K&' (&'$ R$) *+" 6aserad p> teorin om HMM3 Hidden Markov Model+

    &mplementeringen ?r 6aserad p> teorin i e=amensar6etet Speech Recognition using Hidden

    Markov Model av Mikael ,ilsson och Marcus $narsson3 M$$-0*-4:+ Det som gorts i

    ar6etet ?r att utifr>n teorin implementerat en M!;;3 Mel !re DSP2n 'e=as &nstruments 'MDS#40=5:**+ Sedan utv?rderadesrealtidstill?mpningen+

    ///////////////////////////////////////////////////////////////////////////4

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    3/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Contents

    *+ A6stract 2

    4+ ;ontents 3

    #+ &ntroduction 6

    @+ Speech signal to !eature )ectors3 Mel !re

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    4/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+#+* 3 output distri6ution matri= #"

    5+#+4 E3 'he !or7ard varia6le #B

    5+#+# F3 ack7ard Algorithm @0

    5+#+@ c3 scaled3 the scaling factor3 E scaled3 F scaled @*

    5+#+" (og8P8GI993 (og(ikelihood @"

    6.4 ,eestimation 46

    5+@+* A/reest3 reestimated state transition pro6a6ility matri= @:

    5+@+4 J/reest3 reestimated mean @B

    5+@+# /reest3 variance matri= @B

    5+@+@ ;heck threshold value @B

    6.& +he res"lt 5 the model 49

    5+"+* 'he Model @1

    7. HMM'he testing of a 7ord against a model 'he determination pro6lem "0

    7.1 SP##C S-/( &2:+*+* Speech signal "4

    7.2 P,#P,*C#SS-/ &2

    :+4+* M!;; "4

    7.3 --+-(-(+-* &2

    :+#+* (og8A93 state transition pro6a6ility matri= of the model "4

    :+#+4 J3 mean matri= from model "4

    :+#+# 3 variance matri= from model "4

    :+#+4 (og893 initial state pro6a6ility vector "4

    7.4 P,*8(8--+ #:((+-* &3

    :+@+* (og89 "#

    :+@+4 L3 delta "#:+@+# 3 psi "#

    :+@+@ (og8PN9 "#

    :+@+"

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    5/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    %.6 F#(+,# :#C+*,S 5 el Fre?"enc@ Cepstr"m Coe))icients 64

    %.7 +#S+-/ 6&

    %.% --+-(-(+-* *F +# *'# +* 8# S#' 66

    B+B+* (og8A93 state transition pro6a6ility matri= of the model 55

    B+B+4 J3 mean matri= from model 55

    B+B+# 3 variance matri= from model 55

    B+B+@ (og893 initial state pro6a6ility vector 55

    %.9 P,*8(8--+ #:((+-* 66

    B+1+* (og89 55

    B+1+4 L3 delta 55

    B+1+# 3 psi 55

    B+1+@ (og8PN9 55

    B+1+"

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    6/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    #+ &ntroduction

    &n our minds the aim of interaction 6et7een a machine and a human is to use the most natural

    7ay of e=pressing ourselves3 through our speech+ A speech recogniCer3 implemented on a

    machine as an isolated 7ord recogniCer 7as done through this proect+ 'he proect also

    included an implementation on a DSK 6oard due to the porta6ility of this device+

    !irst the feature e=traction from the speech signal is done 6y a parameteriCation of the 7ave

    formed signal into relevant feature vectors+ 'his parametric form is then used 6y the

    recognition system 6oth in training the models and testing the same+

    'he techni

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    7/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+ Speech signal to !eature )ectors3 Mel !re

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    8/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Fig"re 4.1

    ///////////////////////////////////////////////////////////////////////////B

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    9/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+* SP$$;H S&Q,A(

    @+*+* Speech signal

    'he original analogue signal 7hich to 6e used 6y the system in 6oth training and testing isconverted from analogue to discrete3 =8n9 6oth 6y using the program ;ool$dit3

    http2777+cooledit+comand 6y using the DSK ADSP-!"## $%-K&' (&'$ R$) *+"3

    http2777+6lackfin+org+ 'he sample rate3 !s used 7as *5kHC+ An e=ample of a signal in

    7aveform sampled is given in Fig"re 4.2+ 'he signals used in the follo7ing chapters are

    denoted 7ith anxand an e=tension_ffte+g+x_fft(n)if an fft is applied to it+ 'he original

    utterance signal is denotedx_utt(n)3 sho7n 6elo7+

    Fig"re 4.2 5 Sampled signalA "tterance o) B)ram in !ae)orm

    ///////////////////////////////////////////////////////////////////////////1

    http://www.cooledit.com/http://www.blackfin.org/http://www.cooledit.com/http://www.blackfin.org/
  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    10/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+4 PR$PRG;$SS&,Q

    @+4+* Preemphasis'here is a need for spectrally flatten the signal+ 'he preemphasiCer3 often represented 6y a

    first order high pass !&R filter is used to emphasiCe the higher fre

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    11/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Fig"re 4.3b 5 *riginal signalD@DnEE and preemphasiedD$DnEE

    &n Fig"re 4.3b it sho7s ho7 the lo7er fre

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    12/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+4+4 )AD3 )oice Activation Detection

    hen you have got access to a sampled discrete signal it is significant to reduce the data

    to contain only the samples 7hich is represented 7ith signal values3 not noise+ 'herefore

    the need of a good )oice Activation Detection function is needed+ 'here are many 7aysof doing this+ 'he function used is descri6ed inEq.4.2+

    hen 6eginning the calculation and estimation of the signal it is useful to do some

    assumptions+ !irst 7e needed to divide the signal into 6locks+ 'he length of each 6lock is

    needed to 6e 40ms according to the stationary properties of the signal OMM+ hen using

    the !s at *5 kHC3 it 7ill give us a 6lock length of #40 ms+ ;onsider the first *0 6locks to

    6e 6ackground noise3 then mean and variance could 6e calculated and used as a reference

    to the rest of the 6locks to detect 7here a threshold is reached+

    B+04+03var33 ===+= wwwwww meant Eq. 4.2

    'he threshold in our case 7here tested and tuned to 1.2N tw+ 'he result of the

    preemphasiCed signal cut do7n 6y the )AD is presented in Fig"re 4.4a and )ig"re 4.4b+

    ///////////////////////////////////////////////////////////////////////////*4

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    13/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Fig"re 4.4a

    Fig"re 4.4b

    ///////////////////////////////////////////////////////////////////////////*#

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    14/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+# !RAM$(G;K&,Q V &,DG&,Q

    @+#+* !rame6locking

    'he o6ective 7ith fram6locking is to divide the signal into a matri= form 7ith anappropriate time length for each frame+ Due to the assumption that a signal 7ithin a frame

    of 40 ms is stationary and a sampling rate at *5000HC 7ill give the result of a frame of

    #40 samples+

    &n the fram6locking event the use of an overlap of 543"W 7ill give a factor of separation

    of *40 samples+

    Fig"re 4.&

    @+#+4 indo7ing using Hamming 7indo7

    After the frame6locking is done a Hamming 7indo7 is applied to each frame+ 'his

    7indo7 is to reduce the signal discontinuity at the ends of each 6lock+

    'he e

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    15/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Fig"re 4.&

    ///////////////////////////////////////////////////////////////////////////*"

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    16/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    ///////////////////////////////////////////////////////////////////////////*5

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    17/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he Fig"re 4.6sho7s the result of the frame6locking3 6lock num6er 40+

    Fig"re 4.6

    Fig"re 4.7 sho7s the 6lock 7indo7ed 6y the 7indo7 in Fig"re 4.7

    Fig"re 4.7

    'he result gives a reduction of the discontinuity at the ends of the 6lock+

    ///////////////////////////////////////////////////////////////////////////*:

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    18/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+@ !$A'XR$ $Y'RA;'&G,

    'he method used to e=tract relevant information from each frame6lock is the mel-

    cepstrum method+ 'he mel-cepstrum consists of t7o methods mel-scaling and cepstrum

    calculation+

    @+@+* !!' on each 6lock

    Xse "*4 point !!' on each 7indo7ed frame in the matri=+ 'o adust the length of the

    40ms frame length3 Cero padding is used+ 'he result for the 6lock num6er 40 is given in

    Fig"re 4.%.

    Fig"re 4.%

    ///////////////////////////////////////////////////////////////////////////*B

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    19/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    ///////////////////////////////////////////////////////////////////////////*1

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    20/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+@+4 Mel spectrum coefficients 7ith filter6ank

    'he fact that the human perception of the fre

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    21/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he element inx_mel(1,1)are o6tained 6y summing the contri6ution from the first

    filtertap denoted * 8Mat(a6 notation+ mel6ank8*24"532993 then elementx_mel(2,1)is

    o6tained 6y summing the contri6ution from the second filtertap in mel6ank and so on+

    ///////////////////////////////////////////////////////////////////////////4*

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    22/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+@+# Mel-;epstrum coefficients3 D;' Discrete ;osine 'ransform

    'o derive the mel cepstrum of the 7arped mel fre

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    23/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+@+@ (iftering3 the cepstral domain e

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    24/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+@+" $nergy Measure

    'o add an e=tra coefficient containing information a6out the signal the log of signal

    energy is added to each feature vector+ &t is the coefficient that 7ere e=changed mentioned

    in the previous section+ 'he log of signal energy is defined 6yEq.4.%+

    1+@+9Z8/log*

    0

    4 Eqmkw&n'(we'xEK

    k

    m

    =

    =

    ///////////////////////////////////////////////////////////////////////////4@

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    25/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+" D$('A V A;;$($RA'&G, ;G$!!&;&$,'S

    'he delta and acceleration coefficients are calculated to increase the information of the

    human perception+ 'he delta coefficients are a6out time difference3 the acceleration

    coefficients are a6out the second time derivative+

    @+"+* Delta coefficients'he delta coefficients are calculated according toEq.4.1.

    *0+@+

    99Z89Z88

    4

    P*O Eq

    mncmnc

    *

    *

    *

    *

    hh

    =

    =

    +

    =

    ///////////////////////////////////////////////////////////////////////////4"

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    26/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+"+4 Acceleration coefficients

    'he acceleration coefficients are calculated according toEq.4.11.

    **+@+

    9*4898

    9Z89*489Z8

    4@44

    44

    P4OEq

    *

    mnc*mnc

    *

    *

    *

    *

    *

    *

    *

    *

    hh

    *

    *

    +

    +++=

    = =

    = ==

    ///////////////////////////////////////////////////////////////////////////45

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    27/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+"+# 4-nd order polynomial appro=imation

    Xsing the P0O 8Eq.+.1293 P*O and P4O the appro=imation of the mel-cepstrum

    traectories could 6e appro=imated according to Eq.4.1+.'he Fig"re 4.10 is the result of

    using the fitting 7idth P [ #+

    *4+@+4

    *9Z8

    *4

    * 4P4OP0O Eqmnc*

    *

    *

    *

    h

    +

    +=

    ==

    *#+@+44P4OP*OP0O Eq =++

    Fig"re 4.10

    ///////////////////////////////////////////////////////////////////////////4:

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    28/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+5 PGS'PRG;$SS&,Q'o achieve some enhancement in ro6ustness there is a need for postprocessing of the

    coefficients+

    @+5+* ,ormaliCation

    'he enhancement done is a normaliCation3 meaning that the feature vectors are

    normaliCed over time to get Cero mean and unit variance+ ,ormaliCation forces the feature

    vectors to the same numerical range OMM+ 'he mean vector3 called 98nf, 3 can 6e

    calculated according to $

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    29/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    @+: R$SX('

    @+:+* !eature vectors Mel !re

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    30/79

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    31/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+ HMM 'he training of a model of a 7ord'he re-estimation

    pro6lem

    Qiven a , num6er of o6servation se

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    32/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    ///////////////////////////////////////////////////////////////////////////#4

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    33/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+* M$A, A,D )AR&A,;$

    5+*+* Signal 'he utterance

    'he signal used for training purposes are ordinary utterances of the specific 7ord3 the 7ord to6e recogniCed+

    5+*+4 M!;; Mel !re

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    34/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+*+# 3 mean

    hen the M!;; is achieved3 there is a need to normaliCe all the given training utterance+ 'he

    matri= is divided into a num6er of coefficients times num6er of states+ 'hen these are used for

    calculating the mean and variance of all the matrices3 see section @+*+@ for variance

    calculation+ 'he mean us calculated usingEq6.1

    c(lumncnxN

    xN

    n

    cc ==

    =

    398* *

    0

    /

    Eq.6.1

    ,ote that if multiple utterances are used for training there is a need of calculating the mean of

    x_(m,n)for that num6er of utterances+

    @+*+@ 3 variance

    'he variance is calculated usingEq6.2andEq6.++

    c(lumncnxN

    xN

    n

    cc ==

    =

    398* *

    0

    4/4

    Eq. 6.2

    c(lumncxx ccc =

    = 34

    //44 Eq. 6.+

    A more e=plicit e=ample of calculating a certain inde= e+g thex_7(1,1) is done according to

    the follo7ing e

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    35/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+4 &,&'&A(&%A'&G,

    5+4+* A3 the state transition pro6a6ility matri=3 using the left-to-right model

    'he state transition pro6a6ility matri=3 A is initialiCed 7ith the e

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    36/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    use+ 'his is less complicated in calculations 6ut it uses a vector

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    37/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Gne =/mfcc feature vector is in the estimation versus each;and7 vector+ i+e+ $ach feature vector is

    calculated for allx_;andx_ 7 clumn! one 6y one+

    ///////////////////////////////////////////////////////////////////////////#:

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    38/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he resulting state-dependent o6servation sym6ol pro6a6ilities matri=+ 'he columns gives theo6servation pro6a6ilities for each state+

    5+#+4 E3 'he !or7ard Algorithm

    hen finding the pro6a6ility of an o6servation se

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    39/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he definition of 98&t is that 98&t is the pro6a6ility at time tand in state &given the

    model3 having generated the partial o6servation se

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    40/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Return to step 4 if t 'Z

    Gther7ise3 terminate the algorithm 8goto step @9+

    @+ 'ermination

    989H8*

    &

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    41/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    *+ &nitialiCation

    Set t [ ' *Z

    N&&4 = *3*98

    4+ &nduction

    N&(5a&&N

    9

    t9&9tt ==

    ++ *3989898*

    **

    #+ Xpdate time

    Set t[ t- *Z

    Return to step 4 if t 0Z

    Gther7ise3 terminate the algorithm+

    @+#+@ c3 the scaling factor3 E scaled3 F scaled

    Due to the comple=ity of precision range 7hen calculating 7ith multiplications of

    pro6a6ilities makes a scaling of 6oth >and?necessary+ 'he comple=ity is that the

    pro6a6ilities is heading e=ponentially to Cero 7hen t gro7s large+ 'he scaling factor for

    scaling 6oth the for7ard and 6ack7ard varia6le is dependent only of the time tand

    independent of the state &+ 'he notation of the factor is tc and is done for every t and state &3

    N&* + Xsing the same scale factor is sho7n useful 7hen solving the parameter estimation

    pro6lem 8pro6lem # ORa6B193 7here the scaling coefficients for >and?7ill cancel out eachother e=actly+

    'he follo7ing procedure sho7s the calculation of the scale factor 7hich as mentioned is also

    used to scale?+ &n the procedure the denotation 98&t is the unscaled for7ard varia6le3

    98f &t denote the scaled for7ard varia6le and 98ff &t denote the temporary for7ard varia6le

    6efore scaling+

    *+ &nitialiCation

    Set t 84ZN&(5&

    && = *93898 **

    N&&& = *93898ff **

    =

    = N

    &

    &

    c

    *

    *

    *

    98

    *

    9898f *** &c& =

    4+ &nduction

    ///////////////////////////////////////////////////////////////////////////@*

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    42/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    N&a9(5&N

    9

    9&tt&t = =

    *398f9898ff

    *

    *

    ==

    N

    &t

    t

    &

    c

    *

    98ff

    *

    N&&c& ttt = *938ff98f

    #+ Xpdate time

    Set t[ t *Z

    Return to step 4 if t Z

    Gther7ise3 terminate the algorithm 8goto step @9+

    @+ 'ermination

    =

    =4

    t

    tc

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    43/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Fig"re 6.3

    ///////////////////////////////////////////////////////////////////////////@#

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    44/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he resulting 6eta/scaled2

    Fig"re 6.4

    ///////////////////////////////////////////////////////////////////////////@@

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    45/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+#+" (og 8P8GI993 save the pro6a6ility of the o6servation se

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    46/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+@ R$-$S'&MA'&G, G! 'H$ PARAM$'$RS !GR 'H$ MGD$(3 [8 3

    (3 89

    'he recommended algorithm used for this purpose is the iterative aum-elch algorithm that

    ma=imiCes the likelihood function of a given model [8 3 (3 89 OMMORa6ODavid+ !orevery iteration the algorithm reestimates the HMM parameters to a closer value of the

    _glo6al` 8e=ist many local9 ma=imum+ 'he importance lies in that the first local ma=imum

    found is the glo6al3 other7ise an erroneous ma=imum is found+

    'he aum-elch algorithm is 6ased on a com6ination of the for7ard algorithm and the

    6ack7ard algorithm+

    'he

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    47/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+@+* A/reest3 reestimate the state transition pro6a6ility matri=

    hen solving pro6lem three3 GptimiCe model parameters ORa6B13 an adustment of the

    parameters of the model is done+ 'he aum-elch is used as mentioned in the previous

    section of this chapter+ 'he adustment of the model parameters should 6e done in a 7ay that

    ma=imiCes the pro6a6ility of the model having generated the o6servation se

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    48/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he reestimation of theAmatri= is

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    49/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    5+" 'H$ R$SX(' 'H$ H&DD$, MARKG) MGD$(

    5+"+* Save the Hidden markov Model for that specific utterance

    After the reestimation is done+ 'he model is saved to represent that specific o6servationse

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    50/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    :+ HMM 'H$ '$S'&,Q G! A, GS$R)A'&G, 'he decoding pro6lem

    hen comparing an o6servation se

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    51/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    ///////////////////////////////////////////////////////////////////////////"*

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    52/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    :+* SP$$;H S&Q,A(

    :+*+* Speech signal

    'he signals used for testing purposes are ordinary utterances of the specific 7ord3 the 7ord to

    6e recogniCed+

    :+4 PR$PRG;$SS&,Q

    :+4+* M!;; Mel !re

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    53/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    :+@ PRGA&(&'j $)A(XA'&G, (oglikelihood3 using

    'he Alternative )iter6i Algorithm

    :+@+* (og89

    'he continuous o6servation pro6a6ility density function matri= is calculated as i the previous

    chapter 5 +he training o) a model+ 'he difference is that the logarithm is used on

    the matri= due to the constraints of 'he Alternative )iter6i Algorithm+

    :+@+4 L3 delta

    'o 6e a6le to search for the ma=imiCation of a single state path the need for the follo7ing

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    54/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    :+@+: Alternative )iter6i Algorithm

    'he follo7ing steps are included in the Alternative )iter6i Algorithm O* O4 ORa6B1+

    "+ PreprocessingN&&& = *93log8k @+:@ MM

    N9&aa &9&9 = 3*93log8k

    @+:" MM

    5+ &nitialiCation

    Set t [ 4Z

    N&(5(5&& = *9938log898

    k** @+:5 MM

    N&(5& && += *938kk98

    k**

    @+:: MM

    N&& = *3098* @+:B MM

    :+ &nduction

    N9(5(5 t9t9 = *9938log898k

    @+:1 MM

    N9a&(59 &9tN&

    ttt ++=

    *P3k98k

    Oma=98k

    98k

    **

    @+B0

    M

    N9a&9 &9tN&

    t +=

    *P3k98k

    Oma=arg98 **

    @+B* MM

    B+ Xpdate time

    Set t[ t *Z

    Return to step # if t 'Z

    Gther7ise3 terminate the algorithm 8goto step "9+1+ 'ermination

    9P8k

    Oma=k

    *

    N &* 4N&

    = @+B4 MM

    9P8k

    Oma=arg*

    &q 4N&

    4

    = @+B# MM

    *0+ Path 8state se

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    55/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    ;onsider a model 7ith , [ # states and an o6servation of length ' [ B+ &n the initialiCation 8t

    [ *9 is * 8*93 * 849 and * 8#9 found+ (ets assume that * 849 is the ma=imum+ ,e=t time 8t [

    49 three varia6les 7ill 6e used namely 4 8*93 4 849 and 4 8#9+ (ets assume that 4 8*9 is no7

    the ma=imum+ &n the same manner 7ill the follo7ing varia6les # 8#93 @ 8493 " 8493 5 8*93 :8#9 and B 8#9 6e the ma=imum at their time3 see !ig+ :+*+

    !ig+ :+* sho7s that the )iter6i Algorithm is 7orking 7ith the lattice structure+

    Fig"re 7.1$=ample of )iter6i search

    'o find the state path the 6acktracking is used+ &t 6egins in the most likely end state and

    moves to7ards the start of the o6servations 6y selecting the state in the t 8&9 that at time t-*

    refers to current state+

    :+B R$SX('

    :+B+* Score'he score according to the )iter6i algorithm+ 'he same as the calculated value (og8PN93

    the ma=imiCation of the pro6a6ility of a single state path is saved as a result for each

    comparison+ 'he highest score is naturally the highest pro6a6ility that the comparedmodel has produced the given test utterance+

    ///////////////////////////////////////////////////////////////////////////""

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    56/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    ///////////////////////////////////////////////////////////////////////////"5

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    57/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    ///////////////////////////////////////////////////////////////////////////":

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    58/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    B 'he !"## DSP

    B+* 'he !"## $%-K&' (ite'he DSP used in this proect is an !"## $%-K&' (ite3 7hich is an fi= point digital signal

    processor developed 6y Analog Devices+ 'his DSP is offering a good performance at a verylo7 po7er consumption+ &t have audio inout ports as 7ell as video inout+

    ain )eat"res

    ;lock speed of :"0MHC

    Audio ports3 three in and t7o out

    !our B-6it )ideo A(Xs

    @0-6it shifter

    Dual *5-it multiplication accumulation 8MA;9

    !riendly and easy to use compiler support+

    'he soft7are used is )isual DSP3 also developed 6yAnal0 :e&ce!+All the programming on the !-"## is done in programming language c+

    Fig"re %.1

    ///////////////////////////////////////////////////////////////////////////"B

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    59/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    B+4 SP$$;H S&Q,A('he human natural speech is collected to the DSP 6y a microphone connected to one of the

    audio connectors on the !-"##+

    B+4+* 'he talkthrough modification'o get started easier 7ith the programming of the DSP3 the test program alkthu0hprovided together 7ith the )isual DSP 3 is used+ After implementing that and after testing

    that the audio inout from the DSP is 7orking 7e started the Speech Recognition

    Programming3 6ased on the alkthu0he=ample+

    B+4+4 &nterrupts'he ne=t issue to solve 7as to generate an interrupt 6y pushing a 6utton on the !-"##+

    !or this3 the e=ample _link` provided 7ith D&!ual :/* ==is modified.

    &nterrupts are programmed in that 7ay that every time one pushes a certain 6utton3 the Direct

    Memory Access Serial Port3 DMA SPGR'3 is ena6led and the DSP is listening for incoming

    speech signal from the microphone+,e=t time the 6utton is pushed the DMA SPGR' is disa6led and the speech recognition

    program 6egins+

    B+4+4 DMA3 Direct Memory Access'he DMA is a device for transferring data to or from other memory locations or peripherals

    8in our case microphone9 7ithout the attention of the ;PX+ DMA* and DMA4 are mapped to

    SPGR'/RY and SPGR'/'Y respectively3 and are configured to use *5-6it transfers+

    B+4+4 !iltering

    'he sampling fre

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    60/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    B+# PR$PRG;$SS&,Q

    B+#+* Preemphasis'he filtering is done similarly as in matla6+ Same filter is used

    cn!t flat hI2J 8 1,;.%"L

    See Fig"re %.1for plot from the input signal 6efore preemphasiCing and Fig"re %.2after.

    Fig"re %.2 'he signal =/utt8n9 6efore preemphasis

    Fig"re %.3 'he 6lue signal =/utt8n93and the preemphasiCed red signal3 =/preemp8n9

    ///////////////////////////////////////////////////////////////////////////50

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    61/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    B+#+4 )oice Activation Detectionhen you have got access to a sampled discrete signal it is significant to reduce the data

    to contain only the samples 7hich is represented 7ith signal values3 not noise+ 'herefore

    the need of a good )oice Activation Detection function is needed+ 'here are many 7ays

    of doing this+ )oice activation detection is calculated similarly as in matla6 8chapter @9+

    'he varia6le alpha is though different from the matla6 version+ 'his varia6le is tuned due

    to another noise to ratio- and processing values compare to matla6 environment+

    See $

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    62/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    B+@ !RAM$(G;K&,Q V &,DG&,Q

    B+@+* !rame6locking'he filtering is done similarly as in matla6+ 'he frame length is #40 and an overlap of 5031W

    is used+ Fig"re %.4illustrates frame 40 after frame6locking+

    Fig"re %.&

    ///////////////////////////////////////////////////////////////////////////54

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    63/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    B+@+4 indo7ing using Hamming 7indo7After the frame6locking is done a Hamming7indo7 is applied to each frame+ 'his

    7indo7 is to reduce the signal discontinuity at the ends of each 6lock+ 'he formula used

    to apply the 7indo7ing is sho7n inEq.+.+.

    #+#+9*

    4cos8@530"@3098 Eq

    K

    kkw

    =

    Fig"re %.& illustrates frame 40 after 7indo7ing+ ,ote that the result gives a reduction of the

    discontinuity at the ends of the 6lock+

    Fig"re %.6

    ///////////////////////////////////////////////////////////////////////////5#

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    64/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    B+" !$A'XR$ $Y'RA;'&G,A "*4 pont !!' is used on each 7indo7ed frame+ 'o adust the length3 Ceropadding is used+

    Se figure :+:+

    Fig"re %.7

    'he functionfft256.c O6ook3 is modified in order to calculate the fft for the signal after

    7indo7ing+ $=actly the same filter6ank as in matla6 is used+ See chapter #+@+4 _Mel spectrum

    coefficients 7ith filter6ank`+ Mel-;epstrum coefficients are calculated usingEq. +.6+ A lo7-time lifter is then used in order to remove the first t7o coefficients using formula inEq. +."+

    An e=tra coefficient 7hich contains the log of signal energy is added for each frame3 as sho7n

    inEq. +.%+

    B+5 !$A'XR$ )$;'GRS Mel !re

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    65/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    B+: '$S'&,Q'he models created as 7e descri6ed in chapter " are no7 stored in the DSP memory+ After 7e

    processed the input speech from the microphone to the Mel !re

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    66/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Fig"re %.%

    'o ma=imiCe P 8

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    67/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    ;alculating the state 7hich gave the largest (og8PN9 at time '+ Xsed in 6acktracking later

    on+

    B+1+5 PathState se

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    68/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    Gther7ise3 terminate the algorithm+

    B+*0 D$('A V A;;$($RA'&G, ;G$!!&;&$,'S

    'he delta and acceleration coefficients are calculated to increase the information of thehuman perception+ 'he delta coefficients are a6out time difference3 the acceleration

    coefficients are a6out the second time derivative+

    Delta and acceleration coefficients are though not used 7hen implementing the Speech

    recognition in DSK3 6ecause of its small effect in the final result+

    Another reason for not using the delta V acceleration coefficients in DSK is to save po7er

    from processor and to shorten the calculation time+

    B+** 'H$ R$SX('

    B+**+* 'he Score

    'he score from each stored model tested to7ards the input 7ord is stored in a result vector+

    'he highest score is naturally the highest pro6a6ility that the compared model has produced

    the given test utterance+

    Depending of 7hich the score 7as3 7e turn on different ($Ds on the !"##+

    7ord ($Ds ($Ds on DSP_one` 00000*

    _t7o` 0000*0

    _three` 0000**

    _four` 000*00

    _five` 000*0*

    ///////////////////////////////////////////////////////////////////////////5B

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    69/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    1+ $valuation

    1+* MA'(A

    1+*+* Mat(a6 Result

    'he Mat(a6 result is given in +able 1A +able 2A +able 3A +able 4 and +able &. 'he different

    ta6les sho7s the score for the different models to 6e the one generating the utterances tested+

    'he num6er of utterances tested are "0 times " 8" different 7ords9+ 'here is a recognition rate

    of *00W+

    ///////////////////////////////////////////////////////////////////////////51

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    70/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he follo7ing columns are the result score from testing utterances of Mne against the

    different models+

    *+0e00# N

    Bone Bt!o Bthree B)o"r B)ie

    -0+1::0 -4+1:55 -#+4"B# -4+4#:: -4+"0:0 -*+00B5 -#+*":* -#+4##@ -4+"#B@ -4+50#B

    -0+B"0@ -4+10*4 -4+11#1 -4+*"55 -4+0:1" -0+1@@# -4+55@1 -4+1##4 -4+*5"# -4+4:"@

    -0+1@5: -#+04@* -#+*B*4 -4+"5#@ -4+@"0:

    -0+B1@0 -4+1B:@ -#+**4: -4+#1"5 -4+@#1* -0+1#B5 -4+"1@4 -4+1@4# -4+##5B -4+"44B

    -0+BB4" -4+50B: -4+B@5# -4+@@B# -4+#1:B -0+B"B5 -4+:04B -4+1#B1 -4+45@B -4+#@45

    -0+:1"0 -4+"B@# -4+5"1: -4+4*#4 -*+1*4# -0+B"4# -4+5*1@ -4+:1:# -4+@14@ -4+04#:

    -0+1#1B -4+:"1# -4+:*BB -4+#"1: -4+41@# -0+14"* -4+::"4 -#+*"10 -4+:#1: -4+"5#5

    -0+1:*B -4+1#4" -#+4@1B -4+5@#* -4+5#** -0+B:#: -4+:55* -#+*#"0 -4+"014 -4+@#:5

    -0+1#"" -4+1:#0 -#+*:1: -4+B#@B -4+#:10 -0+1#0" -4+:1*5 -4+BB@: -4+B"@1 -4+4@#:

    -0+1##4 -4+::*0 -#+0@#* -4+:#54 -4+@@55 -0+1#5B -#+0@1* -#+##B1 -4+501: -4+"#0#

    -0+B"15 -4+B4:@ -4+B5#0 -4+*540 -4+#15# -0+1**0 -4+1@14 -4+1*11 -4+*5"4 -*+1:*B

    -0+B0"* -4+B@@B -4+1:@4 -4+05:1 -4+4B*1 -0+B"1: -4+B:5* -4+1#00 -4+010# -4+#45@

    -0+BB01 -#+*B"0 -#+*1*B -4+#B50 -4+#555

    -0+:BB" -4+@::* -4+@:11 -4+014" -*+1"00 -0+B5*# -4+5:B: -4+:"#1 -4+#1B@ -4+**0#

    -0+B5@@ -4+:@5: -4+1@01 -4+4#"5 -4+"40"

    -0+B154 -4+50:* -4+B@B: -4+""@* -4+0404 -0+B0*0 -4+"*0: -4+51:@ -4+@4*5 -4+0**:

    -0+:@45 -4+4"#4 -4+4"B# -*+B"5" -*+"B:5 -*+00*0 -4+B01@ -#+4":: -4+#0:5 -4+B"0B

    -0+1B4: -4+1""# -#+*#1: -4+*4": -4+@511 -0+1:11 -#+0"40 -#+0@"" -4+@5#: -4+@1B1

    -0+14#: -#+0B5: -#+**@: -4+*#5@ -4+040@ -0+1"@1 -4+B1*4 -4+1"#* -4+0B54 -4+@0@1

    -*+0:5# -#+@*0: -#+@5#B -4+4B1" -4+@BB@ -*+0##1 -#+@@14 -#+@:5* -4+45B# -4+5#":

    -0+1#@" -#+0**0 -4+B@#* -4+01#: -4+04B1 -*+0B0" -#+5@#0 -#+"@#0 -4+1@0B -4+5B*4

    -*+4"40 -@+001B -#+1:4B -#+0:"* -4+B"54 -*+**"* -#+45B5 -#+4105 -4+:*00 -4+"::4

    -0+1@4@ -#+4:B@ -#+**B4 -4+#"01 -4+410B -*+0*1B -#+#011 -#+*B:1 -4+445" -4+@@@*

    -0+1*5B -4+15:: -4+1:45 -4+4":0 -4+"011

    -*+014@ -#+"5*@ -#+@":B -4+B14: -4+5404 -*+00@" -#+4B#0 -#+01"# -4+"@#0 -4+@#@:

    -*+0"50 -#+4@@@ -#+4@*: -4+54"@ -4+:0#0

    -*+0B04 -#+#B"* -#+@"1" -4+@::4 -4+##1@ -0+1"": -#+404@ -#+45"1 -4+#51B -4+#B15

    -*+**#: -#+@:@@ -#+"44* -4+"B"* -4+:404+able 9.1

    'he first column al7ays sho7s the highest score+ Means that the recognition rate for theutterance Mneis *00W of the "0 utterances tested+

    ///////////////////////////////////////////////////////////////////////////:0

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    71/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he follo7ing columns are the result score from testing utterances of Mtw against the

    different models+

    *+0e00# N

    Bone Bt!o Bthree B)o"r B)ie

    -*+B4:* -0+5:15 -*+#5B5 -4+*:*0 -*+@044 -*+"@"5 -0+511: -*+@@5* -4+44** -*+4@1*

    -*+B":5 -0+5::: -*+4*#" -4+*0B: -*+#00@ -*+"":4 -0+5@#4 -*+*"** -4+*4"0 -*+*5*B

    -*+"5B* -0+541# -*+4*#0 -*+1*54 -*+40:4

    -*+:0B1 -0+554* -*+4@5B -4+44*1 -*+4B0B -*+5#5@ -0+5B:5 -*+"*4# -4+"04@ -*+@#*"

    -*+5@:B -0+50@* -*+#01# -4+4:0: -*+@4## -*+1:@0 -0+5:55 -*+41B4 -4+##*B -*+"40*

    -*+B"00 -0+5:"" -*+4#*# -4+*""B -*+#0#* -*+B4** -0+:**4 -*+@010 -4+45B@ -*+@":@

    -*+51** -0+5:00 -*+#BB5 -4+@5B0 -*+@@00 -*+:B14 -0+51"1 -*+@@0B -4+5*B4 -*+@:1*

    -*+:5@5 -0+:**# -*+#@4@ -4+4#BB -*+@4": -*+44## -0+""5: -*+*1B" -*+B:B# -*+01:0

    -*+#51: -0+5:45 -*+@#0* -4+*@*: -*+450* -*+@:41 -0+5"15 -*+#0#: -4+*4@1 -*+451"

    -*+:*5" -0+515" -*+"04B -4+4@1B -*+@0*5 -*+"#:1 -0+:*4" -*+"*44 -4+#1B4 -*+#B45

    -*+@@0" -0+:450 -*+#:": -4+#*## -*+#0#4 -*+@@4# -0+5#:: -*+#*40 -4+0##: -*+4@*#

    -*+"B"B -0+"1## -*+40B@ -4+*5B* -*+4::: -*+5**1 -0+:00B -*+#5#@ -4+#:#@ -*+41"#

    -*+"*4B -0+:*54 -*+@"B# -4+"*": -*+4:*1

    -*+#:0" -0+5@@0 -*+#*05 -4+0B5B -*+*B:0 -*+"4": -0+5@1# -*+#1#B -4+451" -*+4@00

    -*+54:B -0+55*" -*+@41" -4+#B41 -*+#B##

    -*+B51@ -0+54:0 -*+45@0 -4+#*54 -*+#4@5 -*+:4:4 -0+55:1 -*+4#"1 -4+**#" -*+4#*1

    -*+#5#* -0+"B0" -*+*4B# -*+1#44 -*+0@": -*+@454 -0+54:5 -*+#4** -4+0":: -*+01*#

    -*+"1B# -0+50*# -*+*"#4 -4+*0#@ -*+#@## -*+5@*0 -0+"B*0 -*+*@:B -*+B1:1 -*+*1BB

    -*+50B4 -0+":B: -*+*4#0 -*+1#*" -*+44@# -*+:::5 -0+5B01 -*+##4# -4+4#*4 -*+41BB

    -*+:0@0 -0+:##0 -*+#@## -4+0@"@ -*+@#** -4+40:* -0+:55# -*+4B*1 -4+@5*: -*+"@"5

    -4+0*@@ -0+5B@" -*+#@#: -4+#:@1 -*+@5#* -4+*#:1 -0+:0@: -*+#:@5 -4+5410 -*+5045

    -*+BB:1 -0+5:@B -*+4410 -4+4"B4 -*+@0"0 -*+BB04 -0+5B05 -*+40#1 -*+11*: -*+@#B@

    -4+0414 -0+:4#B -*+@##* -4+40"B -*+50:B -*+54"" -0+5*@B -*+445* -*+14*B -*+4#4#

    -*+1#44 -0+5@B5 -*+*B:0 -4+**"4 -*+45"1

    -4+**5: -0+55** -*+401B -4+0:*1 -*+#@B: -*+::": -0+5"B@ -*+##@5 -*+1B:B -*+45"*

    -*+B"11 -0+50*1 -*+**05 -4+01"# -*+*B05

    -*+B4:0 -0+5@#5 -*+*"11 -*+1B": -*+#55@ -*+14:4 -0+5"*0 -*+40@@ -4+4@"1 -*+@50"

    -*+1@@0 -0+5::5 -*+445" -4+4#B5 -*+@44B+able 9.2

    'he first column al7ays sho7s the highest score+ Means that the recognition rate for theutterance Mtwis *00W of the "0 utterances tested+

    ///////////////////////////////////////////////////////////////////////////:*

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    72/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he follo7ing columns are the result score from testing utterances of Mthee against the

    different models+

    *+0e00# N

    Bone Bt!o Bthree B)o"r B)ie

    -4+B*:B -*+"4@# -0+:5@: -4+"@#B -*+":05 -4+"@:* -*+#151 -0+:4"0 -4+#1"1 -*+@#B0

    -4+"@#* -*+5:0* -0+B0"1 -4+:B4B -*+:@4@ -*+:@15 -*+0@01 -0+"@:* -4+0#41 -*+*:B"

    -4+"@41 -*+@@"5 -0+5"@: -4+"0B1 -*+"0B*

    -4+*"00 -*+4""4 -0+5#1B -4+@*41 -*+#B#B -4+##1B -*+#@05 -0+5#4@ -4+#:*@ -*+@@4B

    -4+B@:" -*+"B10 -0+5:5@ -4+@144 -*+"445 -4+:5#0 -*+@5:# -0+:*#1 -4+"@0" -*+"#0B

    -#+44@1 -*+55"1 -0+10## -4+51BB -*+:5@@ -4+5:*B -*+@1B@ -0+:4*: -4+""## -*+"51*

    -4+"B0@ -*+5*44 -0+5"5B -4+":0* -*+54:4 -4+"5"* -*+@#"@ -0+5*5# -4+05@# -*+#"4@

    -4+B*#0 -*+"@45 -0+:### -4+@@*# -*+"00* -4+B":* -*+5:0" -0+::5@ -4+:B:" -*+545*

    -4+#*00 -*+#*B" -0+5@#@ -4+#*@# -*+#*B: -#+#B:" -*+:55: -0+:B0B -4+:*@* -*+:*#4

    -#+0@@B -*+"B1: -0+:0:* -4+@:#: -*+"#B: -4+"*@@ -*+#"*0 -0+5"0" -4+*4*4 -*+#:14

    -4+*"5* -*+44*" -0+"B*: -4+*@5: -*+#@01 -4+BB"B -*+5"#B -0+:B05 -4+4:"B -*+54@B

    -#+00:# -*+"@1: -0+55#1 -4+@#05 -*+@#44 -#+0:45 -*+"@*1 -0+:"*5 -4+"#": -*+"@:0

    -4+B551 -*+55*@ -0+::0* -4+BB4# -*+:#14

    -#+@4@: -*+:1:1 -0+:*"0 -4+#110 -*+"::5 -4+#:#@ -*+#:0# -0+5#1" -4+"#*5 -*+@5#*

    -4+@4#* -*+410B -0+"1B1 -4+@@04 -*+@*4:

    -4+B*15 -*+@B00 -0+::@B -4+B4BB -*+5:"5 -4+B5"0 -*+51@1 -0+:40B -4+:""B -*+5B:"

    -4+B0B: -*+:4#* -0+:"1* -4+@14: -*+5541 -4+5:45 -*+""41 -0+5B:# -4+55#4 -*+"B#5

    -4+@454 -*+@:5# -0+50B4 -4+":@1 -*+"*:" -#+#@BB -*+BB51 -0+:"** -4+1@4: -*+B@0B

    -4+@40@ -*+"##" -0+5:@# -4+:5:@ -*+5":B -4+:@#B -*+":"@ -0+:#5@ -4+1@*: -*+B#":

    -4+"5:B -*+@5B# -0+:01: -4+B*0@ -*+5B@B -4+#40B -*+@0:* -0+5B"4 -4+B0"5 -*+555*

    -4+#:5# -*+##*B -0+:*** -4+B4B4 -*+5#4B -4+*":B -*+#"1@ -0+5410 -4+"*@@ -*+":@#

    -4+@@:1 -*+@::B -0+51*B -4+@#4B -*+"#1# -4+5**" -*+:@04 -0+:4@: -4+@:5" -*+:*:*

    -4+450" -*+@@1@ -0+55@* -4+54#1 -*+5*1* -4+#44" -*+@0*0 -0+5@4: -4+54#: -*+"B#5

    -4+::@4 -*+"::5 -0+5"51 -4+5410 -*+55#5

    -4+"515 -*+"@@* -0+5@@B -4+":0@ -*+""B4 -4+:004 -*+54B0 -0+1"1B -4+:*0* -*+:@5@

    -4+*#14 -*+@*15 -0+5#11 -4+"4": -*+@"4*

    -4+#5:# -*+@11* -0+54:5 -4+"0#1 -*+@B5@ -4+*B": -*+#541 -0+5"44 -4+#1:B -*+#1#1

    -4+:00# -*+"":B -0+:@:@ -4+#":@ -*+@"5:+able 9.3

    'he first column al7ays sho7s the highest score+ Means that the recognition rate for theutterance Mtheeis *00W of the "0 utterances tested+

    ///////////////////////////////////////////////////////////////////////////:4

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    73/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he follo7ing columns are the result score from testing utterances of Mfu against the

    different models+

    *+0e00# N

    Bone Bt!o Bthree B)o"r B)ie

    -4+#B## -#+*@@B -4+@:B" -0+B10: -*+B50* -4+B@"5 -#+#5@1 -4+:1@# -*+04@* -4+0#""

    -4+401# -4+:1"" -4+#@44 -0+B::5 -*+1#:1 -4+@5@0 -#+#""1 -4+:145 -0+155B -4+0"BB

    -0+5B:0 -4+@04# -4+#*0" -0+50#4 -*+51*"

    -4+4::B -#+04** -4+"0*@ -0+10B: -*+1*"5 -*+44"0 -4+@@:* -4+":1# -0+5:40 -*+B#4#

    -*+0"50 -4+:B0# -4+5514 -0+514: -4+0#@1 -*+010* -4+#4"* -4+*"B4 -0+5*#" -*+5*B#

    -0+B@5# -4+05#B -*+BB4B -0+"1#4 -*+@B@" -*+0#** -4+@*B" -4+*@50 -0+5#4* -*+"B"4

    -4+1*#5 -#+*:5* -4+:0@4 -*+0"5: -4+04@1 -*+B"*@ -4+5*04 -4+#5@# -0+B#"B -*+:115

    -*+@#:* -4+"B5: -4+#B"" -0+5B0: -*+B0B: -*+*44: -4+"455 -4+5*5" -0+5B"5 -*+B5"B

    -4+#5"4 -#+*0"# -4+B"B0 -0+1:4@ -4+*"*0 -*+44"0 -4+@@:* -4+":1# -0+5:40 -*+B#4#

    -*+00#5 -4+:0*: -4+:B:: -0+:**1 -4+0*0@ -4+0*@# -4+1#0: -4+10:1 -0+B"*" -4+0@@#

    -0+1##0 -*+:1B@ -*+BB*0 -0+55*# -*+4105 -*+:@"" -4+@@0@ -4+44*# -0+:B@@ -*+:*0#

    -*+*:"" -4+:451 -4+:#1: -0+5B@* -*+1*5" -0+1"14 -4+"#1* -4+54#4 -0+5@1# -*+B001

    -*+B4"B -4+1#*# -4+B:01 -0+BB50 -4+0B":

    -*+@4:5 -4+5@5@ -4+@@15 -0+:#1: -*+B*@1 -*+1"#" -4+B#0@ -4+@110 -0+B""@ -*+11:B

    -0+1#"" -4+"4*" -4+"0@: -0+55:1 -*+B**0

    -4+@#"* -4+4B1@ -*+"1@1 -0+B:#* -*+@5## -0+1151 -4+#BB1 -4+41*" -0+5:4# -*+:"4*

    -*+*0#* -4+41"B -4+45@5 -0+51B4 -*+5:#: -0+B11" -4+#4B1 -4+*150 -0+"14B -*+":*0

    -*+@:"B -4+5055 -4+@5*# -0+:@:1 -*+B050 -*+4### -4+B**1 -4+::4* -0+:B14 -*+5150

    -0+10B@ -4+##*# -4+##:# -0+5*** -*+5@1: -0+1:#0 -4+4*"1 -4+#0:1 -0+5**: -*+"*#4

    -*+0#B4 -4+5055 -4+B"50 -0+5B41 -*+10B: -*+*:05 -4+"0#: -4+5@"B -0+:#04 -*+:#45

    -*+**1: -4+""*4 -4+50** -0+:0#@ -*+:441 -*+*5B" -4+B"10 -4+51:0 -0+:@:* -*+15:0

    -*+B**" -4+:444 -4+5:5* -0+B@"1 -*+1514 -4+45:0 -4+"@:5 -4+4:40 -0+:5:4 -*+"#"5

    -*+05*0 -4+5#4" -4+:0*@ -0+5BB* -*+B5BB -*+@":0 -4+:"1" -4+1#@4 -0+B*5" -4+0*5B

    -4+5:@1 -#+4*@" -4+B"50 -0+1B"@ -4+0::B

    -*+5"55 -4+:*0" -4+:B"B -0+::*0 -*+1450 -*+414# -4+:#5: -4+:B@0 -0+:#@0 -*+B51"

    -*+40"# -4+54@4 -4+"1:* -0+5:"4 -*+::0B

    -*+@#B" -4+::00 -#+0##0 -0+:14# -*+155@ -*+1:** -4+:0B4 -4+":*@ -0+B"*@ -*+:"*@

    -*+*:4: -4+""4@ -4+#*#* -0+:0B# -*+:#@B+able 9.4

    'he first column al7ays sho7s the highest score+ Means that the recognition rate for theutterance Mfuis *00W of the "0 utterances tested+

    ///////////////////////////////////////////////////////////////////////////:#

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    74/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he follo7ing columns are the result score from testing utterances of Mf&e against the

    different models+

    *+0e00# N

    Bone Bt!o Bthree B)o"r B)ie

    -*+""5# -4+00"1 -*+5454 -*+B@5@ -0+::"# -4+0B0" -*+1B14 -*+#@@@ -*+@45B -0+:1"B

    -*+:@#: -4+00#@ -*+:"*4 -4+#11B -0+B4"5 -*+40"B -*+@:5" -*+*#1# -*+0:": -0+"01"

    -*+@#:5 -*+@00B -*+*4@5 -0+1B0" -0+"1*B

    -*+0B5* -*+#B@@ -*+4#5@ -*+*"@# -0+"0@4 -*+#B:5 -*+""#@ -*+4#04 -*+4*54 -0+":1B

    -*+"1@B -*+:B11 -*+"B05 -*+"B4@ -0+5"5* -*+111" -*+:540 -*+4:5: -*+*0:" -0+55##

    -*+B#@0 -*+1:04 -*+515@ -*+B"5" -0+:015 -4+5B1# -4+50B0 -4+##4@ -4+BB"# -*+00*1

    -*+40B@ -*+@#1B -*+44:0 -*+**01 -0+"*@5 -*+#@@5 -*+54:5 -*+@*1@ -*+B5:B -0+551*

    -*+:##B -*+B511 -*+51@B -*+1@@* -0+:"0# -*+011@ -*+@1B@ -*+#*## -*+*:0* -0+"#@4

    -*+#@"1 -*+5"5B -*+5*#5 -*+1:"5 -0+5"*5 -*+04*: -*+@0#: -*+*41" -*+4@"B -0+"@#:

    -4+0@00 -4+#0#@ -4+**11 -4+::41 -0+1045 -*+4B41 -*+5*#1 -*+@5B* -*+1::1 -0+:@11

    -4+4@*1 -4+*B*4 -*+55*5 -*+5":0 -0+B@@* -*+#01B -*+5"11 -*+5*"1 -*+:@:# -0+51::

    -*+BB*5 -*+B:0: -*+#155 -*+@"0@ -0+:":B -*+:#"0 -*+:1:B -*+@05B -*+#0*# -0+5:45

    -*+B00# -*+B::* -*+541# -*+54B0 -0+5B0B

    -*+@:4# -*+:1"0 -*+"":1 -*+55"" -0+5"5@ -*+:001 -*+""*4 -*+4*"0 -*+@4*# -0+":1#

    -0+:BB: -*+*::0 -0+1115 -*+*B@" -0+"*4B

    -*+*@:" -*+@051 -*+4#:" -*+B#*B -0+"11@ -4+005" -*+B1"* -*+#"01 -*+#""B -0+:0*4

    -0+B*1B -0+11*" -0+B14: -*+0B## -0+#B1* -*+50B0 -*+B@11 -*+#@@0 -*+@#0" -0+551:

    -4+0:4# -4+5@4: -4+#0## -4+@B:4 -*+0*@5 -*+@B5B -*+"144 -*+#@"1 -*+##:@ -0+""4@

    -4+*B#1 -4+*"5@ -*+1":B -*+1:#" -0+B*:# -*+@1:# -*+:""@ -*+"4@: -*+#"#B -0+504*

    -4+0B01 -*+B1:* -*+5044 -*+5""B -0+51## -*+@#:4 -*+101: -*+B4@# -4+0@40 -0+:*@B

    -4+*50@ -4+04@5 -*+5#@B -*+115B -0+::41 -*+B5B4 -*+B@@# -*+@:5* -*+#50" -0+5#41

    -4+*1*" -*+1@#0 -*+@B#@ -*+4@0" -0+510* -*+1451 -*+B45" -*+#104 -*+@010 -0+5B51

    -4+5":B -4+*05B -*+5@*0 -*+B@B4 -0+BBB1 -4+@5*" -4+40@4 -*+1*:: -*+B15* -0+B@:4

    -*+1:## -4+*@0B -*+54*B -4+*"#4 -0+B#:0

    -*+5#:" -*+B410 -*+:B*1 -*+1B:@ -0+:#41 -*+*B@: -*+4455 -*+40#5 -*+@:45 -0+"#B4

    -*+4:4: -*+@B"1 -*+4:54 -*+*@:4 -0+"41:

    -*+5111 -*+5:B5 -*+5*5" -*+51#* -0+5BB4 -*+*5:5 -*+"4"* -*+#:#1 -*+*B#@ -0+"0@5

    -*+41*@ -*+@B*# -*+44*1 -*+*"B# -0+":04+able 9.&

    'he first column al7ays sho7s the highest score+ Means that the recognition rate for theutterance Mf&eis *00W of the "0 utterances tested+

    ///////////////////////////////////////////////////////////////////////////:@

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    75/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    1+4 DSK

    1+4+* DSK Result

    'he ta6les 6elo7 are sho7ing the result 7hen tested the 7ords _one`-_five`+ 'he result is

    very satisfying 6ecause the models are not trained from the same person 7hich tested these

    7ords+

    'he follo7ing columns are the result score from testing utterances of Mne against the

    different models+

    Bone Bt!o Bthree B)o"r B)ie

    *5B4+"*1*# -4*B1+@0:"0 -4#4@+4BB:" -4#"B+@4:4" -*1@@+"5*5#

    -*B*4+0*05# -45*1+#B:4" -454#+001:" -451#+5#@4" -*144+*""BB

    -*:@B+1":#B -4#B"+@#5:" -4"4"+004:" -4":"+@14"0 -*10@+455:"

    -*1:4+*#""0 -4BB:+4044" -41"5+4:B00 -#0@B+@0*"0 -4405+:0#:"

    -*54"+"0@*# -4##*+B4:"0 -4"0*+55":" -4"#1+@004" -*B*#+*"#BB

    -*:B*+*#5:" -4#:4+#1":" -4"#1+B1:4" -454*+54@"0 -*B1@+@#@"0

    -*B0@+*00BB -4"1:+:B#:" -45:1+5@0:" -4:BB+:"*00 -*BBB+@45:"

    -*B4*+5115# -4:10+BB1"0 -4:B1+0##4" -4B1:+B0#00 -40#*+*54#B

    -*:"*+*4*4" -4501+@#4:" -4"1#+01*4" -45*4+#"100 -40#B+@B*"0

    -*BB#+@"B4" -4B@4+4:500 -4B5*+B1@"0 -414"+:044" -4*BB+:#@00

    -*5@:+B14:" -4"*:+4@1:" -4"4B+B05"0 -45B0+#:""0 -*154+:@B"0

    'he follo7ing columns are the result score from testing utterances of Mtw against the

    different models+

    Bone Bt!o Bthree B)o"r B)ie

    -*:*0+1:B00 -*01@+00*5# -400*+:@1BB -400"+1":#B -*54@+#*000

    -*:*1+4@@#B -***4+"514" -*1:*+4"@5# -44#:+#4B00 -*5@5+*BB:"

    -*1:*+@41*# -***:+#11:" -*114+4:"#B -404:+":B:" -*:14+4#1"0

    -40B@+45@5# -**14+4*:00 -*B5B+:10*# -*1#@+"15BB -*B0@+054#B

    -4@4@+"B0:" -*#0:+B1:4" -4@**+B1500 -4":1+#5*00 -44@1+@@@00

    -*:B"+"B@4" -*0*:+5*45# -*5B:+:##"0 -*5"5+B1@00 -*"4#+:"B"0

    -*15:+":1BB -*0"B+B05BB -*:5@+:##5# -*:1*+1B@BB -*:#0+4:5:"

    -*:0@+05@:" -111+1B:*0 -*"##+0*"5# -*:@:+"1*4" -*50"+#1@*#

    -40:"+:0@5# -*04"+4":"5 -*:*1+1B15# -*B51+:"05# -*:BB+B:4:"

    -*B:0+"41#B -1@0+0B0#0 -*@*:+:*45# -*5@B+*1#4" -*"*4+"B5*#

    -*1::+0*@00 -**B:+"*"#B -*B:0+:*45# -*1@"+01:BB -*B05+54BBB

    ///////////////////////////////////////////////////////////////////////////:"

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    76/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he follo7ing columns are the result score from testing utterances of Mthee against the

    different models+

    Bone Bt!o Bthree B)o"r B)ie

    -*@@1+@B":" -*#4*+###4" -*0:*+4B1:" -*5@"+:5:4" -*@0B+"*1"0

    -4011+B05"0 -*5B@+0*"5# -*4"4+@"0"0 -40B@+@#B:" -40*1+#4:@@

    -4*15+":1:" -*:B5+:41"0 -*@05+#B*"0 -4*4"+@@:4" -40*5+@"1:"

    -*"51+5*@4" -*#"0+@@@:" -*40*+5"B#B -*:#@+*@45# -*"#5+#4#*#

    -*B1*+:0@5# -*@4@+"4B00 -**1*+:00"0 -*11B+500#B -*501+**400

    -4*4#+@**00 -*BBB+5#"#B -*44#+45#5# -4410+0":00 -*1:1+@0400

    -4*#"+1"44" -*5:0+B*#BB -*4"0+"@@BB -4*00+4@400 -*::*+5:0*#

    -*55:+05::" -*@"*+0:500 -*0"4+5":*# -*:1:+*""00 -*"55+#41BB

    -40@*+15"*# -*"::+5#"5# -*44#+4@04" -40#:+15@"0 -*:@#+4:0"0

    -*1"B+1*15# -*5"4+@#400 -*"*0+151*# -4*00+@"B00 -*:50+51400

    -*5#B+4B:00 -*###+*:""0 -*05:+4145# -*:"5+"@"5# -*"*4+#:#4"

    -*"*@+@:@BB -*":@+*14:" -**B*+"0"5# -40#1+4B#"0 -*""5+0*4#B

    -*B01+BB4BB -*:**+5***# -*4:#+##4BB -444@+#*0"0 -*55"+140#B

    -*B#:+5@0*# -*B0*+B@B"0 -*@0B+:0@BB -44B5+40B:" -*:5@+450:"

    'he follo7ing columns are the result score from testing utterances of Mfu against the

    different models+

    Bone Bt!o Bthree B)o"r B)ie

    -4**B+15::" -45"B+:@1:" -#0#0+:4@:" -40:#+"5@"0 -4514+*:100

    -4"@5+15100 -###1+"514" -#@:@+15100 -4##*+515"0 -411:+4*100

    -*#1"+B*::" -405@+#@*:" -4*@5+1B@4" -*#*5+"*5#B -*5B#+00*5#

    -4:B"+*50:" -#@*0+*#04" -#*11+151:" -4":*+1#500 -41#*+4@"4"

    -*15"+4:05# -44:1+@4#"0 -4"04+5@B00 -*:0#+#1"00 -4*"@+#:0:"

    -*100+@4@*# -4*B:+#"*4" -44*"+"@1"0 -*5*:+:B@*# -4@:*+"1"00

    -#@"1+0B000 -#"05+5:B"0 -#"@:+10#"0 -4:4*+"*:4" -#:*B+*::"0

    -4:#B+#**:" -4:1#+""::" -41*1+*:@00 -44#"+#*54" -41:0+*::4"

    -40:"+:14:" -4B41+#40"0 -4B@1+#@14" -*B:1+450BB -4"0:+@:#"0

    -*B4"+B@0BB -4#04+0044" -4"::+:4:4" -*"B4+45##B -4**1+"@0:"

    ///////////////////////////////////////////////////////////////////////////:5

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    77/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    'he follo7ing columns are the result score from testing utterances of Mf&e against the

    different models+

    Bone Bt!o Bthree B)o"r B)ie

    -45"1+:00:" -#0B*+##@"0 -#***+14*00 -##1"+@"B:" -4*B0+5B44"

    -4@B0+"@4:" -4B10+4**"0 -4B"5+*0":" -#0@"+##4"0 -4*1B+@4*00

    -450:+*5B:" -#*40+@#:00 -4B#:+05#00 -##0B+:@14" -*B*@+4:05#

    -4#""+#*::" -4""0+*:4:" -4"":+#5*4" -41*0+1:*"0 -*1"#+"::4"

    -4#10+"**:" -4514+#:@"0 -41B*+50*4" -#4@*+"#"00 -*B4@+4"B:"

    -444*+@*@4" -4504+"4"00 -45*:+@#*:" -4:4#+:01:" -*5@@+4:B5#

    -4@:@+514:" -451*+114:" -4:#@+:@4"0 -#04@+#B#4" -*B5:+#50"0

    -4*B:+#5400 -4"B4+@#100 -4@*#+55":" -4B5@+4*@"0 -*5:*+"::4"

    -4040+#4@5# -4"04+#:B:" -4451+14400 -4:54+000"0 -*":#+*1:5#

    -4@4*+4:""0 -450:+@B54" -4:BB+:15:" -41""+B4B"0 -40:"+:*:"0

    -*1B@+1044" -4#:*+@14:" -4@*#+B15:" -450#+114:" -*"11+4B4:"

    ///////////////////////////////////////////////////////////////////////////::

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    78/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    *0+ ;onclusion

    *0+* ;G,;(XS&G,

    'he conclusion of this master degree proect is that the theory for creating Mel !re

  • 8/12/2019 Speech Recognition Using Hidden Markov Model_MEE_03_19 (1)

    79/79

    M$$-0#-*1 Speech Recognition using Hidden Markov Model

    An implementation of the theory on a DSK-ADSP-!"## $%-K&' (&'$ R$) *+"

    ///////////////////////////////////////////////////////////////////////////

    **+ References

    OMM Mikael ,ilsson3 Marcus $narsson3 /eech Cec0n&t&n u!&n0 H&''en -ak

    -'el, M$$-0*-4:

    ORA (a7rence R+ Ra6iner+ Proceedings of the &$$$3 )G(+ ::3 ,G+ 43 !$RXARj

    *1B1+

    ODavid David Meier3 /eech Cec0n&t&n u!&n0 H&''en -ak -'el,M$$-1B-*@

    Oohn Deller ohn R+3 r+3 Hansen ohn +(+3 Proakis ohn Q+ 3:&!cete &me

    *ce!!&n0 f /eech /&0nal!3 &$$$ Press3 &S, 0-:B0#-"#B5-4

    O6ook ;hassaing Rulph3 DSP applications using ; and the 'MS#40;5= DSK3 40043

    &S, 0-@:*-40:"@-#

    OSS Anders Sv?rdstr.m3 /&0nale ch !!tem3 &S, 1*-@@-00B**-4