lao text-to-speech synthesis hmm- based method
TRANSCRIPT
![Page 1: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/1.jpg)
Lao Text-to-speech synthesis
HMM- based Method
National Authority for Science Technology
(NAST) , Lao PDR
12/20/0812/20/08 11
(NAST) , Lao PDR
National Electronic and Computer Technology Centre
(NECTEC), Thailand
![Page 2: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/2.jpg)
Status
� Phone inventory design for TTS (Done)
� Preparing resources and tools (Done)
� Designing and creating Lao text processing tools (Done)
12/20/0812/20/08
� Designing and creating Lao text processing tools (Done)
� Sentence selection & speech corpus preparation (Done)
� Training and Creating synthesizer by HMM-based speech
synthesis system toolkit (being trained)
![Page 3: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/3.jpg)
OverviewOverview
� Lao language and Lao writing system
� Phonemes design for Lao TTS
� Lao Sound systems
12/20/0812/20/08 33
� Syllable structure and syllable breaking
� Lao TTS Development process
![Page 4: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/4.jpg)
Introduction to Lao languageIntroduction to Lao language
� Lao language is the official language of Laos. It is a tonalLanguage of Tai family an is closely related to the IsanLanguage of the northeast region of Thailand
� Lao spoken Language can be divided into 3 main groups
12/20/0812/20/08 44
Lao spoken Language can be divided into 3 main groups
� Vientiane Lao
� Northern Lao (Luang Prabang)
� Southern Lao (Champasak)
![Page 5: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/5.jpg)
Writing systemsWriting systems
� Lao language are write from left to right
� No space between word and sentence
� One word can be one or more syllables
� Lao alphabets: Lao language are contained 78 characters, divided
in to 5 groups such as:
12/20/0812/20/08 55
in to 5 groups such as:
� 33 consonants
� 28 vowels
� 4 tone mark
� 10 Lao digits
� 3 special characters
![Page 6: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/6.jpg)
Writing system Cont.Writing system Cont.
Consonants:
� The single consonants can all be used as the main consonant of
a syllable and somes can be used at the end of a syllable or use
as the consonantal.
12/20/0812/20/08 66
Use as final consonant Common Cluster consonants
![Page 7: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/7.jpg)
Writing system Cont.Writing system Cont.
� Mixed Consonants
12/20/0812/20/08 77
![Page 8: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/8.jpg)
Writing system Cont.
�� VowelsVowels:: wewe separatedseparated intointo 22 groupgroup
SingleSingle vowelsvowels andand mixmix vowelsvowels asas youyou seesee thethe firstfirstlineline ofof imageimage bellowbellow isis aa singlesingle vowels,vowels, lastlast twotwo linelineisis mixmix vowelsvowels..
12/20/0812/20/08 88
isis mixmix vowelsvowels..
![Page 9: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/9.jpg)
Writing system Cont.
1.1. ToneTone marksmarks
TonalTonal marksmarks areare anotheranother groupgroup ofof alphabetalphabet lettersletters whichwhich areare
symbolssymbols characterizingcharacterizing thethe changechange ofof thethe soundssounds ofof aa
syllablesyllable toto followfollow thethe rulerule ofof consonantsconsonants soundssounds withwith short,short,
12/20/0812/20/08 99
syllablesyllable toto followfollow thethe rulerule ofof consonantsconsonants soundssounds withwith short,short,
medium,medium, low,low, highhigh tonetone..
![Page 10: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/10.jpg)
Writing system Cont.
1. Special symbol
2.2. Lao digitsLao digits
12/20/0812/20/08 1010
![Page 11: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/11.jpg)
Phoneme design for Lao TTSPhoneme design for Lao TTS
12/20/0812/20/08 1111
See All
![Page 12: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/12.jpg)
SoundSound systems
1. Consonants sound
� Low consonants: kh,ng,s,y,th,n,ph,f,m,l,w,h (ຄຄຄຄ ງງງງ ຊຊຊຊ ຍຍຍຍ ທທທທ ນນນນ ພພພພ ຟຟຟຟ ມມມມ ລລລລ ວວວວ ຮຮຮຮ)
� Mid consonants: k,c,d,t,b,p,j,z (ກກກກ ຈຈຈຈ ດດດດ ຕຕຕຕ ບບບບ ປປປປ ຢຢຢຢ ອອອອ)
� High consonants: kh,s,th,ph,f,h (ຂຂຂຂ ສສສສ ຖຖຖຖ ຜຜຜຜ ຝຝຝຝ ຫຫຫຫ)
12/20/0812/20/08 1212
2. Tones sound
![Page 13: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/13.jpg)
SoundSound systems
3. vowels sound
a, i, v, u, q, e, x, o, @, ua, ia, va
aa, ii, vv, uu, qq, ee, xx, oo, @@, uua, iia, vva
(ອາອາອາອາ ອີອີອີອີ ອືອືອືອື ອູອູອູອູ ເອີເອີເອີເອີ ເອເອເອເອ ແອແອແອແອ ໂອໂອໂອໂອ ອໍອໍອໍອໍ ອົວອົວອົວອົວ ເອຍເອຍເອຍເອຍ ເອືອເອືອເອືອເອືອ) Long
12/20/0812/20/08 1313
(ອະອະອະອະ ອິອິອິອິ ອຶອຶອຶອຶ ອຸອຸອຸອຸ ເອິເອິເອິເອິ ເອະເອະເອະເອະ ແອະແອະແອະແອະ ໂອະໂອະໂອະໂອະ ເອາະເອາະເອາະເອາະ ອົວະອົວະອົວະອົວະ ເອັຍເອັຍເອັຍເອັຍ ເອຶອເອຶອເອຶອເອຶອ) Short
![Page 14: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/14.jpg)
SoundSound systems
12/20/0812/20/08
![Page 15: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/15.jpg)
final sound or tone final sound or tone mark mark ��
final sound of word final sound of word are are
unstoped:thunstoped:the word that e word that ends with ends with
nasal nasal sound:sound:
m,n,ng (m,n,ng (ມມ,,ນນ,,ງງ)) or or
unstoped unstoped vowels vowels sound. sound.
final sound of word final sound of word is stoped is stoped
::k,ng,d,b (k,ng,d,b (ກກ,,ງງ,,ດດ,,ບບ),), and and
vowels length vowels length is sorth is sorth
a,i,v,u,q,e,x,o,a,i,v,u,q,e,x,o,@,ua,ia,va @,ua,ia,va ((ະະ ິິ ຶຶຸຸ ເອິເອິ ເອະເອະ
ແອະແອະ ໂອະໂອະ ເອາະເອາະ ອົວະອົວະ ເອັຍເອັຍເອຶອເອຶອ)) (the (the word ends word ends with stoped with stoped
consonant or consonant or vowel sound)vowel sound)
final sound of word is final sound of word is stoped:stoped:k,ng,d,b (k,ng,d,b (ກກ,,ງງ,,ດດ,,ບບ),), and vowels and vowels
length is long: length is long: aa,ii,vv,uu,qq,ee,xx,oaa,ii,vv,uu,qq,ee,xx,oo,@@,uua,iia,vva (o,@@,uua,iia,vva (າາີີ ືື ູູ ເອີເອີ ເອເອ ແອແອ ໂອໂອ ອໍອໍ ອົວອົວ ເອຍເອຍເອືອເອືອ)) (the word ends (the word ends
with stoped with stoped consonant there are consonant there are
no long,stoped no long,stoped vowels)vowels)
any word with the any word with the first tone first tone
markmark (x(x່່))
any word with the second any word with the second tone marktone mark (x(x້້))
the first consonants of the first consonants of word word ��
12/20/0812/20/08
high consonants: high consonants: kh,s,th,ph,f,h (kh,s,th,ph,f,h (ຂຂ ສສ ຖຖ ຜຜ ຝຝ ຫຫ))
rising (4)rising (4) high (3)high (3) low falling (0)low falling (0) mid (1)mid (1) high falling (2)high falling (2)
mid consonants: mid consonants: k,c,d,t,b,p,j,z (k,c,d,t,b,p,j,z (ກກ ຈຈ ດດ ຕຕ ບບ ປປ ຢຢ ອອ))
rising (4)rising (4) high (3)high (3) low falling (0)low falling (0) mid (1)mid (1) high falling (2)high falling (2)
low consonants: low consonants: kh,ng,s,y,th,n,ph,kh,ng,s,y,th,n,ph,f,m,l,w,h (f,m,l,w,h (ຄຄ ງງ ຊຊ ຍຍ ທທ ນນ ພພ ຟຟ ມມ ລລ ວວ ຮຮ))
high (3)high (3) mid (1)mid (1) high falling (2)high falling (2) mid (1)mid (1) high falling (2)high falling (2)
![Page 16: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/16.jpg)
Lao Syllable structureLao Syllable structure
Lao syllable was designed as “Lao syllable was designed as “CV, CVC, CVV, CV, CVC, CVV,
and CVVCand CVVC ””
-- C: Main or nuclear Consonants C: Main or nuclear Consonants
-- V: vowelsV: vowels
12/20/0812/20/08 1616
-- C: Consonantal or final consonantsC: Consonantal or final consonants
![Page 17: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/17.jpg)
Lao Syllable breakingLao Syllable breaking
1212//2020//0808 1717
![Page 18: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/18.jpg)
Lao TTS developmentLao TTS development
Have 2 main modules: Natural Language Processing or NLPModule and Digital signal processing or DSP modules
12/20/0812/20/08 1818
Natural Language Processing (NLP)
Digital SignalProcessing (DSP)
PhoneTranscription
Prosody
Text Speech
![Page 19: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/19.jpg)
Natural Language Processing
� Text analysis
� Implemented using Lao text corpus (5 MB)
� Sentences end marker by space and sentence long
(11,159 sentences)
1212//2020//0808
(11,159 sentences)
� Syllable breaking based on Lao syllabification techniques
� Grapheme-to-phoneme (G2P) was implemented using Finite state Machines (FSMs) Toolkit
![Page 20: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/20.jpg)
Natural Language Processing
� Speech corpus
� 1,619 sentences were selected for recording
� Including 60 phoneme and 5 tones
� By female speaker
� Time used: 15 hours
1212//2020//0808
� Time used: 15 hours
� Sentence cutting by Manual
� Prosody generation
� Using HTS toolkit to generate speech parameterMel-Cestrum (MCEP), duration and Log fundamental frequency (Log F0) were extracted from each utterance in the speech corpus
![Page 21: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/21.jpg)
Digital Signal Processing (DSP)
HMM-Based for speech synthesizer and now under
Labeling files preparation and being training
12/20/0812/20/08
![Page 22: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/22.jpg)
HTS Labeling files preparation
� mono.dic : list all lao syllables with phones and tones in speech database
� word.mlf : list all lao syllable with sentence marker in speech database
12/20/0812/20/08
� mono.unit: list all phones with tone in speech database
� mono.list : list of all phonemes in speech database.
� questions_qstLao001.hed : list of all context and properties format for tree-based context clustering.
![Page 23: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/23.jpg)
mono.dic word.mlf
1212//2020//0808
![Page 24: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/24.jpg)
questions_qstLao001.hed
1212//2020//0808
![Page 25: Lao Text-to-speech synthesis HMM- based Method](https://reader035.vdocuments.site/reader035/viewer/2022071602/613d576f736caf36b75c2fa2/html5/thumbnails/25.jpg)
Thank you for your attentionThank you for your attention
12/20/0812/20/08 2525
Khop cai lai lai !Khop cai lai lai !