audio workgroup neuro-inspired speech recognition
TRANSCRIPT
![Page 1: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/1.jpg)
Audio WorkgroupAudio Workgroup
Neuro-inspired Speech RecognitionNeuro-inspired Speech Recognition
![Page 2: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/2.jpg)
Audio WorkgroupAudio Workgroup
Localization EffortLocalization Effort
Interaural Time Difference (ITD)
Estimated from time difference between spikes of two matching channels.
Interaural Intensity Difference (IID)
Difference of spike counts between two cochleae.
Azimuth: Combination of ITD and ILD
![Page 3: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/3.jpg)
Audio WorkgroupAudio Workgroup
Localization EffortLocalization Effort
![Page 4: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/4.jpg)
Audio WorkgroupAudio Workgroup
Relational Network (Simple)Relational Network (Simple)
X Y
Z
MM
X
M
Y
M
Z
m
Patches of neuronsEach measureone quantityBidirectionalrelations for feedback/feedforward
![Page 5: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/5.jpg)
Audio WorkgroupAudio Workgroup
Relational Network (example)Relational Network (example)
Input hereRelation specification
Relational feedback
RelationFeedback
![Page 6: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/6.jpg)
Audio WorkgroupAudio Workgroup
ASR Relational NetworkASR Relational Network
Cochlea
Delay
Phone Recognizer
Phone Recognizer
Word Recognizer
A patch of neurons(one of N output)
We don’t know how to represent time
![Page 7: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/7.jpg)
Audio WorkgroupAudio Workgroup
ASR AdvantagesASR Advantages
Not an HMM
Top-Down, Bottom-Up Hypothesis
Hallucinate
![Page 8: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/8.jpg)
Audio WorkgroupAudio Workgroup
Silicon CochleaSilicon Cochlea
Ganglion cells
Basilar membrane
highfrequency
lowfrequency
Inner hair cells
(van Schaik, Liu, 2004)
BASILAR MEMBRANE
INNER HAIR CELLS
GANGLION CELLS
![Page 9: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/9.jpg)
Audio WorkgroupAudio Workgroup
Silicon CochleaSilicon Cochlea
Tone raster plots
Vowel Rate Profiles
![Page 10: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/10.jpg)
Audio WorkgroupAudio Workgroup
Learning ChipLearning Chip
Architecture
Tone Rasters?
Vowel Rasters
Learning Algorithm
Alternative LearningStatistics
LeastSquares
![Page 11: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/11.jpg)
Audio WorkgroupAudio Workgroup
LSM RecognizerLSM Recognizer
![Page 12: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/12.jpg)
Audio WorkgroupAudio Workgroup
Infrastruture DifficultiesInfrastruture Difficulties
RemapperReplace with Matlab
Power ?
Sharing chips?
PC replacement
![Page 13: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/13.jpg)
Audio WorkgroupAudio Workgroup
FPAA/MoteFPAA/Mote
![Page 14: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/14.jpg)
Audio WorkgroupAudio Workgroup
Word RecognizerWord Recognizer
Four example raster plot (silence, A_, A_ with relational, AI)
![Page 15: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/15.jpg)
Audio WorkgroupAudio Workgroup
QuickTime™ and aTIFF (Uncompressed) decompressor
are needed to see this picture.
Software SimulationSoftware Simulation
![Page 16: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/16.jpg)
Audio WorkgroupAudio Workgroup
Behind the CurtainBehind the Curtain
![Page 17: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/17.jpg)
Audio WorkgroupAudio Workgroup
Hardware OverviewHardware Overview
Cochlea
Cochlea
Remapper(in Matlab)
Learning
GiacomoGiacomo
PhonemeWord
skype
PCI-AER (for remapping)
PCI-AER (for remapping)
![Page 18: Audio Workgroup Neuro-inspired Speech Recognition](https://reader035.vdocuments.site/reader035/viewer/2022081602/55149094550346f06e8b51ea/html5/thumbnails/18.jpg)
Audio WorkgroupAudio Workgroup