![Page 1: Introduction to Speech Recognition Software Scott A. Dougherty, M.Ed. Cricket Rizzo, MS, OTR/L, ATP 2009 AT-OT-PT Summer Institute Tuesday, June 16, 2009](https://reader034.vdocuments.site/reader034/viewer/2022051820/56649d9e5503460f94a891e1/html5/thumbnails/1.jpg)
Introduction to Speech Recognition Software
Scott A. Dougherty, M.Ed.
Cricket Rizzo, MS, OTR/L, ATP
2009 AT-OT-PT Summer Institute
Tuesday, June 16, 2009
1:30 – 4:30 pm
About the Presentation
• Links are programmed into the globes at the lower right corner of the slide
• Links are noted in the slide’s notes
• The presentation is available electronically for download
• Information on pricing and features was current as of May 2009 (i.e., use at your own peril)
About the Presenters
• Cricket Rizzo, MS, OTR/L, ATP
• Scott A. Dougherty, M.Ed.
• Questions are welcome at any time
• We like to pick on the people in the back row
• We plan to enjoy these three hours – we hope you do, too
Session Description
• Explore the use of speech recognition software for writing
• Explore the use of speech recognition software for computer access
• Discuss common software and hardware
• Demonstrate microphone initialization, dictation, and computer control
• Share typical scenarios and obstacles
Definitions
AT SERVICES
“Any services that directly assist in the selection, acquisition, or use of an assistive technology device.”
AT DEVICE
“Any item, piece of equipment, or product system, whether acquired commercially off the shelf, modified or customized, that is used to increase, maintain, or improve the functional capabilities of individuals with disabilities.”
(PL 100-407, Section 3, 1988)
Assistive Technology in Legislation
• IDEIA 2004 (34 CFR Parts 300 and 301)
• Early Intervention Act (PL-99-336)
• Technology-Related Assistance for Individuals with Disabilities Act of 1988 or “The Tech Act” (PL-100-407)
• Americans with Disabilities Act (PL-101-336)
• Entitlement Legislation:
– Rehabilitation Act of 1973 (PL-93-112, as amended)
– Rehabilitation Act Amendments of 1998
• Section 508 compliance
Why explore speech recognition in schools?
• My student has difficulty accessing the computer.
• My student has difficulty using a pencil and/or keyboard.
• My student has anxiety about writing with a pencil and/or keyboard.
• I use it in my law/medical practice. It works beautifully and I believe my Kindergartener needs it, too.
How does it work?
• Speech converted from analog to digital
• Signal broken into phonemes
• Phoneme pattern compared to dictionary
• Matching term is displayed (text) or used (command)
HowStuffWorks, Inc.
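The four steps above can be sketched as a toy pipeline. This is an illustration only, not how any product on these slides is implemented: commercial recognizers rely on statistical acoustic and language models, and the phoneme spellings, vocabulary, and function names below (`digitize`, `match`, `handle`) are invented for this sketch.

```python
# Toy illustration of: digitize -> phonemes -> dictionary lookup -> text or command.
# The phoneme spellings and vocabulary are assumptions made up for this example.

PHONEME_DICTIONARY = {
    ("HH", "EH", "L", "OW"): ("text", "hello"),
    ("W", "ER", "L", "D"): ("text", "world"),
    ("S", "K", "R", "AE", "CH", "DH", "AE", "T"): ("command", "scratch that"),
}


def digitize(analog_samples, levels=256):
    """Quantize samples in the range [-1.0, 1.0] to integer levels (analog to digital)."""
    half = levels // 2
    return [max(-half, min(half - 1, round(s * half))) for s in analog_samples]


def match(phonemes):
    """Compare a phoneme pattern to the dictionary; return (kind, term) or None."""
    return PHONEME_DICTIONARY.get(tuple(phonemes))


def handle(phonemes, document):
    """Append matching text to the document, or apply a matching command to it."""
    result = match(phonemes)
    if result is None:
        return document  # no match: leave the document unchanged
    kind, term = result
    if kind == "text":
        return document + [term]
    if term == "scratch that":
        return document[:-1]  # simplification: remove only the last dictated word
    return document


doc = handle(["HH", "EH", "L", "OW"], [])
doc = handle(["W", "ER", "L", "D"], doc)
print(" ".join(doc))
```

The same lookup decides whether a match is displayed as text or used as a command, which is why a dictated word and a spoken command can be confused when their sound patterns are similar.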
Microsoft Speech Recognition
• Included as part of Microsoft Windows XP and Vista
• Start > Control Panel > Speech
• Utilizes the Dragon NS engine
Dragon NaturallySpeaking Professional, Medical, & Legal
• Accurate recognition
• Create and edit documents
• Email, IM, and web by voice
• Voice shortcuts
• Audio recorder support
• Network compatible for multiple users
• Robust commands
• Third-party correction (L&M)
• Supportable on Electronic Medical Records systems (M)
Dragon NaturallySpeaking Preferred
• Accurate recognition
• Create and edit documents
• Email, IM, and web by voice
• Voice shortcuts
• Audio recorder support
Dragon NaturallySpeaking Standard
• Accurate recognition
• Create and edit documents
• Email, IM, and web by voice
IBM ViaVoice
• Editions
– Mac OS X
– Pro USB
– Advanced
– Standard
– Personal
• May be a good option for “legacy” computers
SpeakQ
• Continuous recognition
• Discrete recognition
• Requires installation of WordQ version 2
• Simultaneous use of keyboard is encouraged
MacSpeech Dictate
• Replaced iListen software
• Support for audio recorders is pending
• Dictation and command capability
• Read and edit documents that were not initially created with MacSpeech Dictate
Hardware Considerations
• Analog microphone
• Digital microphone
• Desktop array
• Analog voice recorder
• Digital voice recorder
• Jouse
• Head pointer
– Tracker Pro
– HeadMouse
Hardware – Analog Microphone
• 3.5 mm (1/8”) connection
• One connection for headphone, second connection for microphone
• Connects to sound card directly or via noise-cancelling adapter
• Boom maintains distance from mouth
Hardware – Digital Microphone
• Male USB connection
• One connection for headphone and microphone
• Connects via USB port
• Boom maintains distance from mouth
Hardware – Desktop Array
• May be placed on monitor, desk, or shelf
• Ideal for use at a dedicated station
• Eliminates issues with head microphones
• Adjustable spectrum of reception
Hardware – Analog Voice Recorder
• Tape-based
• Incapable of allowing revision of recordings
• Generally not compatible with speech recognition software – verify before you buy
Hardware – Digital Voice Recorder
• Chip-based
• Allows for inserts and deletions
• Compatibility with speech recognition varies
Hardware – Jouse
• Alternative means of input
• Useful for mouse control and keyboarding
• Another hands-free interface for the computer
Hardware – Head Pointer
• Hands-free option
• Keyboard access is necessary or desired for use with some speech recognition software
Hardware – Head Pointer
• Tracker Pro and HeadMouse
– use a reflective dot on the forehead to reflect input (head movement) back to a camera mounted on the computer; the camera translates head movement into mouse movement
– need another means to perform clicks, double clicks, right clicks, and drag
• Software, e.g., Magic Cursor, Dragger 32
– Can dwell or click with a switch
Hardware – Head Pointer
• HeadMouse Extreme
• Tracker Pro
Dictation…What’s the process?
• Initialization
– Audio detection
– Speech recognition
• Voice profile creation
• Dictation
– Continuous
– Discrete
• Editing
Brian Basset and Microsoft Corporation
Initialization
• Provides guidance on microphone placement
• Tests the microphone connection
• Analyzes ambient noise levels
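As a rough illustration of what an ambient noise check might compute, the sketch below compares the level of a short "silence" recording with a speech sample. It is not taken from any of the products discussed; the function names and the 15 dB threshold are assumptions chosen for the example, and samples are assumed to be already digitized to the range -1.0 to 1.0.

```python
import math


def rms_dbfs(samples):
    """RMS level of samples in [-1.0, 1.0], in decibels relative to full scale."""
    if not samples:
        raise ValueError("no samples")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return -math.inf if rms == 0 else 20 * math.log10(rms)


def is_quiet_enough(noise_samples, speech_samples, min_snr_db=15.0):
    """True if speech is at least min_snr_db louder than the room noise.

    The 15 dB default is an arbitrary value for this sketch, not a vendor figure.
    """
    return rms_dbfs(speech_samples) - rms_dbfs(noise_samples) >= min_snr_db


# A quiet room: noise is far below the speech level.
print(is_quiet_enough([0.01] * 10, [0.5] * 10))
# A noisy room: noise is nearly as loud as the speech.
print(is_quiet_enough([0.4] * 10, [0.5] * 10))
```

A check along these lines is one reason the initialization step may ask the user to stay silent for a few seconds and then read a sentence aloud.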
Voice Profile Creation
• Provides sample text for dictation
• Builds a profile that matches speech sounds to expected letter patterns
• May be skipped in some software
• Available for subsequent accuracy trainings
Dictation
• Continuous dictation
– Speak in phrases, sentences, or words
• Discrete dictation
– Input text with microphone
– Select dictation with keyboard
• Punctuation may be inserted manually or automatically
Editing
• Scratch That
• Select/Correct/Delete That
• Train that
• Bold/Italicize/Underline that
• Click [menu item]
• Move [direction] [number] [character/word]
• Move mouse [direction]
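The bracketed slots in commands such as "Move [direction] [number] [character/word]" behave like a small grammar. The sketch below shows one hypothetical way to parse that single command shape from recognized text. The pattern, the function name `parse_move`, and the assumption that the number arrives as digits are all choices made for this example, not the behavior of any specific product.

```python
import re

# Hypothetical grammar for one command shape from the slide:
#   Move [direction] [number] [character/word]
MOVE_PATTERN = re.compile(
    r"^move (left|right|up|down) (\d+) (characters?|words?)$",
    re.IGNORECASE,
)


def parse_move(utterance):
    """Return (direction, count, unit) for a recognized move command, else None."""
    m = MOVE_PATTERN.match(utterance.strip())
    if m is None:
        return None  # not a move command: treat the utterance as dictated text
    direction, count, unit = m.groups()
    return direction.lower(), int(count), unit.lower().rstrip("s")


print(parse_move("Move left 3 words"))
print(parse_move("hello world"))
```

Anything that does not fit the command pattern falls through as ordinary dictation, which is why users are trained to speak commands as a single phrase without pausing mid-command.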
Training for Computer Access
• Complete tutorials provided with software
• Dictate from book or magazine for practice, including punctuation (e.g., , . ? “) and formatting
• Dictate from “off the top of your head”
– Usually takes more time when dictating from thoughts rather than hard copy
– Watch for pauses, umms, uhhs, mumbling
• Dictation
• Formatting
• Use in applications, e.g., Word, e-mail, web access
• Timeframe varies with cognitive abilities and computer expertise; typical user – 20 hours to cover basics
Talking Points
• Initializing struggling readers
• Use as a note taking tool
• Editing errors – keyboard or the mouse?
• Multiple profiles for a single user
• Computer access strategies
• Dragon NaturallySpeaking on a Mac?
• Use at school and home
More Talking Points
• “I want all of the teachers to dictate their lectures”
• Dictating “under the weather”
• Matching microphones to users and environments
• Test-taking
Contact information
Scott A. Dougherty
IDEA Training and Consultation Coordinator, AT
Allegheny Intermediate Unit #3
475 East Waterfront Drive
Homestead, PA 15120-1144
Cricket Rizzo, MS, OTR/L, ATP
Occupational Therapist
Westmoreland Intermediate Unit #7
102 Equity Drive
Greensburg, PA 15601
[email protected]
(724) 836-2460, ext. 2193
These presentation materials can be downloaded at http://www.aiu3.net/Level3.aspx?id=3822