ways to generate computer speech record a human speaking every sentence hal will ever speak (not...

6

Ways to generate computer speech • Record a human speaking every sentence HAL will ever speak (not likely) • Make a mathematical model of the human vocal tract (synthesis) • Record a human speaking a lot of sentences, and come up with some way of making new sentences out of the recorded ones (concatenation)

Upload: kelley-holt

Post on 18-Jan-2016

218 views

Category:

Documents

0 download

Report

Download

Tags:

Embed Size (px):

TRANSCRIPT

Page 1: Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal

Ways to generate computer speech

• Record a human speaking every sentence HAL will ever speak (not likely)

• Make a mathematical model of the human vocal tract (synthesis)

• Record a human speaking a lot of sentences, and come up with some way of making new sentences out of the recorded ones (concatenation)

Page 2: Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal

What goes into synthesizing speech?

• Have some idea of what human speech actually looks/sounds like– Modeling the shape of a speaker’s mouth– Fricative noises and noises from stops– Pitch changes

• Produce sounds that resemble speech sounds

Page 3: Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal

Synthesis: Putting it all together

• Shape of mouth: 1: 2: 3: all 3:

• Fricative and burst noises:• Shape of mouth and fricative noises:• Shape of mouth, fricative noises, & pitch:

Page 4: Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal

Speech synthesis

• (1980): The Speak & Spell toy used a synthesis process called Linear Predictive Coding (LPC).

• Basically, LPC is a way for a computer to extract all of the different parts of speech from a speech signal, and re-create them using a mathematical model of the vocal tract

• Here’s a better example of LPC (1982):

• LPC is used today for GSM phone systems

Page 5: Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal

Text-to-Speech (TTS) systems• Concatenative synthesis

– Record natural speech– Chop speech up into units– Recombine units according to the phonetic

transcription to be pronounced

• Steps for a TTS system:– Start w/ written text– Convert text to phonetic characters– Find segments of speech in database– Calculate intonation of sentence

Page 6: Ways to generate computer speech Record a human speaking every sentence HAL will ever speak (not likely) Make a mathematical model of the human vocal

Text-to-Speech (TTS) systems

Examples of text from The North Wind and the Sun (Aesop), circa 2005:

• Mike (AT&T)

• Crystal (AT&T)

• British English (Rhetorical Systems)

• Scottish English (Rhetorical Systems)

Optimality Theory and Human Sentence Processing: Towards a … · 2011-04-09 · Optimality Theory and Human Sentence Processing: Towards a Cross-Modular Analysis of Coordination

Structure of Human Speech Chris Darwin Vocal Tract

Comparative Anatomy of the Baboon and Human Vocal Tracts

ULTIMATE VOCAL )ULTIMATE VOCAL ) ULTIMATE VOCAL )

Matching human vocal imitations to birdsong: An ... · Matching human vocal imitations to birdsong: An exploratory analysis Kendra Oudyk1,2, Yun-Han Wu3, Vincent Lostanlen3,4, Justin

Sentence Structure Sentence Types. Sentence Structure Sentence Types

A Study of Optimality Theory and the Human Sentence

Acoustical Measurement of the Human Vocal Tract: Quantifying Speech & Throat-Singing

ME - isbem.org · human vocal tract properties. INTRODUCTION The human voice is produced in the larynx and passes through the vocal tract of specific acoustic properties. These properties

A Dynamic Model of the Human Vocal Tract - Aucklandhomepages.engineering.auckland.ac.nz/~jgre007/Powerpoint and... · A Dynamic Model of the Human Vocal Tract ... we reproduced the

Sentence & Sentence Fragments

Phonetics & Phonology... 6 Articulatory Phonetics Study of how speech sounds are produced by human vocal apparatus Anatomy of vocal organs Air stream Mechanism Voicing 9 Pulmonic Sounds

Treatment of vocal fold scarring with autologous bone ... · Treatment of vocal fold scarring with autologous bone marrow-derived human mesenchymal stromal cells—first phase I/II

Cognitive control of orofacial motor and vocal responses ... · vocal actions. Ventrolateral area 44 (a key component of the Broca’s language production region in the human brain)

The Marvels of the Human Voice: Poem-Melody-Vocal Performance

THE SIMPLE SENTENCE Key Concepts: Phrase, Clause, Sentence, Simple Sentence, Complex Sentence, Compound Sentence

The Vocal Joystick: A Voice-Based Human-Computer Interface for … · 2005-07-08 · The Vocal Joystick: A Voice-Based Human-Computer Interface for Individuals with Motor Impairments

Human Non-linguistic Vocal Repertoire: Call Types and Their … · 2018-02-16 · ORIGINAL PAPER Human Non-linguistic Vocal Repertoire: Call Types and Their Meaning Andrey Anikin1

Vocal 1: Vocal 2

Human Vocal Sentiment Analysis - arxiv.org · Human Vocal Sentiment Analysis Andrew Huang , Martin (Puwei) Bao New York University Shanghai {a.huang, pb1713}@nyu.edu Abstract In this

coquimatlan.gob.mxcoquimatlan.gob.mx/transparencias/ayuntamiento/Decima... · 2020. 1. 30. · 7MO Vocal 8V0 VOCAL 9NO VOCAL (OMO VOCAL suplente 11V0 VOCAL suplente CONTRALOR MUNICIPAL

3D multiscale imaging of human vocal folds using

Abbreviated title: Summary sentence: Key words Grant support · 2009-11-18 · Abbreviated title: microRNA regulation in the human endometrium Summary sentence: Global microRNA and

Baglama & Vocal - Baglama & Vocal

The Interplay of Prosody and Syntax in Sentence Processing ... · investigate auditory sentence comprehension, although this is by far the most common way of human communication

A Minimalist Theory of Human Sentence Processing

JUNTA DIRECTIVA – PERIODO 1965 - 1966 CARGO …...Vocal Moscardi Pasquale Vocal Orioli Battista Vocal Rasetta Antonio Vocal Rispetti Giuseppe Vocal Salterini Vittorio Vocal Sano

‘KNO FEBRUARY2018 - wkno.org · Movies “Human Vocal Production” 11:00 Great British Baking Show Season 3 “Pastry” SATURDAY, 3 7:00 Classic Gospel “Gaither Vocal Band:

HUMAN AND MACHINE RECOGNITION OF THE VOCAL … · HUMAN AND MACHINE RECOGNITION OF THE VOCAL CHARACTERISTICS OF SUICIDE BY Abhraneel Sinha Thesis Submitted to the Faculty of the Graduate

Constraints and flexibility during vocal development ... · (a) Human Vocal apparatus Muscles Nervous system Social interactions Vocal apparatus Muscles Nervous system Social interactions

Characterisation of the Elasticity of the Human Vocal Fold using Electromechanical Measurement Techniques

In Vivo Optical Coherence Tomography of the Human Larynx ... · lial thickness of laryngeal subsites was calculated: true vocal cord (129 m), false vocal cords (124 m), aryepiglottic

L 17 The Human Voice. The Vocal Tract epiglottis

Simple Sentence Compound Sentence Complex Sentence ...stevenwilkie.weebly.com/uploads/9/8/8/0/9880497/sentences.pdf · •Simple Sentence •Compound Sentence •Complex Sentence

VOCAL SOLOS VOCAL SOLOS ACCOMPANIED VOCAL DUETS ... · 1 secular hebrew – 12/16/2016 vocal solos vocal solos accompanied vocal duets vocal duets accompanied vocal trios accompanied