语音信号处理 speech signal processing 教材:数字语音处理 ...
Post on 16-Mar-2016
200 Views
Preview:
DESCRIPTION
TRANSCRIPT
语音信号处理
Speech Signal Processing 教材:数字语音处理
华中科技大学出版社
参考书:语音信号数字处理
电子工业出版社
From prehistory to the new media of the future, speech has been and will be a primary form of communication between humans.
Speech will become the next major trend in the personal computer market in the near future.
学习的目标:
保障现代通信领域中人与人之间、人与机器之间的信息交换的顺畅和自然。
Representation of speech signals
waveform parametric
preamplifier
anti-aliasing filter
A/D conversion
语音信号产生的数字模型Speech Production Model
Discrete
System
SpeechExcitation
根据人类语音的发音的生理过程和语音信号的声学特征来建立。
☆短时时域处理技术☆短时傅立叶分析☆语音信号的线性预测 Linear Predication
☆语音信号的同态滤波和倒谱分析☆矢量量化技术 Vector Quantification
☆隐含马尔可夫模型 HMM(Hide Markov Model)
语音信号的处理方Methods语音信号是时变信号,只是在一个短时短内可近似看作是平稳信号。
语音信号的应用 Application 语音编码技术 Speech Coding 语音合成技术 Speech Synthesis 语音识别技术 Speech Recognition 语音增强技术 Speech Enhancement声源 定位技术 Speaker Tracking/ Sound Source Localization
语音信号处理涉及语言学、声学、认知科学、生理学、心理学和数理统计
等多学科知识。用数字化的方法进行语
音的传送、存储、合成、识别和增强等
技术是整个数字化通信网中最重要最基
本的组成部分。
Speech Coding
Digital Channel
The goal is reduce the information rate, measured in bits per second, while maintaining the quality of the original speech waveform.
PCM:64kb/s waveform coder
LPC-10:2.4kb/s parametric coder
GSM:13kb/s hybrid coder
欢迎使用微软中国研究院中文语音合成系统
采用 TTS技术合成语音http://research. Microsoft.com/speech/tts
Speech Synthesis
金山词霸 /电子词典
从音乐中提取有用信号
Speech Enhancement
the goal is to improve the quality of degraded speech.
Speech Recognition
IBM语音识别软件
SONY推出的 QRIO——家庭娱乐机器人
"白雪公主 "
Furby
Sound card enables the computer to Record sound from a microphone Output sound through a speaker Manipulate sound stored on a disk
Sound card contains A/D and D/A converters Sampling rate generator
Most sound cards use DMA to read and write audio data
声卡的功能
Two types of audio file formats: Raw format, no header
raw Self-describing format, with a file header
WavMidiAiff, au, vocMpegRm, ra, ram
常见的声音文件
Audio file parameters Sampling rate Bits per sample Channels
可在 MATLAB中用 wavread函数将*.wav的文件打开,将所有数据读入一个数组中,数组中每个数的范围在 [-1,1]之间。
例如: [y,fs,Nbits]=wavread(‘c:\tts_30250.wav’);
运行结果: fs=16kHz
Nbits=16
y=[…..,0.0002,-0.0009,0.0081,…..]
常见的语音软件
Cool Edit Pro
Adobe Audition
Cool Edit Pro 2.0.lnk
THANKS
top related