spectral analysis, part ii

31
Spectral Analysis, part II March 6, 2014

Upload: tashya-stevenson

Post on 01-Jan-2016

60 views

Category:

Documents


9 download

DESCRIPTION

Spectral Analysis, part II. March 6, 2014. Friendly Reminders. For those who weren’t here last time: The production exercise #2 scores will be added to the mid-term scores. Korean stops exercise is due Tuesday. Any questions so far? - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Spectral Analysis, part II

Spectral Analysis, part II

March 6, 2014

Page 2: Spectral Analysis, part II

Friendly Reminders• For those who weren’t here last time:

• The production exercise #2 scores will be added to the mid-term scores.

• Korean stops exercise is due Tuesday.

• Any questions so far?

• I messed up the due date on the fourth course project report: it will be due on the 25th.

• There is a mystery spectrogram, and it is waiting for you.

• Today’s goal: finish figuring out where spectrograms come from.

• And then maybe talk a little bit about resonance, too.

Page 3: Spectral Analysis, part II

Fourier Analysis• Building up a complex wave from sinewave components is straightforward…

• Breaking down a complex wave into its spectral shape is a little more complicated.

• In our particular case, we will look at:

• Discrete Fourier Transform (DFT)

• Also: Fast Fourier Transform (FFT) is used often in speech analysis

• Basically a more efficient, less accurate method of DFT for computers.

Page 4: Spectral Analysis, part II

Spectral Slices• The first step in Fourier Analysis is to window the signal.

• I.e., break it all up into a series of smaller, analyzable chunks.

• This is important because the spectral qualities of the signal change over time.

a “window”

• Check out the typical window length in Praat.

Page 5: Spectral Analysis, part II

The Basic Idea• For the complex wave extracted from each window...

• Fourier Analysis determines the frequency and intensity of the sinewave components of that wave.

• Do this about 1000 times a second,

• turn the spectra on their sides,

• and you get a spectrogram.

Page 6: Spectral Analysis, part II

Possible Problems• What would happen if a waveform chunk was windowed like this?

• Remember, the goal is to determine the frequency and intensity of the sinewave components which make up that slice of the complex wave.

Page 7: Spectral Analysis, part II

The Usual Solution• The amplitude of the waveform at the edges of the window is normally reduced...

• by transforming the complex wave with a smoothing function before spectral analysis.

• Each function defines a particular window type.

• For example: the “Hanning” Window

Page 8: Spectral Analysis, part II

• There are lots of different window types...

• each with its own characteristic shape

Hamming Bartlett Gaussian

Hanning Welch Rectangular

Page 9: Spectral Analysis, part II

Window Type Ramifications• Play around with the different window types in Praat.

Page 10: Spectral Analysis, part II

Ideas• Once the waveform has been windowed, it can be boiled down into its component frequencies.

• Basic strategy:

• Determine whether the complex wave correlates with sine (and cosine!) waves of particular frequencies.

• Correlation measure: “dot product”

• = sum of the point-by-point products between waves.

• Interesting fact:

• Non-zero correlations only emerge between the complex wave and its harmonics!

• (This is Fourier’s great insight.)

Page 11: Spectral Analysis, part II

A Not-So-Complex Example• Let’s build up a complex wave from 8 samples of a 1 Hz sine wave and a 4 Hz cosine wave.

• Note: our sample rate is 8 Hz.

1 2 3 4 5 6 7 8

A 1 Hz 0 .707 1 .707 0 -.707 -1 -.707

B 4 Hz 1 -1 1 -1 1 -1 1 -1

C Sum: 1 -.293 2 -.293 1 -1.707 0 -1.707

• Check out a visualization.

Page 12: Spectral Analysis, part II

Correlations, part 1• Let’s check the correlation between that wave and the 1 Hz sinewave component.

1 2 3 4 5 6 7 8

C Sum: 1 -.293 2 -.293 1 -1.707 0 -1.707

A 1 Hz: 0 .707 1 .707 0 -.707 -1 -.707

C*A Dot: 0 -.207 2 -.207 0 1.207 0 1.207

• The sum of the products of each sample is 4.

• This also happens to be the dot product of the 1 Hz wave with itself.

• = its “power”

Page 13: Spectral Analysis, part II

Correlations, part 2• Let’s check the correlation between the complex wave and a 2 Hz sinewave (a non-component).

1 2 3 4 5 6 7 8

C Sum: 1 -.293 2 -.293 1 -1.707 0 -1.707

D 2 Hz: 0 1 0 -1 0 1 0 -1

C*D Dot: 0 -.293 0 .293 0 -1.707 0 1.707

• The sum of the products of each sample is 0.

• We now know that 2 Hz was not a component frequency of the complex wave.

Page 14: Spectral Analysis, part II

Correlations, part 3• Last but not least, let’s check the correlation between the complex wave and the 4 Hz cosine wave.

1 2 3 4 5 6 7 8

C Sum: 1 -.293 2 -.293 1 -1.707 0 -1.707

B 4 Hz 1 -1 1 -1 1 -1 1 -1

C*B Dot: 1 .293 2 .293 1 1.707 0 1.707

• The sum of the products of each sample is 8.

• Yes, 8 happens to be the dot product of the 4 Hz wave with itself.

• its “power”

Page 15: Spectral Analysis, part II

Mopping Up• Our component analysis gave us the following dot products:

• C*A = 4 (A = 1 Hz sinewave)

• C*D = 0 (D = 2 Hz sinewave)

• C*B = 8 (B = 4 Hz cosine wave)

• We have to “normalize” these products by dividing them by the power of the “reference” waves:

• power (A) = A*A = 4 C*A/A*A = 4/4 = 1

• power (D) = D*D = 4 C*D/D*D = 0/4 = 0

• power (B) = B*B = 8 C*B/B*B = 8/8 = 1

• These ratios are the amplitudes of the component waves.

Page 16: Spectral Analysis, part II

Let’s Try Another• Let’s construct another example: 1 Hz sinewave + a 4 Hz cosine wave with half the amplitude.

1 2 3 4 5 6 7 8

A 1 Hz 0 .707 1 .707 0 -.707 -1 -.707

.5*B 4 Hz .5 -.5 .5 -.5 .5 -.5 .5 -.5

E Sum: .5 .207 1.5 .207 .5 -1.207 -.5 -1.207

• Let’s check the 1 Hz wave first:

E Sum: .5 .207 1.5 .207 .5 -1.207 -.5 -1.207

A 1 Hz 0 .707 1 .707 0 -.707 -1 -.707

E*A Dot: 0 .146 1.5 .146 0 .854 .5 .854

• Sum = 4

Page 17: Spectral Analysis, part II

Yet More Dots• Another example: 1 Hz sinewave + a 4 Hz cosine wave with half the amplitude.

• Now let’s check the 4 Hz wave:

E Sum: .5 .207 1.5 .207 .5 -1.207 -.5 -1.207

B 4 Hz 1 -1 1 -1 1 -1 1 -1

E*B Dot: .5 -.207 1.5 -.207 .5 1.207 -.5 1.207

• The sum of these products is also 4.

• = half of the power of the 4 Hz cosine wave.

• The 4 Hz component has half the amplitude of the 4 Hz cosine reference wave.

• (we know the reference wave has amplitude 1)

Page 18: Spectral Analysis, part II

Mopping Up, Part 2• Our component analysis gave us the following dot products:

• E*A = 4 (A = 1 Hz sinewave)

• E*B = 4 (B = 4 Hz cosine wave)

• Let’s once again normalize these products by dividing them by the power of the “reference” waves:

• power (A) = A*A = 4 E*A/A*A = 4/4 = 1

• power (B) = B*B = 8 E*B/B*B = 4/8 = .5

• These ratios are the amplitudes of the component waves.

• The 1 Hz sinewave component has amplitude 1

• The 4 Hz cosine wave component has amplitude .5

Page 19: Spectral Analysis, part II

Footnote• Sinewaves and cosine waves are orthogonal to each other.

• The dot product of a sinewave and a cosine wave of the same frequency is 0.

1 2 3 4 5 6 7 8

A sin 0 .707 1 .707 0 -.707 -1 -.707

F cos 1 .707 0 -.707 -1 -.707 0 .707

A*F Dot: 0 .5 0 -.5 0 .5 0 -.5

• However, adding cosine and sine waves together simply shifts the phase of the complex wave.

• Check out different combos in Praat.

Page 20: Spectral Analysis, part II

Problem #1• For any given window, we don’t know what the phase

shift of each frequency component will be.

• Solution:

1. Calculate the amplitude of the sinewave

2. Calculate the amplitude of the cosine wave

3. Combine the resulting amplitudes with the pythagorean theorem:

At = Asin2 + Acos

2

• Take a look at the java applet online:

• http://www.phy.ntnu.edu/tw/ntnujava/index.php?topic=148

Page 21: Spectral Analysis, part II

Sine + Cosine Example• Let’s add a 1 Hz cosine wave, of amplitude .5, to our previous combination of 1 Hz sine and 4 Hz cosine waves.

1 2 3 4 5 6 7 8

C 1+4: 1 -.293 2 -.293 1 -1.707 0 -1.707

.5*F cos .5 .353 0 -.353 -.5 -.353 0 .353

G Sum: 1.5 .06 2 -.646 .5 -2.06 0 -1.353

• Let’s check the 1 Hz sine wave again:

G Sum: 1.5 .06 2 -.646 .5 -2.06 0 -1.353

A 1 Hz 0 .707 1 .707 0 -.707 -1 -.707

G*A Dot: 0 .043 2 -.457 0 1.457 0 .957

• Sum = 4

Page 22: Spectral Analysis, part II

Sine + Cosine Example• Now check the 1 Hz cosine wave:

G Sum: 1.5 .06 2 -.646 .5 -2.06 0 -1.353

F 1 Hz 1 .707 0 -.707 -1 -.707 0 .707

G*F Dot: 1.5 .043 0 .457 -.5 1.457 0 -.957

• Sum = 2

• Sinewave component amplitude = 4/4 = 1

• Cosine wave component amplitude = 2/4 = .5

• Total amplitude =

(1*1) + (.5* .5) =1.118

• Check out the amplitude of the combo in Praat.

Page 23: Spectral Analysis, part II

In Sum• To perform a Fourier analysis on each (smoothed) chunk

of the waveform:

1. Determine the components of each chunk using the dot product--

• Components yield a dot product that is not 0

• Non-components yield a dot product that is 0

2. Normalize the amplitude values of the components

• Divide the dot products by the power of the reference wave at that frequency

3. If there are both sine and cosine wave components at a particular frequency:

• Combine their amplitudes using the Pythagorean theorem

Page 24: Spectral Analysis, part II

Hold On A Second...• What would happen if our window length was 7 samples long, instead of 8?

• Back to the 1 Hz and 4 Hz wave combo:

1 2 3 4 5 6 7

C: 1 -.293 2 -.293 1 -1.707 0

2 Hz 0 1 0 -1 0 1 0

Dot: 0 -.293 0 .293 0 -1.707 0

• The sum of these products is -1.707, not 0. (!?!)

• The Fourier approach only works for sinewaves that can fit an integer number of cycles into the window.

Page 25: Spectral Analysis, part II

Frequency Range• Q: What frequencies can we consider in the Fourier analysis?

• One possible (but unrealistic) setup:

• A window length of .25 seconds

• A sampling rate of 20,000 Hz

• (Note: 5,000 samples fit into a window)

• Longest period = .25 seconds, so:

• Lowest frequency component = 1 / 0.25 = 4 Hz

• Nyquist frequency = 10,000 Hz.

• A: We can check all frequencies from 4 to 10,000, in steps of 4 Hz.

• (10,000 / 4 = 250 possible frequencies)

Page 26: Spectral Analysis, part II

Frequency Range, Part 2• Q: What frequencies can we consider in the Fourier analysis?

• Another, more realistic possible setup:

• A window length of .005 seconds

• A sampling rate of 20,000 Hz

• (Note: 100 samples fit into a window)

• Longest period = .005 seconds, so:

• Lowest frequency component = 1 / .005 = 200 Hz!

• Nyquist frequency = 10,000 Hz.

• A: from 200 to 10,000, in steps of 200 Hz.

• (10,000 / 200 = 50 possible frequencies)

Page 27: Spectral Analysis, part II

Zero Padding• With short window lengths, we miss out on a lot of interesting frequencies…

• The solution is to “pad” the window with zeroes, until it’s long enough to enable us to look at an interesting frequency range.

• Example:

1 2 3 4 5 6 7 8

Sum: 1 -.293 2 -.293 1 -1.707 0 0

• Q: What effect do you think this would have on the power spectrum?

• Component frequencies have a reduced amplitude.

• Non-component frequencies have a non-zero amplitude.

Page 28: Spectral Analysis, part II

Industrial Smoothing• Zero-padding “smooths” the spectrum.

• Spectral analysis of complex wave formed by 1 Hz and 4 Hz waves, with an 8 Hz sampling rate:

8 sample window 7 sample window, with zero padding

0

0.2

0.4

0.6

0.8

1

1 2 3 4

Frequency (Hz)

Amplitude

0

0.2

0.4

0.6

0.8

1

1 2 3 4

Frequency (Hz)

Amplitude

Page 29: Spectral Analysis, part II

Another Example• Q: What would happen if we padded the window out to 16 samples?

• A: More frequencies we can check (resolution = .5 Hz)

• Also: even more smoothing

• What would happen if we increased the sampling rate?

• Upper end of analyzable frequency range increases

• ( higher Nyquist frequency) 7 sample window, with zero-

padding, 16 Hz sampling rate

0

0.1

0.2

0.3

0.4

0.5

0.6

0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8

Frequency (Hz)

Amplitude

Page 30: Spectral Analysis, part II

Trade-Offs• What happens if we increase the window length?

• (independent of zero padding)

• A: Increase the maximum analyzable period, so:

• Better frequency resolution

• ...without the smoothing.

• However:

• Temporal resolution is worse.

• (because the window length is less precise)

• Check it out in Praat.

Page 31: Spectral Analysis, part II

Morals of the Fourier Story• Shorter windows give us:

• Better temporal resolution

• Worse frequency resolution

• = wide-band spectrograms

• Longer windows give us:

• Better frequency resolution

• Worse temporal resolution

• = narrow-band spectrograms

• Higher sampling rates give us...

• A higher limit on frequencies to consider.