new final reportura-reports/101/vanhoy... · 2010. 6. 14. · final report student researchers:...

10
Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1 Introduction When a small microphone is fitted into an ear canal, one might find that the sound picked up by the microphone would consist of more than just background noise. These detectable sound emissions generated from the inner ear have been named spontaneous otoacoustic emissions or SPOAEs since their discovery by a British engineer named David Kemp in 1977. Since their discovery, otoacoustic emissions (OAEs) have been a subject of research for its clinical applications. Spontaneous otoacoustic emissions are a special type of OAE that arise without an external stimulus. These are of particular interest because their existence suggests that the inner ear has some physiological amplification mechanism. Although more than one mathematical model has been used to describe the dynamics of the inner ear, the identity of the physiological mechanism by which sound is amplified is still under debate [1, 2, 3]. Other studies of OAEs have illuminated other interesting properties such as temperature dependence and responses to external tones [4, 5]. These OAEs are transduced by a microphone into an electrical signal that is fed through an analog to digital converter. The output signal of the analog to digital converter can be modified in its resolution and its sampling rate (the amount of times the device records sound levels per second). The digital signal can be analyzed using a discrete Fourier transform, which mathe- matically breaks a sinusoidal signal into its frequency components. In order to reduce noise, the generated waveform is separated into several fixed intervals. A Fourier transform is then performed on these individual intervals, and finally the amplitudes of each of these waveforms are averaged. Another method to reduce noise is to ”window” the signal and then perform a Fourier transform on the windowed signal. Windowing gives a chosen part of the signal a larger weight according to some normalized function. In using the first method to reduce noise, one potential discrepancy has arisen. One set of measurements has shown that the amplitude of the Fourier transform is affected by the sampling rate of the acoustic measuring device. Although the Fourier transform’s general form stays the same, it is of interest to have a reasonable ex- planation for such phenomena because at least a few articles refering to measurements of this kind do not even mention at what sampling rate their analog to digital converters operate[6, pg. 479]. When they have, the sampling rate has varied from 1600 samples per second to 48 thousand samples per second [11, 10, 9]. 1

Upload: others

Post on 16-Oct-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

Final Report

Student Researchers: Garrett Vanhoy and Alex SalceFaculty Advisor: Christopher Bergevin

June 1, 2010

1 Introduction

When a small microphone is fitted into an ear canal, one might find that the sound picked upby the microphone would consist of more than just background noise. These detectable soundemissions generated from the inner ear have been named spontaneous otoacoustic emissionsor SPOAEs since their discovery by a British engineer named David Kemp in 1977. Sincetheir discovery, otoacoustic emissions (OAEs) have been a subject of research for its clinicalapplications. Spontaneous otoacoustic emissions are a special type of OAE that arise without anexternal stimulus. These are of particular interest because their existence suggests that the innerear has some physiological amplification mechanism. Although more than one mathematicalmodel has been used to describe the dynamics of the inner ear, the identity of the physiologicalmechanism by which sound is amplified is still under debate [1, 2, 3]. Other studies of OAEshave illuminated other interesting properties such as temperature dependence and responses toexternal tones [4, 5].

These OAEs are transduced by a microphone into an electrical signal that is fed through ananalog to digital converter. The output signal of the analog to digital converter can be modifiedin its resolution and its sampling rate (the amount of times the device records sound levels persecond). The digital signal can be analyzed using a discrete Fourier transform, which mathe-matically breaks a sinusoidal signal into its frequency components. In order to reduce noise,the generated waveform is separated into several fixed intervals. A Fourier transform is thenperformed on these individual intervals, and finally the amplitudes of each of these waveformsare averaged. Another method to reduce noise is to ”window” the signal and then perform aFourier transform on the windowed signal. Windowing gives a chosen part of the signal a largerweight according to some normalized function. In using the first method to reduce noise, onepotential discrepancy has arisen. One set of measurements has shown that the amplitude of theFourier transform is affected by the sampling rate of the acoustic measuring device. Althoughthe Fourier transform’s general form stays the same, it is of interest to have a reasonable ex-planation for such phenomena because at least a few articles refering to measurements of thiskind do not even mention at what sampling rate their analog to digital converters operate[6,pg. 479]. When they have, the sampling rate has varied from 1600 samples per second to 48thousand samples per second [11, 10, 9].

1

Page 2: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

2 Methods

2.1 The Fast Fourier Transform (FFT)

The Fast Fourier Transform is a method that calculates the Discrete Fourier Transform (DFT)of a discrete-time signal in an efficient manner. The Cooley-Tukey algorithm for the FFT usesEuler’s identity to simplify this approximation computationally for a set of data with m pointsby calculating the coefficients Ck of

1

m

2m−1∑k=0

Ckeikt (1)

Where Ck complex is computed by

Ck =2m−1∑k=0

xjeikπjm (2)

xm being the mth point in the set of data.

2.2 The rfft Function

Although MATLAB’s fft method of computing the DFT will be used in the analysis of thesignals presented in this report, the fft includes negative frequencies. Instead, an alternativecode written based on the MATLAB’s fft will be used. The rrft computes the DFT of a signaland then ignores the negative frequencies. The magnitude of a negative frequency and positivefrequency with the same absolute value will have the same magnitude in the DFT. A brief proofof this is presented below. The normalized DFT of any signal can be written:

Xn =2π

M

M−1∑m=0

xme−2πinm

M (3)

Breaking each term of the sum into two parts using Euler’s identity:

e−2πinm

M = cos

(−2πinm

M

)− i sin

(−2πinm

M

)(4)

If n→ −n, we have:

cos

(2πinm

M

)− i sin

(2πinm

M

)= cos

(2πinm

M

)+ i sin

(2πinm

M

)(5)

Which means that negative frequencies have the same magnitude, but have a phase shiftfrom the positive frequencies.

2

Page 3: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

2.3 Creating Noise

When creating a signal with noise, MATLAB’s randn function was used. This function generatesa sequence of normally distributed values according to the standard normal distribution. Thisis admittedly not a realistic form of noise. However, this type of noise does not change the effectof sampling rate on a noisy signal.

2.4 Spectral Averaging and Time Averaging: A Comparison of Methods

There are two methods of isolating a signal from noise. The first, called time averaging, requiresinformation about the signal. Several segments of any integral number of periods of a signalare superimposed over one another. Then the magnitude at each discrete point in time isaveraged according to the number of segments. The background noise, if it has a zero meandisplacement, should disappear leaving only the signal in the time domain. The second method,called spectral averaging, requires no information about the signal. The DFT of several equallength segments are first computed. Then the magnitudes of the DFT from each segment aresuperimposed over one another. Finally, the magnitudes are averaged according to the numberof segments. To compare the effectiveness of each method, a tone was mixed with standardnormal distributed noise and each method was used to reproduce the tone without noise. Tomeasure the effectiveness of each method the noise floor, or the amplitude required to distinguisha signal from noise, is computed. The noise floor in this comparison is defined to be the averageabsolute magnitude of the frequencies in the DFT not including an interval of 10 frequenciesabove and below the frequency of the present tone. Figure 1 shows that time averaging, whenpossible, allows for a much lower noise floor than spectral averaging.

Figure 1: Tone and Standard Normal Distributed Noise with Time Averaging and SpectralAveraging

Figure 1 shows that spectral averaging does nothing to reduce the noise floor. This does notmean that spectral averaging does nothing to help distinguish a signal fron noise. What is notshown is that spectral averaging helps to spread the energy from the background noise evenlybetween all frequencies. When this happens, you recover the DFT of just the tone but raised

3

Page 4: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

by some amount porportional to the energy in the background noise. It is not clear whether ornot time averaging ever reaches a minimum noise floor even after 200 segments are averaged inthe time domain. It is expected that the noise floor approaches absolute zero, but only afteraveraging several orders of magnitude larger than 200 segments. A summary of these results areshown Table 1.

Property Spectral Averaging Time Averaging

Noise Floor No effect Decreases monotonically forat least 200 segments

Requires InformationAbout the Signal

No Yes

Table 1: Comparison of Time Averaging and Spectral Averaging Methods to Reduce Noise

2.5 Simulating Sampling Rate

It is often impossible to produce the exact signal twice in order to sample it at a different rate.In order to accomplish this, we ”re-sampled” a set of data by choosing to take a point from theoriginal data every n points to simulate a sampling rate of SR

n . Where SR is sampling rate ofthe original data.

2.6 Isolating The Effect of Sampling Rate

Examing the correlation between sampling rate and the amplitude of a DFT can be done in twoways. Either the length of time in which the signal is recorded could be kept constant, or theamount of points taken in the DFT is kept constant. The first case would correspond to keepingthe fundamental frequency constant. This is defined as 2π

T , where T is the duration of the signalin seconds. This method will be referred to as ”constant period.” An example of this is shownin figure 2.

Figure 2: Changing Sampling Rate with Constant Period

4

Page 5: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

Figure 3: Changing Sampling Rate with Constant Points

The second case corresponds to keeping the quantity fsN constant. Where fs is the sampling

rate and N is the number of points in the time domain. This can be thought of keeping the”bin size” of the DFT constant. This report considers only the first method. An example ofthis method is shown in Figure 3.

3 Results

3.1 The Effect of Sampling Rate on Gaussian Noise

To test the effect of sampling rate on the noise floor of Guassian noise, a signal of a fixed lengthin time was created using MATLAB’s randn function. In order to simulate doubling the SR anew signal was created with each point being spaced in time by half the distance that they werein the original signal. Although this is not realistic, it will not likely change the outcome of theexperiment because taking a signal like this and choosing every other point to be the new signalis just as random as making a new set of random points. The definition of the noise floor wasthe average amplitude in decibels of the entire signal in the frequency domain. This is not atypical definition when signals are present, but because this is entirely noise, the energy in thesignal should be distributed fairly even throughout the frequencies.

The linear fit to the data suggest that if the sampling rate is doubled, then the noise floordecreases by about -3.03 dB. This result is consistent to theory.

5

Page 6: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

Figure 4: The Effect of Sampling Rate on Gaussian Noise

3.2 The Effect of Sampling Rate on Gaussian Noise and a Tone

The signal analyzed here is a tone of frequency 2048 and amplitude 1. This frequency waschosen to be a multiple of 2 in order to keep an integer multiple of cycles in the time windowwhen the sampling rate was doubled. This may not be necessary, but it may slightly changethe outcome of the experiment because if it were not an integer multiple in each case, therewould be a significant contribution to the energy in the frequencies not present from the presentsignal (also called spectral leakage.) This would create a higher noise floor. However, the noisefloor should still have the same change if the sampling rate changes. The noise floor in this casewas defined as the average amplitude of the frequencies excluding an interval of 10 frequenciesaround the frequency of the present tone. Also, the noise in each signal was allowed to changewhen the sampling rate changed.

6

Page 7: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

Figure 5: 2.048kHz Tone and Gaussian Noise with Changing Sampling Rate

3.3 The Effect of Sampling Rate on an SOAE from a Tokay Gecko

Figure 6: Spectrum of SOAE from Tokay Gecko with Varying Sampling Rate

Figure 6 compares the DFT of a SOAE from a Tokay Gecko when the sampling rate ischanged. Here, when the sampling rate is decreased, the signal increases in amplitude. From a22050 Hz sampling rate to a 11025 Hz sampling rate, the signal does not qualitatively change.However, when the signal is sampled at 5512, the signal is misrepresented because of aliasing.The signal is known to have peaks above 2800 Hz, which is the cut-off frequency for the signalwith a simulated sampling of 5512 Hz.

7

Page 8: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

3.4 The Effect of Sampling Rate on an SOAE from Anole

Figure 7: DFT of SOAE from Anole

Figure 7 compares the DFT of a SOAE from an Anole. The results are similar to that ofchanging the sampling rate of a SOAE from a Tokay Gecko; however, there are no significantlylarge emissions above 2800 Hz so the signal is not aliased.

3.5 Summary

When considering a simple signal such as Guassian noise, it is clear that if the fundamentalfrequency of a DFT is kept constant while the sampling rate increases, then the amplitude of theDFT changes predictably. This relationship continues to hold when considering a slightly morecomplex signal that contains both Gaussian noise and a pure tone. However, when consideringexperimental data, this relationship appears to break down. The amplitude of the DFT stillincreases when the sampling rate decreases, but the amplitude does not increase uniformly orby 3 dB on average.

4 Discussion

4.1 Decreasing by 3 dB when Samping Rate is Doubled

Parseval’s Theorem states:

N−1∑n=0

x[n]2 =1

N

N−1∑k=0

X[k]2 (6)

Which means the the energy within a discrete-time signal is entirely distributed among itsharmonics. In Gaussian noise, the energy in a long enough segment is about the same energy ina segment of the same length. So, if the sampling rate is doubled, we have

8

Page 9: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

2N−1∑n=0

x[n]2 =1

2N

2N−1∑k=0

X[k]2 (7)

From equations (4) and (5) we see that:

N−1∑n=0

x[n]2 =1

N

N−1∑k=0

X[k]2 =1

4N

2N−1∑k=0

X[k]2 (8)

So,

1

N

2N−1∑k=0

(X[k]

2

)2

=1

N

N−1∑k=0

X[k]2 (9)

Which means that if the sampling rate is doubled, the amplitudes in the frequency domainmust be halved, which corresponds to a 3-decibel decrease in amplitude.

4.2 A Comparison of the Results between Gaussian Noise and ExperimentalData

It was shown that the relationship between the amplitude of the DFT and sampling rate seemsto be clear in the case of a simple signal, but tends to change for a more complex signal.For a more complex signal, decreasing the sampling rate does not increase the noise floor byexactly 3 dB uniformly. This is likely because when the sampling rate is decreased and the thedecreased sampling rate still remains larger than 2 times the largest present signal, the energyis redistributed among the tested frequencies in a non-uniform manner.

4.3 Further Work

As mentioned in section 2.6, there is another method to examine the effect of sampling rate onthe amplitude of the DFT. This method is expected to yield similar reults to the method usedin this report for simple signals (just noise), but it is a mystery whether or not it will producesimilar results for more complex signals. Additionally, there is a fundamental connection betweenan unsampled signal and its sampled one that was not explored in this report. This connectionbetween the sampled signal lies in the limit as both the sampling rate and the duration of thesignal go to infinity. It is expected that some large enough finite sampling rate and durationwill produce a DFT that closely resembles the Fourier transform of the unsampled signal.

References

[1] A model for active elements in cochlear biomechanics

S. T. Neely and D. O. Kim, J. Acoust. Soc. Am. 79, 1472 (1986), DOI:10.1121/1.393674

[2] Experimental confirmation of the two-source interference model for the fine structure ofdistortion product otoacoustic emissions

Carrick L. Talmadge, Glenis R. Long, Arnold Tubis, and Sumit Dhar, J. Acoust. Soc. Am.105, 275 (1999), DOI:10.1121/1.424584

9

Page 10: New Final Reportura-reports/101/Vanhoy... · 2010. 6. 14. · Final Report Student Researchers: Garrett Vanhoy and Alex Salce Faculty Advisor: Christopher Bergevin June 1, 2010 1

[3] Modeling otoacoustic emission and hearing threshold fine structures

Carrick L. Talmadge, Arnold Tubis, Glenis R. Long, and Pawel Piskorski, J. Acoust. Soc.Am. 104, 1517 (1998), DOI:10.1121/1.424364

[4] Spontaneous otoacoustic emissions in the European edible frog (Rana esculenta): Spectraldetails and temperature dependence

P. van Dijk, H.P. Wit and J.M. Segenhout, Hearing Research, 42, 273 - 282 (1989)DOI:10.1016/0378-5955(89)90151-2

[5] Relaxation dynamics of spontaneous otoacoustic emissions perturbed by external tones. I.Response to pulsed single-tone suppressors

W. J. Murphy, C. L. Talmadge, A. Tubis, and G. R. Long, J. Acoust. Soc. Am. 97, 3702(1995), DOI:10.1121/1.412387

[6] Roeser, Ross J., Michael Valente, and Holly Hosford-Dunn. Audiology. New York: ThiemeMedical Publishers, Inc., 2007. Pages 478 - 480.

http://books.google.com/books?id=H_j6v1Nt044C&lpg=PP1&pg=PP1#v=onepage&q=

&f=false

[7] Robinette, Martin S. and Theodore J. Glattke. Otoacoustic Emissions: Clinical Applica-tions. New York: Thieme Medical Publishers, Inc., 2007. Pages 69-70.

http://books.google.com/books?id=xheqzNwrei0C&lpg=PP1&pg=PP1#v=onepage&q=

&f=false

[8] James W. Hall, III. Handbook of Otoacoustic Emissions. Canada: Singular PublishingGroup, 2002. Pages 67 - 70.

http://books.google.com/books?id=X6Ez-C6LAbMC&lpg=PP1&pg=PP1#v=onepage&q=

&f=false

[9] Comparison of the auditory systems of heterosexuals and homosexuals: Click-evoked otoa-coustic emissions

McFadden, Dennis and Edward G. Pasanen, PNAS, 95 1998: 2709-2713.

[10] External and Middle Ear Status Related to Evoked Otoacoustic Emission in Neonates

Chang, Kang W., MD; Betty R. Vohr, MD; Susan J. Norton, PhD; and Mary D. Lekas,MD, Otolaryngology–Head and Neck Surgery, JAMA 1994;271:1698-1700.

[11] Distortion Product Otoacoustic Emissions: Hit and False-Positive Rates in Normal-Hearingand Hearing-Impaired Subjects

Musiek, Frank E. and Baran, Jane A. The American Journal for Otology, 18, 1997, 18:454-461

10