tolerable delay for speech processing: effects of...

19
HADF, Oldenburg, Germany 1 June 2017 TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF HEARING ABILITY AND ACCLIMATISATION Tobias Goehring, PhD Previous affiliation (this project) : Institute of Sound and Vibration Research University of Southampton Southampton, UK Now at : MRC Cognition and Brain Sciences Unit University of Cambridge Cambridge, UK 1

Upload: others

Post on 07-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

HADF, Oldenburg, Germany

1 June 2017

TOLERABLE DELAY FOR SPEECH PROCESSING:

EFFECTS OF HEARING ABILITY AND ACCLIMATISATION

Tobias Goehring, PhD

Previous affiliation (this project):

Institute of Sound and Vibration Research

University of Southampton

Southampton, UK

Now at:

MRC Cognition and Brain Sciences Unit

University of Cambridge

Cambridge, UK

1

Page 2: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Cochlea picture: http://storage3d.com/storage/2006.10/5770ca614015fd13ae99fc999dff667e.jpg

Bone conduction

Air conduction

Own voice

Processed signal DSP

Processed signal DSP

Delay

Cochlea

Direct signal

External voice

Cochlea

Introduction

2

Level-ratio

Level-ratio

Page 3: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Perceptual effects of delay on speech communication:

Audio-visual synchronicity (> 80 ms) [1]

Auditory-proprioceptive feedback (> 80 ms) [2,3]

Distinct echo perception (> 50 ms) [4]

Changes to speech production rate (from 43 ms) [3]

Timbre alterations due to comb-filter effect (< 50 ms)

3

Introduction

[1] McGrath, M., & Summerfield, Q. (1985). Intermodal timing relations and audio-visual speech recognition by normal- hearing adults. Journal of the Acoustical Society of America, 77, 678–685 [2] Lee, B. S. (1950). Effects of delayed speech feedback. Journal of the Acoustical Society of America, 22, 824–826. [3] M. Stone and B. C. J. Moore (2005), “Tolerable hearing-aid delays: IV. effects on subjective disturbance during speech production by hearing-impaired subjects.,” Ear Hear., vol. 26, no. 2, pp. 225–35. and Stone and Moore (2002) [4] M. Stone and B. C. Moore (1999), “Tolerable hearing aid delays. I. Estimation of limits imposed by the auditory path alone using simulated hearing losses.,” Ear Hear., vol. 20, no. 3, pp. 182–92.

Page 4: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Literature suggests upper limit of delay for hearing devices: 10 ms

Introduction

X – NH

O – HI

Do hearing-impaired people tolerate longer output delays than

normal-hearing people when tested on the same setup?

Do experienced users of hearing aids tolerate longer delays ?

Does long-term acclimatisation to delay increase tolerance ?

4

Page 5: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

5

Study 1: Effects of hearing ability and experience with HA

Study 2: Effects of long-term acclimatisation

Overview

Page 6: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Participants: 20 NH Age: 18-45 y, 8 fem.

20 HL Age: 45-81 y, 8 fem. 10 new, 10 experienced with HA

Setup: Real-time processing (DSP: Linear mixer and delay, at fs=48 kHz)

Headphones (closed, circumaural, at 65 dB(A))

Fitting gain: Half-gain rule based on hearing thresholds (HL group)

Conditions: 5 delay: [10…50] ms and 3 voice: own and external (2)

AMP DSP EQ Experimenter

Participant

Own voice (OwnV)

AMP DSP EQ

Experimenter Participant

External

voice (Ext0dB, Ext20dB)

Methods: Study 1

6

Page 7: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Results: comparison NH vs. HL

Subjective rating of annoyance (7-point scale)

1 min. listening/reading per stimulus condition @ 65 dB(A)

Effect of hearing ability:

Significant effect of hearing ability (NH/HL). [F(1,38)=4.619, p=0.038]

7

Page 8: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Results: HI group (PTA)

Split 20 HL in three subgroups based on PTA (500, 1000, 2000 Hz):

LOW < 35 dB HL < MID < 50 dB HL < HIGH, n=5/8/7

8

No significant effect of subgroup (LOW/MID/HIGH).

Significant correlation between average slopes of

ratings and PTA within HL group (r=-0.51, p=0.022).

Page 9: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Results: HI group (experience)

Similar ages (67.1 vs. 66.9 years),

but different PTA for NEW and EXP (37.4 vs. 50.2 dB HL).

9

Split HI group in two subgroups based on experience with hearing aids:

NEW and EXP, n=10/10,

No significant effect of experience (NEW/EXP).

Page 10: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

10

Study 1: Effects of hearing ability and experience with HA

Study 2: Effects of long-term acclimatisation

Overview

Page 11: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Participants: 8 NH Age: 20.9 y, 4 fem., NH thresholds

Setup: Real-time processing (DSP: delay, limiter, at fs=48 kHz)

iPhone (4S, 5) with earplugs (in-ear, int. microphone, at 65 dB(A))

Conditions: 4 delay: [10,20,30,40] ms and 3 level-ratio: [0,10,20] dB

iPhone Ear App:

2 delay settings:

Group 1: 20 ms

Group 2: 40 ms

5 days of use (1 week)

App made by Dr. Nick Clark

(Mimi Hearing Technologies)

Methods: Study 2

11

PRE POST 2 3 4

Page 12: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Results: Comparison of PRE / POST

12

Contourplots for ALL / PRE / POST test and both groups:

Yellow – higher annoyance, Green – lower annoyance

Group 1 = 20 ms Group 2 = 40 ms

Page 13: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Results: Comparison of groups

13

Compare Group1 and Group 2, n=4/4, averages across level-ratios:

Significant difference between groups for post test [F(1,6)=7.665, p=0.032].

Some acclimatisation for Group1, but also bit lower tolerance for Group2 …

5-day acclimatisation

Page 14: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Hearing loss increased tolerance of delay over NH (average ratings)

Lower sensitivity to changes in delay with stronger HL (average slopes)

Experience with hearing aids showed some trends but no significant effect of tolerance and potential confound with HL

Long-term acclimatisation increased tolerance of delay for NH (Ear App)

Results extend findings of previous study (Stone and Moore, 2005) to external voice conditions and linear processing (no WDRC)

Limitations of study: linear signal processing, only speech stimuli (e.g. no music) and presentation via closed headphones / earplugs

- most likely different to perception with commercial hearing aids!!

Conclusion

14

Page 15: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

o Long-term study with HI listeners and actual HA devices ?

o Effect of experience with matched PTA/Age between groups ?

Algorithms with potential benefits from increased delay:

Noise reduction based on machine learning (GMM, DNN)

Decorrelation for feedback cancellation

Wireless streaming of audio (binaural, HA+CI, smartphones…)

More energy-efficient processing with larger time window ?

Future work and ideas

15

Page 16: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Funding source

16

The work leading to this deliverable and the

results described therein has received

funding from the People Programme (Marie

Curie Actions) of the European Union’s

Seventh Framework Programme FP7/2007-

2013/ under REA grant agreement n° PITN-

GA-2012-317521

www.icanhear.eu

Acknowledgements

Stefan Bleeck

Jessica Monaghan

Josie Chapman

Sakeena Kanji

Page 17: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

17

Thank you!

Contact:

[email protected]

Page 18: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

appendix

18

Page 19: TOLERABLE DELAY FOR SPEECH PROCESSING: EFFECTS OF …hadf.hoertech.de/2017/downloads/Goehring_DelayTolerance_HADF2017.pdfParticipants: 20 NH Age: 18-45 y, 8 fem. 20 HL Age: 45-81 y,

Results: Comparison of PRE/POST

19

Results for PRE / POST test and both groups:

Group 1

(20 ms)

Group 2

(40 ms)