adaptive methods for speaker separation in cars

16
Adaptive Methods for Speaker Separation in Cars DaimlerChrysler Research and Technology Julien Bourgeois [email protected]

Upload: willa

Post on 05-Feb-2016

39 views

Category:

Documents


0 download

DESCRIPTION

Adaptive Methods for Speaker Separation in Cars. DaimlerChrysler Research and Technology Julien Bourgeois [email protected]. General context. s 2 (t). s 1 (t). +Road Noise spatially diffuse. Several simultaneous speakers (sources) spatially located. x 1 (t). x 4 (t). - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Adaptive Methods for Speaker Separation in Cars

Adaptive Methods for Speaker Separation in Cars

DaimlerChrysler Research and TechnologyJulien Bourgeois

[email protected]

Page 2: Adaptive Methods for Speaker Separation in Cars

2

General context

x1(t) x4(t)

Microphones

Goal: provide individual speech input for each passenger

Individual speech flows

s1(t)

s2(t)

+Road Noise spatially diffuse

Several simultaneous speakers (sources) spatially located

Separation

Algorithm

Page 3: Adaptive Methods for Speaker Separation in Cars

3

General context

x1(t) x4(t)

Microphones Individual speech flows

s1(t)

s2(t)

+Road Noise spatially diffuse

Several simultaneous speakers (sources) spatially located

Separation

Algorithm

Mixing system

Goal: provide individual speech input for each passenger

Page 4: Adaptive Methods for Speaker Separation in Cars

4

General context

x1(t) x4(t)

Microphones Individual speech flows

s1(t)

s2(t)

+Road Noise spatially diffuse

Several simultaneous speakers (sources) spatially located

Separation

Algorithm

Software

Goal: provide individual speech input for each passenger

Page 5: Adaptive Methods for Speaker Separation in Cars

5

Plan of the presentation

Overview of existing methods

Supervised/Informed separation vs. Blind separation

Blind separation and prior spatial information

Conclusion and future work

Page 6: Adaptive Methods for Speaker Separation in Cars

6

Existing methods: CASA vs. Multichannel Techniques

CASA: 1 microphone separation

Heuristics based on an analysis of human auditory system

Requires a lot of data (training of parameters)

Multi-microphones techniques: Speech moves much faster than…

the coherence relating two (or more) microphones.

Page 7: Adaptive Methods for Speaker Separation in Cars

7

Existing Methods: Beamforming

Beamforming: Prior information on target position

Constrain the response in the direction of interest

Minimize the output power

Problem of target cancellation if prior

spatial info is not perfect.Filters

Direction of interest

output

Page 8: Adaptive Methods for Speaker Separation in Cars

8

Existing methods: Blind Source Separation

Blind Source Separation (BSS) First applications to speech separation at the end of the

90’s

Only requirement: statistically independent sources

Difficult optimization problem: maximizing a nonlinear function (independence measure).

With many microphones, target cancellation can also appear.

Permutation ambiguity.

Acoustic

Mixing

BSSSources

DependentObservations

IndependentOutputs

Page 9: Adaptive Methods for Speaker Separation in Cars

9

The question is…

Is it possible to merge Beamforming and BSS, and combine their advantages?

In cars, prior knowlegde on speaker positions, separate blindly is suboptimal.

Page 10: Adaptive Methods for Speaker Separation in Cars

10

Blind separation and prior spatial information

Initialisation of BSS according to speakers positions helps optimisation procedure a lot.

Solve permutations problem solved

Target cancellation problem solved

Prior info : positions

Acoustic

Mixing

BSSSources

DependentObservations

IndependentOutputs

Page 11: Adaptive Methods for Speaker Separation in Cars

11

BSS is not that blind…

BSS performances depends dramatically on the type of mixing Strictly causal

Page 12: Adaptive Methods for Speaker Separation in Cars

12

BSS is not that blind…

BSS performances depends dramatically on the type of mixing Strictly causal

Non strictly causal

Page 13: Adaptive Methods for Speaker Separation in Cars

13

Beamforming is not that informed…

Perfect prior spatial information is actually not necessary:

Target cancellation problem can be solved if one can detect activity/silences of each speaker.

The detection problem is strongly related with IDIAP smart meeting room projects.

Filters

Direction of interest

output

Page 14: Adaptive Methods for Speaker Separation in Cars

14

Conclusion and future works

Combining BSS with a beamformer is not gainful.

We may inform BSS efficiently in the case of non-causal mixings

(algorithmic rotation of the microphone array)

Page 15: Adaptive Methods for Speaker Separation in Cars

15

Conclusion and future works

Combining BSS with a beamformer is not gainful.

We may inform BSS efficiently in the case of non-causal mixings

(algorithmic rotation of the microphone array)

Page 16: Adaptive Methods for Speaker Separation in Cars

16

Thank you!