Signal Processing by Neural Networks
Aurelio Uncini
INFOCOM Dept.
University of Rome “La Sapienza”
e-mail: aurel@ieee.org
International Joint Conference on Neural Networks, Washington, DC USA, July 18 2001
Agenda
- Neural architectures for real-time DSP
- Non-linear generalizations of FIR/IIR filters by Dynamic Multilayer Perceptron (DMLP) neural networks
- Fast adaptive spline neural models for signal processing
- Some applications
Linear DSP: is it enough?
- Most classical DSP techniques are based on linear models.
- In the real world, processes are non-linear.
- Linear structures may be unable to model them adequately.
Non-linear DSP: classical approaches
Specific non-linear architectures:
- particular class of problems; efficient but specific;
- e.g. median and bilinear filters, some spectral analysis techniques.
Generic non-linear architectures:
- large class of problems; general but complex;
- e.g. Volterra filters, non-linear state equations, polynomial filters, functional links.
Specific or generic?
Non-linear DSP: desired approach
- "standard" architecture with scalable complexity;
- capability to approximate any non-linear (dynamic) behaviour;
- "standard" design algorithms;
- good implementation efficiency:
  - use of the available know-how on linear filters;
  - use of the available DSP processors for real-time audio applications.
Generic and efficient.
Non-linear DSP: a different approach
Dynamic = capability of processing temporal sequences.
- how to get dynamic behaviour from an ANN;
- non-linear FIR/IIR architectures;
- design methods (learning process).
Dynamic Artificial Neural Networks (discrete-time ANN)
ANN: the multi-layer model
[Figure: (a) a single neuron: inputs x_1 ... x_N weighted by w_1 ... w_N, summed into s and squashed by a sigmoid sgm; (b) a multi-layer network of such neurons producing the outputs y_1 ... y_M]
- "Artificial neurons" arranged in layers: the Multi-Layer Perceptron (MLP).
- Feedforward structure without delays: no dynamics.
- Approximation of any non-linearity (Cybenko '88, Hornik et al. '89): a non-linear universal approximator.
Dynamic Multilayer Networks (DMLP)
External memory: the non-linear filter is built using a static MLP inside a FIR/IIR framework, with external delay lines ("buffers"); Narendra-Parthasarathy 1990.
Internal memory: the non-linear filter is built using an MLP whose neurons contain FIR/IIR filters, i.e. internal delay lines ("dynamic neurons"); Waibel et al. 1989, Back-Tsoi 1991/94, Frasconi et al. 1992, Campolucci et al. 1999, etc.
Delay lines (memory elements) put dynamics into the multilayer model.
DMLP with external memory: FIR scheme
- allows the use of the classical multilayer model;
- straightforward generalization of the linear FIR filter: linear filter ≡ DMLP with one neuron and f(.) = identity.
[Figure: the input signal x[k] feeds a delay line producing x[k], x[k-1], ..., x[k-M+1]; the M taps drive the MLP, which produces the output y[k]]
Universal functional approximator.
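As an illustration, a minimal sketch of this buffered scheme in Python/NumPy: a tapped delay line feeding a one-hidden-layer static MLP. The layer sizes and weights are hypothetical, and tanh stands in for the slides' generic sigmoid.

```python
import numpy as np

def dmlp_fir(x, W1, b1, W2, b2):
    """Non-linear FIR filter: a delay line of M past samples feeds a static MLP.

    x  : input signal, shape (K,)
    W1 : hidden weights, shape (H, M); b1 : hidden biases, shape (H,)
    W2 : output weights, shape (H,);  b2 : output bias (scalar)
    """
    M = W1.shape[1]
    y = np.zeros(len(x))
    buf = np.zeros(M)                  # delay line: x[k], x[k-1], ..., x[k-M+1]
    for k, xk in enumerate(x):
        buf = np.roll(buf, 1)
        buf[0] = xk
        h = np.tanh(W1 @ buf + b1)     # sigmoidal hidden layer
        y[k] = W2 @ h + b2             # linear output neuron
    return y
```

With a single neuron, the identity in place of tanh and unit output weight, the structure collapses to y[k] = Σ_m w_m x[k-m], i.e. the ordinary linear FIR filter, as the slide states.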
DMLP with external memory: IIR scheme
- allows the use of the classical multilayer model;
- straightforward generalization of the linear IIR filter: linear filter ≡ DMLP with one neuron and f(.) = identity.
[Figure: the delayed inputs x[k], x[k-1] and the fed-back delayed outputs y[k-1], y[k-2] drive the MLP, which produces y[k]]
DMLP with internal memory
- several different research contributions (already the object of studies);
- another generalization of the linear FIR/IIR filters: linear filter ≡ DMLP with one neuron and f(.) = identity;
- a multilayer network composed of "dynamic" neurons.
[Figure: MLP of dynamic neurons D mapping the input signals to the output signal y[k]]
DMLP with internal memory: FIR/IIR synapses
[Figure: dynamic neuron n of layer l: each input x_m^(l-1) passes through a FIR/IIR synaptic filter; the filter outputs y_nm^(l) and a bias are summed into s_n^(l), and the activation f(.) yields x_n^(l)]
DMLP with internal memory: activation feedback
[Figure: FIR synaptic filters on the inputs; the neuron activation s_n^(l) is fed back through a unit delay and a FIR filter before the non-linearity f(.) produces the output x_n^(l)]
DMLP with internal memory: output feedback
[Figure: as in the activation-feedback scheme, but the neuron output x_n^(l), taken after the non-linearity f(.), is fed back through the unit delay and FIR filter]
DMLP: design methods
Linear filters:
- static design: desired response in frequency; windowing, equiripple, Deczky, etc.;
- adaptive: desired response in time (or frequency); LMS, RLS, etc.
DMLP (ANN):
- design ≡ "learning" by examples (supervised or unsupervised), desired response in time.
DMLP: learning with external memory
[Figure: the buffer x[k], x[k-1], ... and the fed-back samples z[k-1], z[k-2] drive the MLP; the BP algorithm adapts the weights from the error ε[k] between the output y[k] and the desired signal d[k]]
- FIR/IIR buffers; backpropagation (Rumelhart 1986), LMS or RLS extensions (LS).
- (a) Equation error: z[k] = d[k] (here "teacher forced").
- (b) Output error: z[k] = y[k].
- d[k]: desired signal.
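In the linear special case (one neuron, f(.) = identity) this adaptation reduces to the classical LMS adaptive FIR filter. A minimal sketch; the "unknown system" taps and the step size are illustrative choices, not values from the slides:

```python
import numpy as np

# LMS adaptation of the buffered scheme in its linear special case:
# identify an unknown FIR system from its input/output signals.
rng = np.random.default_rng(1)
h_true = np.array([0.5, -0.3, 0.2])    # hypothetical system to identify
M, mu = 3, 0.05                        # buffer length, LMS step size
w = np.zeros(M)
buf = np.zeros(M)
for k in range(3000):
    buf = np.roll(buf, 1)
    buf[0] = rng.standard_normal()     # white input x[k]
    d = h_true @ buf                   # desired response d[k]
    e = d - w @ buf                    # output error eps[k]
    w += mu * e * buf                  # LMS weight update
print(np.round(w, 3))                  # converges close to h_true
```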
External delay-line learning example (supervised)
[Figure: input sequence x[k] and desired sequence d[k]; the delay-line/MLP/BP scheme is trained over a running window that slides along the signals with overlap]
DMLP: learning with internal memory
Internal memory (linear dynamic system): the IIR synapse. Forward mode:

y_nm^(l)[t] = Σ_{p=0}^{L-1} w_nm^(l)(p) x_m^(l-1)[t-p] + Σ_{p=1}^{I} v_nm^(l)(p) y_nm^(l)[t-p]
s_n^(l)[t] = Σ_{m=0}^{N_(l-1)} y_nm^(l)[t]
x_n^(l)[t] = sgm( s_n^(l)[t] )

or, in operator form,

y[t] = ( B(t,q) / (1 - A(t,q)) ) x[t],  with  A(t,q) = Σ_{p=1}^{N} v_p[t] q^-p,  B(t,q) = Σ_{p=0}^{M} w_p[t] q^-p

The learning is non-causal => batch mode, minimising

J = Σ_{t=1}^{T} ( d[t] - x_n^(M)[t] )^2

[Figure: two-layer example network with sigmoidal neurons and IIR synapses B(t,q)/(1 - A(t,q)) between the layers]
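The forward recursion above can be sketched directly. A loop-based Python/NumPy illustration; the synapse orders are arbitrary and tanh stands in for sgm(·):

```python
import numpy as np

def iir_synapse(x, w, v):
    """One IIR synapse: y[t] = sum_p w[p] x[t-p] + sum_p v[p] y[t-p]."""
    y = np.zeros(len(x))
    for t in range(len(x)):
        acc = sum(w[p] * x[t - p] for p in range(len(w)) if t - p >= 0)
        acc += sum(v[p - 1] * y[t - p] for p in range(1, len(v) + 1) if t - p >= 0)
        y[t] = acc
    return y

def dynamic_neuron(inputs, weights, feedbacks):
    """Dynamic neuron: the IIR synapse outputs are summed and squashed."""
    s = sum(iir_synapse(x, w, v) for x, w, v in zip(inputs, weights, feedbacks))
    return np.tanh(s)
```

For example, a single synapse with w = [1] and v = [0.5] responds to an impulse with the geometric sequence 1, 0.5, 0.25, ..., the internal memory the slide refers to.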
DMLP: Recursive Backpropagation (RBP)
RBP: a general framework to derive several causalised on-line approximate learning algorithms for DMLP.
[Figure: RBP signal-flow graph: the forward network of the previous slide is mirrored by an adjoint graph in which the error d[t] - x_1^(2)[t] propagates back through sgm' blocks and 1/(1 - A(t,q)) branches, producing the weight updates Δw[t+1] and Δv[t+1] scaled by the learning rate µ]
DMLP: complexity, implementation
- computational complexity (fast learning algorithms);
- structural complexity (in terms of number of interconnections).
A suitable activation function can increase the representation power of the neural network: fewer neurons are needed, hence small structural complexity and small computational complexity.
A new neuron architecture
Theoretical framework: extension of the Cybenko theorem (Hornik, Stinchcombe, White, 1989): neurons with any "squashing" activation function maintain the universal approximation property.
Goals for the new neuron:
- a "more sophisticated" activation function;
- retains the squashing property of the sigmoid;
- has the necessary smoothing characteristic;
- easy to implement both in hardware and in software;
- flexible (through the adaptation of a few free parameters).
A new neuron architecture's activation function
Two required properties:
- squashing characteristic (saturation at ±G);
- flexible shape.
A new neural network architecture with a flexible adaptive activation function.
Adaptive Spline Neural Network (ASNN)
[Figure: piecewise spline activation h(x) defined over uniformly spaced control abscissas x_0 ... x_N, with local spans i = 0 ... N-3]
Using Catmull-Rom or B-spline functions with uniformly spaced control points, the shape of the activation function is controlled by few parameters (3rd degree: four control points per span).
- local shape adaptation;
- smooth characteristic: suitable for signal-processing applications.
ASNN: spline activation function
The spline activation function is specified as a weighted average of four equally spaced control points Q_i, Q_(i+1), Q_(i+2), Q_(i+3):
y = h(x) = h(u, i)
[Figure: the local curve segment h(u, i) over the control points, for the Catmull-Rom and B-spline bases, with control-point spacing Δx]
Spline neuron scheme
[Figure: flexible-activation-function neuron: the inputs x_0^(l-1) ... x_N^(l-1) from layer (l-1) are weighted by w_k0^(l) ... w_kN^(l) and summed into s_k^(l); block SG1 computes the local coordinates (u_k, i_k) and block SG2 evaluates the spline, giving the output x_k^(l) = h(u_k, i_k)]
Flexible spline neuron scheme
Block SG1 maps the activation s to a span index i and a local abscissa u:
z = s/Δx + (N-1)/2,  i = ⌊z⌋,  u = z - i
Block SG2 evaluates the spline span:
y = h(u, i) = T_u · M · Q_i
with
T_u = [ u^3  u^2  u  1 ],  Q_i = [ Q_i  Q_(i+1)  Q_(i+2)  Q_(i+3) ]^T
and, for the Catmull-Rom basis,
M = (1/2) ·
[ -1   3  -3   1
   2  -5   4  -1
  -1   0   1   0
   0   2   0   0 ]
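The SG1/SG2 computation can be sketched in a few lines; the control-point array Q, the spacing Δx = 1 and the index clipping at the ends are illustrative choices:

```python
import numpy as np

# Catmull-Rom basis matrix (cubic spline, uniform control points)
M = 0.5 * np.array([[-1.0,  3.0, -3.0,  1.0],
                    [ 2.0, -5.0,  4.0, -1.0],
                    [-1.0,  0.0,  1.0,  0.0],
                    [ 0.0,  2.0,  0.0,  0.0]])

def spline_activation(s, Q, dx=1.0):
    """Evaluate the spline activation h(s) from the control points Q."""
    N = len(Q)
    z = s / dx + (N - 1) / 2.0                 # SG1: knot coordinate
    i = int(np.clip(np.floor(z), 0, N - 4))    # span index (4 points per span)
    u = z - i                                  # local abscissa
    Tu = np.array([u**3, u**2, u, 1.0])
    return Tu @ M @ Q[i:i + 4]                 # SG2: T_u . M . Q_i
```

At u = 0 the Catmull-Rom segment passes exactly through Q_(i+1), so the activation interpolates its control points, which is what makes their local adaptation well-behaved.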
Adaptive Spline Neural Networks
Fast performance: suitable for many non-linear DSP applications.
[Figure: comparison between a standard MLP and an ASNN]
Some related references
Adaptive Spline Neural Networks:
- L. Vecci, F. Piazza, A. Uncini, "Learning and Approximation Capabilities of Adaptive Spline Activation Function Neural Networks", Neural Networks, Vol. 11, No. 2, pp. 259-270, March 1998.
- A. Uncini, L. Vecci, P. Campolucci, F. Piazza, "Complex-Valued Neural Networks with Adaptive Spline Activation Function for Digital Radio Links Nonlinear Equalization", IEEE Trans. on Signal Processing, Vol. 47, No. 2, February 1999.
- S. Guarnieri, F. Piazza, A. Uncini, "Multilayer Feedforward Networks with Adaptive Spline Activation Function", IEEE Trans. on Neural Networks, Vol. 10, No. 3, pp. 672-683, May 1999.
- M. Solazzi, A. Uncini, "Artificial Neural Network with Adaptive Multidimensional Spline Activation Functions", IEEE-INNS-ENNS International Joint Conference on Neural Networks IJCNN2000, Como, Italy, 24-27 July 2000.
On-line learning of DMLP:
- P. Campolucci, A. Uncini, F. Piazza, B. D. Rao, "On-Line Learning Algorithms for Locally Recurrent Neural Networks", IEEE Trans. on Neural Networks, Vol. 10, No. 2, pp. 253-271, March 1999.
- P. Campolucci, A. Uncini, F. Piazza, "A Signal-Flow-Graph Approach to On-line Gradient Calculation", Neural Computation, Vol. 12, pp. 1901-1927, August 2000.
Some applications
- low-cost loudspeaker linearisation by predistortion;
- speech quality enhancement;
- subband audio signal recovery using neural non-linear prediction;
- non-linear models for sound synthesis;
- blind acoustic signal separation/deconvolution.
Linearisation of a non-linear device: loudspeaker predistortion
ASNN-DMLP with buffers, implemented in real time on an ARIEL DSP96 (Motorola 96002) with an ARIEL ProPort 16 (Uncini, Piazza, BIAS-1997).
[Figure: pre-distortion processing chain: the ASNN-DMLP on the DSP predistorts the signal before the preamplifier and power amplifier that drive the woofer]
The loudspeaker is a non-linear dynamic device.
[Figure: loudspeaker scheme]
Several electrical and mechanical non-linearity sources: a (hard) dynamic non-linear device.
[Figure: force factor ∫B·dl [N/A] vs. coil displacement [mm]; surround/suspension restoring force [N] vs. x [mm]; hysteretic phenomena in the motor (coil, pole, magnet); the small-signal behaviour is shown for comparison]
Frequency response of the woofer SIPE APW300
[Figure: sound-pressure response [dB re. 20 µPa] vs. frequency [Hz] at 1 W and at 100 W input power, showing the fundamental together with the second and third harmonics]
Predistortion learning scheme 1: on-line
[Figure: the ASNN-DMLP on the DSP predistorts the input and drives the unknown system (power amplifier + woofer SIPE APW300) through the D/A; the acoustic output acquired through the A/D is compared with the desired signal, and the weights are adapted by perturbation]
Weight-perturbation update:
ŵ^(k+1) = ŵ^(k) - η ĝ(ŵ^(k)),  ĝ(·) ≈ ∇J(·)
from the two perturbed cost measurements
z^(k)+ = J(ŵ^(k) + c^(k) Δ^(k)) + ε^(k)+
z^(k)- = J(ŵ^(k) - c^(k) Δ^(k)) + ε^(k)-
and the simultaneous-perturbation gradient estimate
ĝ(ŵ^(k)) = (z^(k)+ - z^(k)-) / (2 c^(k)) · [ 1/Δ_1^(k)  ...  1/Δ_n^(k) ]^T
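A minimal sketch of this perturbation-based update, in the spirit of simultaneous-perturbation stochastic approximation; the quadratic toy cost and all constants are illustrative, not the loudspeaker cost of the slides:

```python
import numpy as np

def spsa_step(w, J, c=0.01, eta=0.1, rng=None):
    """One weight-perturbation step: two cost evaluations estimate the gradient."""
    rng = rng or np.random.default_rng()
    delta = rng.choice([-1.0, 1.0], size=w.shape)   # Bernoulli perturbation
    z_plus = J(w + c * delta)
    z_minus = J(w - c * delta)
    g_hat = (z_plus - z_minus) / (2.0 * c) / delta  # per-component estimate
    return w - eta * g_hat

# toy cost: squared distance from a hypothetical target weight vector
target = np.array([1.0, -2.0, 0.5])
J = lambda w: np.sum((w - target) ** 2)
w = np.zeros(3)
rng = np.random.default_rng(0)
for _ in range(200):
    w = spsa_step(w, J, rng=rng)
```

The appeal for the real-time set-up is that only two measurements of the cost per step are needed, regardless of the number of weights; no analytic gradient of the plant is required.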
Predistortion learning scheme 2: on-line
[Figure: a loudspeaker NN model (DSP ASNN) is used to perform on-line Backpropagation Through Time: the forward pass goes through the ASNN-DMLP predistorter and the model, and the error against the input delayed by z^-m is propagated backward]
Learning scheme 2: on-line
Neural network pre-training in an anechoic chamber.
[Figure: the DSP ASNN is trained against the APW300 loudspeaker model acquired in an anechoic chamber, minimising the error ε[k]]
Learning scheme 2: on-line
[Figure: harmonic distortion of the APW300 loudspeaker and of its neural model: output frequency vs. sweep input frequency, showing the fundamental, the 2nd harmonic and, in the model response, some aliasing]
ASNN architecture: 10 inputs (FIR), 1 hidden layer with 2 neurons, 1 linear output.
Predistortion results
ASNN architecture: 20 inputs (FIR), 1 hidden layer with 2 spline neurons, 1 linear output.
Harmonic distortion:
THD = sqrt( A_II^2 + A_III^2 + A_IV^2 ) / A_I
[Figure: off-line predistortion MSE [dB] vs. training epoch]
Without predistorter: THD = 20.7%. With predistorter: THD = 4.15% (14 dB reduction at 200 Hz).
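The THD figure above is a direct computation from the harmonic amplitudes; a small helper (the amplitudes in the example are illustrative, not the measured APW300 values):

```python
import numpy as np

def thd(a_fund, *harmonics):
    """Total harmonic distortion: RMS of the harmonics over the fundamental."""
    return np.sqrt(sum(a * a for a in harmonics)) / a_fund

# fundamental amplitude 1.0; 2nd, 3rd and 4th harmonic amplitudes
print(round(100.0 * thd(1.0, 0.1, 0.05, 0.02), 2), "%")
```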
Predistortion results
About 20 dB of 2nd- and 3rd-harmonic attenuation at 200 Hz.
[Figure: output FFT magnitude [dB] vs. frequency [Hz] for a 200 Hz excitation: model alone vs. controller + model]
Speech Quality Enhancement
The problem is to recover, from a narrow-band signal (telephone quality), the two missing frequency bands: nominally 20 Hz to 300 Hz and 3400 Hz to 8000 Hz.
SQE operator: s_NB[n] → s_WB[n] (narrow-band speech to wide-band speech).
[Figure: spectrogram]
Speech Quality Enhancement
As several authors have conjectured, this should be made possible by the human speech production mechanism, which relates the frequency contents of the different bands.
We postulate the existence of a non-linear operator for speech enhancement, called the Quality Enhancement Operator (QEO), denoted Ψ.
[Figure: vocal tract anatomy: nasal cavity, hard palate, soft palate (velum), tongue, lips, glottis]
Quality Enhancement Operator
The proposed Ψ operator works in the frequency domain, without an excitation signal, and does not need any parameter tuning. It is defined as:
S̃_n(e^{jω_k}) = Ψ[ S_n(e^{jω_k}) ]
or, in terms of the Short-Time Fourier Transform (STFT):
s̃[n] = Σ_m Σ_k Ψ[ S_m(e^{jω_k}) ] e^{jω_k n},  with  S_m(e^{jω_k}) = Σ_l s[l] w[m-l] e^{-jω_k l}
The Ψ operator is implemented by a Dynamic Complex-domain Adaptive Spline Neural Network (D-CASNN).
Proposed Speech Quality Enhancement scheme 1
[Figure: s_NB[n] is upsampled (↑2), low-pass filtered, windowed by w[n] and transformed by an M-point FFT; Ψ_HF and Ψ_LF reconstruct the missing high- and low-frequency bins from the telephone band; the M-point IFFT yields s_WB[n]]
Proposed Speech Quality Enhancement scheme 2
[Figure: as in scheme 1, plus an additional branch for the recovery of high-frequency unvoiced sounds: a V/UV (voiced/unvoiced) detector and an LPC stage whose residual passes through a rectifier and a high-pass filter]
Speech Quality Enhancement set-up
- 64-point Hanning windows with an overlap of 32 points;
- 64-point complex FFT/IFFT;
- a DCASNN1 with 12 inputs and 2 outputs, with Δx = 1 for each neuron, for the recovery of the low band;
- a DCASNN2 with 12 inputs and 19 outputs, with Δx = 1, for the recovery of the high-band voiced sounds;
- an LPC of order 8, followed by a rectifier and a high-pass filter, for the recovery of the high-band unvoiced sounds.
Very small networks, implemented on a standard low-cost floating-point DSP processor (Texas Instruments TMS320C30).
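A note on the 64/32 windowing choice above: a periodic Hann window at 50% overlap satisfies the constant-overlap-add condition, so the windowing/FFT/IFFT chain itself is lossless and any change in the output comes only from the Ψ operators. A quick check (the periodic window definition is an assumption; the slides do not specify it):

```python
import numpy as np

# Constant-overlap-add check: 64-point periodic Hann window, 32-point hop.
N, hop = 64, 32
w = 0.5 * (1.0 - np.cos(2.0 * np.pi * np.arange(N) / N))

cola = np.zeros(N + 4 * hop)
for start in range(0, len(cola) - N + 1, hop):
    cola[start:start + N] += w        # overlap-add the window itself

interior = cola[N:-N]                 # ignore the un-overlapped edges
print(float(interior.min()), float(interior.max()))   # both very close to 1.0
```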
Preliminary results (work in progress)
[Audio examples: narrow-band speech vs. reconstructed wide-band speech]
Subband Audio Signal Recovery using Neural Non-linear Prediction
Audio signal recovery is a common problem in the digital audio restoration field. The reconstruction of L consecutive missing samples in an audio signal may be considered an extrapolation problem.
[Figure: example of 400 missing samples (8 ms) in a music audio signal; amplitude vs. sample index]
Method: cross-fade of forward and backward predicted samples
[Figure: the gap of missing audio samples is filled by a forward prediction, weighted by a forward cross-fade gain, and a backward prediction, weighted by a backward cross-fade gain; their weighted sum gives the reconstructed signal]
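A minimal sketch of the cross-fade rule; the linear gain ramps are an assumption, since the slides do not specify the gain shapes:

```python
import numpy as np

def crossfade_fill(fwd, bwd):
    """Fill a gap by cross-fading forward and backward predicted samples."""
    L = len(fwd)
    g = np.linspace(0.0, 1.0, L)        # backward gain ramps up across the gap
    return (1.0 - g) * fwd + g * bwd    # forward gain ramps down

# toy predictions over a 5-sample gap
fwd = np.array([1.0, 1.0, 1.0, 1.0, 1.0])
bwd = np.array([3.0, 3.0, 3.0, 3.0, 3.0])
print(crossfade_fill(fwd, bwd))
```

At the gap edges the fill matches the forward prediction on the left and the backward prediction on the right, which keeps the splice continuous with both known signal segments.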
Subband multirate approach
A multirate approach gives the advantage of a reduced sample rate, hence fewer samples to predict, allowing a longer gap to be reconstructed. A direct implementation of the multirate technique is a filter bank, which offers two distinct advantages:
- the signal is split into several components, similarly to a frequency analysis; the prediction is made easier by the slower evolution of the subband signals;
- the decimation process reduces the signal length, avoiding numeric problems.
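One stage of such a filter bank can be sketched with the simplest (Haar) filter pair; this is an illustrative stand-in for the longer H_LP/H_HP filters of the slides, but it shows the split-and-decimate analysis and the exact reconstruction:

```python
import numpy as np

def analysis(x):
    """One stage: low/high split followed by decimation by 2 (Haar pair)."""
    lo = (x[0::2] + x[1::2]) / np.sqrt(2.0)   # low-pass + decimate
    hi = (x[0::2] - x[1::2]) / np.sqrt(2.0)   # high-pass + decimate
    return lo, hi

def synthesis(lo, hi):
    """Upsampling and synthesis filtering: inverts the analysis stage."""
    x = np.empty(2 * len(lo))
    x[0::2] = (lo + hi) / np.sqrt(2.0)
    x[1::2] = (lo - hi) / np.sqrt(2.0)
    return x

x = np.sin(0.1 * np.arange(64)) + 0.1 * np.cos(1.3 * np.arange(64))
lo, hi = analysis(x)
assert np.allclose(synthesis(lo, hi), x)      # perfect reconstruction
```

Iterating the analysis on the low-pass branch yields the octave (constant-Q) decomposition of the next slide; each subband is half the length of its parent, which is exactly the gap-length advantage described above.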
The octave (constant-Q) subband predictor
[Figure: four-stage octave filter bank: each stage St1 ... St4 splits the signal with a low-pass filter H_LP and a high-pass filter H_HP, each followed by decimation by 2, producing the subbands x_0[n] ... x_4[n]; the analysis responses |H_k(e^{j2πf})| have constant quality factor Q = f_0/Δf]
The uniform M-channel subband predictor
[Figure: each channel k filters x[n] with the analysis filter H_k(z) and decimates by M; a predictor produces the subband estimate x̂_k[n] and the error e_k[n]; upsampling by M and the synthesis filters F_k(z) reconstruct the signal]
Cosine-modulated analysis and synthesis filters:
h_i[n] = 2 h[n] cos( (i + 1/2)(π/M)(n - (N-1)/2) + (-1)^i π/4 )
f_i[n] = 2 h[n] cos( (i + 1/2)(π/M)(n - (N-1)/2) - (-1)^i π/4 )
Convergence properties of the subband prediction algorithm
- The stability region for mean-square convergence of the LMS algorithm is 0 < µ < 2/λ_max, where λ_max is the maximum eigenvalue of the correlation matrix R of the input signal.
- The performance of an adaptive system employing an LMS-like algorithm depends on the eigenvalue spread: χ(R) = λ_max/λ_min ≤ S_max/S_min.
- The convergence does not take place uniformly; its speed depends on the largest time constant, τ_max = 2/(µ λ_min).
[Figure: input power spectrum S(e^{jω}) with its extrema S_max and S_min]
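The eigenvalue spread χ(R) can be made concrete by comparing a white input with a strongly coloured one; the AR(1)-style autocorrelation r[k] = 0.95^k is an illustrative choice:

```python
import numpy as np

def corr_matrix(r):
    """Toeplitz correlation matrix built from the autocorrelation sequence r."""
    n = len(r)
    return np.array([[r[abs(i - j)] for j in range(n)] for i in range(n)])

def spread(R):
    """Eigenvalue spread chi(R) = lambda_max / lambda_min."""
    lam = np.linalg.eigvalsh(R)        # ascending eigenvalues
    return lam[-1] / lam[0]

M = 8
white = corr_matrix([1.0] + [0.0] * (M - 1))          # white input: chi = 1
colored = corr_matrix([0.95 ** k for k in range(M)])  # coloured input
print(spread(white), spread(colored))
```

The coloured input has a spread orders of magnitude above 1, hence slow, non-uniform LMS convergence; splitting it into narrow subbands flattens each S_k(e^{jω}) and brings the per-band spread back toward 1, which is the point of the next slide.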
Convergence properties of the subband prediction algorithm
For a subband system: max_k S_k^M ≤ S_max and min_k S_k^M ≥ S_min, hence max_k λ_k^M ≤ λ_max and min_k λ_k^M ≥ λ_min.
For a subband algorithm we therefore have a reduced eigenvalue spread, χ(R_S) ≤ χ(R): the convergence is more uniform, with better and faster convergence performance.
[Figure: subband power spectrum S_k(e^{jω}) with its extrema]
[Figure: example of reconstruction of 2000 samples (45 ms) by octave filter bank and spline neural networks; original and reconstructed signals, amplitude vs. sample index]
MAXERR and MSE vs. reconstructed signal length
[Figure: maximum error and MSE [dB] vs. reconstructed length [ms] for UFB+LP, UFB+ASNN, OFB+LP and OFB+ASNN; the octave filter bank with ASNN gives the best performance]
Subjective opinion vs. reconstructed signal length
[Figure: subjective score (2.5 to 5) vs. reconstructed length (10 to 120 ms) for UFB+LP, UFB+ASNN, OFB+LP and OFB+ASNN; again the octave filter bank with ASNN performs best]
Experimental results
[Audio examples: signal with missing samples vs. signal with recovered samples]
Non-linear models for sound synthesis
Natural extension of the non-linear distortion synthesis technique (dynamic non-linearity).
[Figure: feed-forward scheme: the signal input and a parametric control feed a DMLP-ASNN, producing the signal output]
Neural networks can represent a generalization of several digital sound synthesis techniques.
E.g. a non-linear musical oscillator (lumped circuit model)
A fairly idealized general musical oscillator: non-linear function + linear filter.
- e(t) = f(y(t), x_E(t)) = excitation signal;
- x_E(t) = external control.
[Figure: an energy source feeds the non-linear active element f(.); its output e(t) drives the linear passive resonator, which produces y(t)]
Example of a single-reed instrument
[Figure: the mouth pressure p_0[n] drives the reed; the travelling pressure waves p+[n] and p-[n] propagate along the bore, through the tone-hole lattice, to the bell]
- Reed: external excitation.
- Bore: resonator.
- Bell: acoustic impedance adaptation.
- Tone holes: finger control.
Single-reed instrument
[Figure: the complex Non-Linear Excitation Mechanism (NLEM) terminates a resonator implemented by a digital delay line; the bell is a simple linear two-port filter (reflection above about 1.5 kHz); the waves p+[n] and p-[n] travel along the bore and tone-hole lattice, driven by p_0[n]]
Single reed: Non-Linear Excitation Mechanism (NLEM)
- A vibrating reed is a pressure-controlled valve with two possible configurations: blown closed and blown open.
- The reed vibrations control the air flow through the embouchure.
- The force on the reed equals S_r p_Δ, where S_r is the effective reed surface and p_Δ = p_oc - p_r.
[Figure: oral cavity (u_oc, p_oc), lips, reed and mouthpiece chamber (u_r, p_r); reed flow characteristic u_r/u_r0 vs. p_r/p_C, with the point p_Δ = 0 marked]
Single reed model
The reed can be considered a non-linear mechanical oscillator:
m_r [ d²x/dt² + µ ω_r (dx/dt) + ω_r² (x - x_0) ] = g( p_Δ(t), U(t) )
where:
- m_r is the reed mass;
- µ is the damping factor;
- ω_r is the mechanical resonance frequency;
- U(t) is the flow through the reed embouchure;
- g(·,·) is a non-linear function.
Single-reed instrument: null-mass physical model [J.O. Smith 1993]
[Figure: waveguide model: the mouth pressure p_0[n] and an offset embouchure feed a non-linear memory-less function (LUT); the travelling waves p+[n] and p-[n] circulate through the bore delay lines (LTD), the reed reflection filter R(z) and the bell filter B(z)]
R(z) = (1 + a(t)) / (1 + a(t) z^-1),  a(t) = v(t) - 0.642,  v(t) = A_v sin(2π f_v t)
- R(z): linear reflection filter;
- A_v: vibrato amplitude (e.g. 0.03);
- f_v: vibrato frequency (e.g. 5 Hz);
- B(z): high-pass filter (order 1), about 1.5 kHz.
Proposal: a more general framework
General framework for physical-like model synthesis, with learning capabilities.
[Figure: M excitation inputs with gains g_1[n] ... g_M[n] and a trigger feed an ASNN non-linearity, closed in a loop with a length-N delay line and a loop filter]
Single-reed null-mass physical-like model
[Figure: the breath pressure enters a GS (spline) neuron with weights w_0, w_1; the loop contains the delay line, a sign inversion (-1), the loop filter H(z) and an output gain]
H(z) = 0.5 (1 - z^-1)
Learning of clarinet sound.
Single-reed physical-like model: one time-delay spline neuron
[Figure: the breath pressure feeds a spline activation function with adaptable control points; the neuron combines a FIR part (weights w_0 ... w_4 with unit delays) and an IIR feedback part (weights v_1, v_2) around the filter H(z), driving the delay line]
H(z) = (a_2 + a_1 z^-1 + z^-2) / (1 + a_1 z^-1 + a_2 z^-2),  H_1(z) = 0.5 (1 - z^-1)
Preliminary results: sax sound
Learning of clarinet and sax sounds with the null-mass model.
[Figure: the reed non-linearity with reflection filter R(z) is coupled to three delay lines through three-port scattering junctions, modelling the sax bore]
"Chaotic sound" behaviour.
Preliminary results: flute sound
[Figure: the breath pressure enters the embouchure; a spline activation function with adaptable control points and the filter H(z) drive the bore delay line, with a feedback delay z^-m]
Only the spline parameters of the non-linear function are adapted.
Preliminary results: trumpet sound
[Figure: the breath pressure feeds a non-linearity with weights w_0, w_1 and feedback weights v_1, v_2; a lip filter and a gain α close the loop around the delay line]
Adaptable parameters: α, w_0, w_1, v_1, v_2.
Another approach: additive synthesis controlled by an ANN (1)
The amplitudes and frequencies (a_k, f_k) of a bank of oscillators are controlled by a neural network.
[Figure: a neural controller (MLP of sigmoidal units) maps the performance controls (loudness, F0, timbre parameters p_1 ... p_N) to the oscillator amplitudes and frequencies (a_1, f_1) ... (a_N, f_N); the oscillator outputs are summed by the additive synthesizer]
[Audio examples: solo sax, solo oct2, vowels A-O-E-U-I, suling; sounds reproduced from recordings]
(1) D. Wessel, C. Drame, M. Wright, CNMAT, Berkeley.
Audio signal separation: blind separation of linear mixtures
x(t) = A · s(t)
[Figure: N independent sources s_1(t) ... s_N(t) are mixed by the unknown mixing matrix A into the observed signals x_1(t) ... x_N(t); the un-mixing matrix W produces u_1(t) ... u_N(t), followed by the flexible activation functions g_1(u_1) ... g_N(u_N) of the neural network, giving the outputs y_1(t) ... y_N(t)]
Problem: estimate the un-mixing matrix W such that u(t) = s(t) (several algorithms exist).
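One of the "several algorithms" alluded to here is the natural-gradient rule of Amari and co-workers, W ← W + η (I - φ(u) uᵀ) W. A minimal sketch of a single on-line step, with a fixed tanh score function standing in for the flexible spline non-linearities of the slides:

```python
import numpy as np

def natural_gradient_step(W, x, eta=0.01):
    """One on-line natural-gradient ICA step: W <- W + eta*(I - phi(u) u^T) W."""
    u = W @ x                        # current source estimates
    phi = np.tanh(u)                 # score (activation) function
    I = np.eye(len(u))
    return W + eta * (I - np.outer(phi, u)) @ W

W = np.eye(2)                        # initial un-mixing matrix
x = np.array([0.5, -1.0])            # one observed sample
W = natural_gradient_step(W, x)
```

Iterated over the observed samples, the update drives the outputs u toward statistical independence without ever inverting the unknown matrix A explicitly.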
Preliminary: signal separation of post non-linear mixtures
Post non-linear mixture model:
x_i(t) = f_i( Σ_{j=1}^{N} a_(i,j) s_j(t) ),  i = 1, 2, ..., M
[Figure: the sources s(t) are mixed by A into u(t), then distorted channel-wise by f_1(.) ... f_n(.) into x(t); the separation structure applies flexible spline activation functions g_1(.) ... g_n(.) to invert the distortions, followed by the un-mixing matrix W, giving v(t) and the outputs y(t)]
Preliminary: signal separation of non-linear mixtures
Non-linear mixing model:
x_k(t) = Σ_{i=1}^{N} b_(k,i) f_i( Σ_{j=1}^{N} a_(i,j) s_j(t) ),  k = 1, 2, ..., M
[Figure: the sources s(t) are mixed by A, distorted by f(.), and mixed again by B into x(t); the mirror separation structure applies the matrix Z, the flexible functions g(.) and the un-mixing matrix W to obtain the outputs y(t)]
Preliminary results: signal separation of a post non-linear mixture
The post non-linear distortions used in the experiment:
x_1(t) = tanh( u_1(t) )
x_2(t) = tanh( u_2(t) )
x_3(t) = 0.8 u_3(t) + u_3^3(t)
x_4(t) = tanh( 0.9 u_4(t) )
with mixing matrix
A = [  0.2811   0.2926  -0.1364   0.4080
      -0.4608  -0.1232  -0.2025   0.2492
       0.3908  -0.3373  -0.3908  -0.2917
      -0.4621  -0.4660  -0.4438  -0.3846 ]
[Figure: the four mixed inputs x_1(t) ... x_4(t) and the four de-mixed outputs y_1(t) ... y_4(t)]
Some conclusions
- Dynamic neural networks represent a new class of non-linear DSP algorithms.
- Fast architectures allow real-time applications at high throughput rates.
- Learning algorithms ensure consistent design methods.
- There is a huge field of applications in audio/sound processing.
- Neural networks provide a general framework for non-linear digital sound analysis/synthesis methods.