
Independent Component Analysis &

Blind Source Separation


Ata Kaban
The University of Birmingham

Overview
Today we learn about:
  the cocktail party problem, also called blind source separation (BSS)
  Independent Component Analysis (ICA) for solving BSS
  other applications of ICA / BSS
all at an intuitive, introductory and practical level

A bit like … in the sense of having to find quantities that are not observable directly.

Signals, joint density

[Figure: two signals s1(t), s2(t) plotted as amplitude over time, together with their joint density and its marginal densities.]
Original signals (hidden sources)

[Figure: the four hidden source signals s1(t), s2(t), s3(t), s4(t), t = 1:T.]

The ICA model

[Diagram: four sources s1..s4 feed four observed mixtures x1..x4 through mixing weights aij.]

xi(t) = ai1*s1(t) + ai2*s2(t) + ai3*s3(t) + ai4*s4(t), for i = 1:4.

In vector-matrix notation, and dropping the index t, this is x = A*s.

This is what the microphones record: a linear mixture of the sources.
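To make the mixing model concrete, here is a minimal MatLab sketch with four toy sources (the sinusoid, square wave, sawtooth and noise are illustrative choices, not from the slides):

  % Hypothetical toy sources: each row of S is one source signal si(t).
  T = 1000;  t = 1:T;
  S = [sin(0.02*t);              % s1: slow sinusoid
       sign(sin(0.05*t));        % s2: square wave
       mod(t, 50)/50 - 0.5;      % s3: sawtooth, roughly centred
       randn(1, T)];             % s4: Gaussian noise
  A = rand(4);                   % unknown 4-by-4 mixing matrix
  X = A * S;                     % x = A*s: row i is xi(t) = sum_j aij*sj(t)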

The cocktail party problem

Also called the Blind Source Separation (BSS) problem.
An ill-posed problem, unless assumptions are made!
The most common assumption is that the source signals are statistically independent: knowing the value of one of them gives no information about the others.
Methods based on this assumption are called Independent Component Analysis (ICA) methods. These are statistical techniques for decomposing a complex data set into independent parts.

It can be shown that, under some reasonable conditions, if the ICA assumption holds then the source signals can be recovered up to permutation and scaling.
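To see what recovery "up to permutation and scaling" means in practice, one can cross-correlate the true and the recovered sources (S and S_hat here are hypothetical 4-by-T matrices, not names from the slides); a good separation yields one entry near +/-1 in each row and column:

  % Columns of the corrcoef argument are variables, hence the transposes.
  R = corrcoef([S', S_hat']);
  cross = abs(R(1:4, 5:8));    % |correlation| of each true vs. recovered source
  disp(cross)                  % ideally close to a permutation matrix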

Determine the source signals, given only the mixtures

[Figure: the recovered signals.]

Some further considerations

If we knew the mixing parameters aij, we would just need to solve a linear system of equations.
But we know neither aij nor si.
ICA was initially developed to deal with problems closely related to the cocktail party problem.
Later it became evident that ICA has many other applications too, e.g. recovering underlying components of brain activity from electrical recordings taken at different locations of the scalp (EEG signals).

Illustration of ICA with 2 signals

[Scatter plots: the original sources (s1, s2) and the mixed signals (x1, x2), with the mixing directions a1, a2 overlaid.]

x1(t) = a11*s1(t) + a12*s2(t)
x2(t) = a21*s1(t) + a22*s2(t),   t = 1:T

The separation is then done in two steps:

Step 1: Sphering
Step 2: Rotation

Excluded case

There is one case in which rotation doesn't matter: when both densities are Gaussian. This case cannot be solved by basic ICA.

[Figure: example of a non-Gaussian density (solid) vs. a Gaussian density (dash-dotted).]

Seek non-Gaussian sources, for two reasons:

* identifiability
* interestingness: Gaussians are not interesting, since the superposition of independent sources tends towards Gaussian (the central limit theorem)
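A standard quantitative measure of non-Gaussianity (one common choice; the slides do not fix a particular measure) is excess kurtosis, which is zero for a Gaussian:

  % Excess kurtosis E[x^4]/E[x^2]^2 - 3 as a simple non-Gaussianity score.
  kurt = @(x) mean(x.^4) / mean(x.^2)^2 - 3;
  n = 1e5;
  kurt(randn(1, n))                            % Gaussian: approx. 0
  kurt(rand(1, n) - 0.5)                       % uniform, sub-Gaussian: approx. -1.2
  kurt(sign(randn(1, n)) .* -log(rand(1, n)))  % Laplacian, super-Gaussian: approx. +3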

Computing the pre-processing steps for ICA

0) Centring = make the signals centred at zero:
   xi <- xi - E[xi], for each i
1) Sphering = make the signals uncorrelated, i.e. apply a transform V to x such that Cov(Vx) = I, where Cov(y) = E[y y^T] denotes the covariance matrix:
   V = E[x x^T]^(-1/2)   // can be computed using the sqrtm function in MatLab
   x <- V x              // for all t (the index t is dropped here)
// bold lowercase refers to a column vector; bold uppercase to a matrix
Purpose: to make the remaining computations simpler. Independent variables must be uncorrelated, so this requirement can be fulfilled before proceeding to the full ICA.

Aapo Hyvarinen (97)
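A minimal MatLab sketch of these two steps, assuming the mixed signals are stored row-wise in a d-by-T matrix X (implicit expansion, R2016b+, is used for the subtraction):

  X = X - mean(X, 2);          % 0) centring: subtract each signal's mean
  C = (X * X') / size(X, 2);   % sample covariance E[x x^T]
  V = inv(sqrtm(C));           % 1) sphering transform V = E[x x^T]^(-1/2)
  X = V * X;                   % now Cov(X) = I, up to sampling error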

Computing the rotation step

This is based on the maximisation of an objective function G(.) which contains an approximate non-Gaussianity measure:

Obj(W) = sum_{t=1:T} G(W^T x_t) + lambda (W^T W - I)

dObj/dW = X g(W^T X)^T + lambda W = 0

where g(.) is the derivative of G(.), W is the rotation transform sought, and lambda is a Lagrange multiplier enforcing that W is an orthogonal transform, i.e. a rotation.
Solve by fixed point iterations.
The effect of lambda is an orthogonal de-correlation.

Fixed Point Algorithm

Input: X
Random initialisation of W
Iterate until convergence:
  S <- W^T X
  W <- X g(S)^T
  W <- W (W^T W)^(-1/2)
Output: W, S

The overall transform that takes X back to S is then (W^T V).
There are several options for g(.); each will work best in special cases. See the FastICA software / tutorial for details.
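A minimal sketch of this loop, assuming X has already been centred and sphered, and taking g = tanh as one of the usual choices of non-linearity (the actual FastICA package adds refinements, e.g. a -E[g'(s)]*w term in the update and a proper convergence test):

  [d, T] = size(X);
  W = orth(randn(d));              % random orthogonal initialisation
  for iter = 1:100                 % fixed budget stands in for a convergence test
      S = W' * X;                  % S <- W^T X: current source estimates
      W = X * tanh(S)' / T;        % W <- X g(S)^T, with g = tanh
      W = W * inv(sqrtm(W' * W));  % W <- W (W^T W)^(-1/2): orthogonal de-correlation
  end
  S = W' * X;                      % recovered sources; overall transform is W^T V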

Application domains of ICA


Blind source separation (Bell & Sejnowski, Te-Won Lee,
Girolami, Hyvarinen, etc.)
Image denoising (Hyvarinen)
Medical signal processing: fMRI, ECG, EEG (Makeig)
Modelling of the hippocampus and visual cortex (Lorincz,
Hyvarinen)
Feature extraction, face recognition (Marni Bartlett)
Compression, redundancy reduction
Watermarking (D Lowe)
Clustering (Girolami, Kolenda)
Time series analysis (Back, Valpola)
Topic extraction (Kolenda, Bingham, Kaban)
Scientific Data Mining (Kaban, etc)

Image denoising

[Figure: original image, noisy image, Wiener-filtered result, ICA-filtered result.]

Clustering

In multivariate data, search for the direction along which the projection of the data is maximally non-Gaussian, i.e. has the most structure.
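In two dimensions this search can be done by brute force; a hedged sketch, assuming centred data in a 2-by-T matrix X and using absolute excess kurtosis as the structure score:

  thetas = linspace(0, pi, 180);   % candidate directions (sign is irrelevant)
  score = zeros(size(thetas));
  for k = 1:numel(thetas)
      w = [cos(thetas(k)); sin(thetas(k))];           % unit direction
      p = w' * X;                                     % projection of the data
      score(k) = abs(mean(p.^4) / mean(p.^2)^2 - 3);  % |excess kurtosis|
  end
  [~, best] = max(score);
  w_best = [cos(thetas(best)); sin(thetas(best))]     % most non-Gaussian direction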

Blind Separation of Information from Galaxy Spectra

[Figure: a galaxy spectrum; decomposition using physical models vs. decomposition using ICA.]

Summing Up
Assumption: the data consists of unknown components
  individual signals in a mix
  topics in a text corpus
  basis-galaxies
We try to solve the inverse problem:
  observe the superposition only
  recover the components
The components often give a simpler, clearer view of the data.

Related resources
http://www.cis.hut.fi/projects/ica/cocktail/cocktail_en.cgi
Demo and links to further info on ICA.
http://www.cis.hut.fi/projects/ica/fastica/code/dlcode.shtml
ICA software in MatLab.
http://www.cs.helsinki.fi/u/ahyvarin/papers/NN00new.pdf
Comprehensive tutorial paper, slightly more technical.
