(gunnar carlsson, bulletin of the ams, 2009) dino sejdinovicsejdinov/talks/pdf/tda_mljc.pdfdino...

102

Upload: others

Post on 25-Jul-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Topology and data(Gunnar Carlsson, Bulletin of the AMS, 2009)

Dino Sejdinovic

Gatsby Unit MLJC

October 24, 2012

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 1 / 43

Page 2: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

too many dimensions, too little time

dimensionality reduction / visualization

the meaning of coordinates / reparametrization? (is the intrinsicmeaning of coordinates justi�ed in your data?)

what metric to use? e.g., distance between two DNA sequences? whatis the signi�cance of a measured distance?

sometimes, only to re�ect the intuitive notion of similarity: nearby datapoints are similar, far apart data points are di�erentwe do not trust large distances (genomic sequences di�ering by100/150 entries?)we trust small distances only a little bit (strength of similarity asencoded by the distance may not be signi�cant)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 2 / 43

Page 3: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

too many dimensions, too little time

dimensionality reduction / visualization

the meaning of coordinates / reparametrization? (is the intrinsicmeaning of coordinates justi�ed in your data?)

what metric to use? e.g., distance between two DNA sequences? whatis the signi�cance of a measured distance?

sometimes, only to re�ect the intuitive notion of similarity: nearby datapoints are similar, far apart data points are di�erentwe do not trust large distances (genomic sequences di�ering by100/150 entries?)we trust small distances only a little bit (strength of similarity asencoded by the distance may not be signi�cant)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 2 / 43

Page 4: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

too many dimensions, too little time

dimensionality reduction / visualization

the meaning of coordinates / reparametrization? (is the intrinsicmeaning of coordinates justi�ed in your data?)

what metric to use? e.g., distance between two DNA sequences? whatis the signi�cance of a measured distance?

sometimes, only to re�ect the intuitive notion of similarity: nearby datapoints are similar, far apart data points are di�erentwe do not trust large distances (genomic sequences di�ering by100/150 entries?)we trust small distances only a little bit (strength of similarity asencoded by the distance may not be signi�cant)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 2 / 43

Page 5: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

too many dimensions, too little time

dimensionality reduction / visualization

the meaning of coordinates / reparametrization? (is the intrinsicmeaning of coordinates justi�ed in your data?)

what metric to use? e.g., distance between two DNA sequences? whatis the signi�cance of a measured distance?

sometimes, only to re�ect the intuitive notion of similarity: nearby datapoints are similar, far apart data points are di�erentwe do not trust large distances (genomic sequences di�ering by100/150 entries?)we trust small distances only a little bit (strength of similarity asencoded by the distance may not be signi�cant)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 2 / 43

Page 6: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

too many dimensions, too little time

dimensionality reduction / visualization

the meaning of coordinates / reparametrization? (is the intrinsicmeaning of coordinates justi�ed in your data?)

what metric to use? e.g., distance between two DNA sequences? whatis the signi�cance of a measured distance?

sometimes, only to re�ect the intuitive notion of similarity: nearby datapoints are similar, far apart data points are di�erent

we do not trust large distances (genomic sequences di�ering by100/150 entries?)we trust small distances only a little bit (strength of similarity asencoded by the distance may not be signi�cant)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 2 / 43

Page 7: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

too many dimensions, too little time

dimensionality reduction / visualization

the meaning of coordinates / reparametrization? (is the intrinsicmeaning of coordinates justi�ed in your data?)

what metric to use? e.g., distance between two DNA sequences? whatis the signi�cance of a measured distance?

sometimes, only to re�ect the intuitive notion of similarity: nearby datapoints are similar, far apart data points are di�erentwe do not trust large distances (genomic sequences di�ering by100/150 entries?)

we trust small distances only a little bit (strength of similarity asencoded by the distance may not be signi�cant)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 2 / 43

Page 8: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

too many dimensions, too little time

dimensionality reduction / visualization

the meaning of coordinates / reparametrization? (is the intrinsicmeaning of coordinates justi�ed in your data?)

what metric to use? e.g., distance between two DNA sequences? whatis the signi�cance of a measured distance?

sometimes, only to re�ect the intuitive notion of similarity: nearby datapoints are similar, far apart data points are di�erentwe do not trust large distances (genomic sequences di�ering by100/150 entries?)we trust small distances only a little bit (strength of similarity asencoded by the distance may not be signi�cant)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 2 / 43

Page 9: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

what is then the meaning of very re�ned notions we obtain from such�rough� distance notion, say curvature?

asking qualitative (unsupervised) questions about data?

properties robust to changes in metrics?the study of idealized versions of such properties: topology

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 3 / 43

Page 10: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

what is then the meaning of very re�ned notions we obtain from such�rough� distance notion, say curvature?

asking qualitative (unsupervised) questions about data?

properties robust to changes in metrics?the study of idealized versions of such properties: topology

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 3 / 43

Page 11: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

what is then the meaning of very re�ned notions we obtain from such�rough� distance notion, say curvature?

asking qualitative (unsupervised) questions about data?

properties robust to changes in metrics?

the study of idealized versions of such properties: topology

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 3 / 43

Page 12: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

what is then the meaning of very re�ned notions we obtain from such�rough� distance notion, say curvature?

asking qualitative (unsupervised) questions about data?

properties robust to changes in metrics?the study of idealized versions of such properties: topology

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 3 / 43

Page 13: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

geometry studies metrics. topology studies what remains after onestretches and deforms without tearing it

replace the quantitative values fo distance functions with the notion ofin�nite nearness (i.e., metric understood in a coarse way only): whatremains is �connectivity information� of your data

connected components / clusters: zeroth order topological information

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 4 / 43

Page 14: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

geometry studies metrics. topology studies what remains after onestretches and deforms without tearing it

replace the quantitative values fo distance functions with the notion ofin�nite nearness (i.e., metric understood in a coarse way only): whatremains is �connectivity information� of your data

connected components / clusters: zeroth order topological information

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 4 / 43

Page 15: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Introduction

geometry studies metrics. topology studies what remains after onestretches and deforms without tearing it

replace the quantitative values fo distance functions with the notion ofin�nite nearness (i.e., metric understood in a coarse way only): whatremains is �connectivity information� of your data

connected components / clusters: zeroth order topological information

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 4 / 43

Page 16: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy

connectivity: x , y ∈ X , say x ∼ y i� ∃ continuous map f : [0, 1]→ X ,such that f (0) = x , f (1) = y

also, equivalence classes of maps: f ∼ g i� ∃ continuous mapF : [0, 1]2 → X , such that F (t, 0) = f (t), F (t, 1) = g(t)

In general, f , g : Y → X , and F : Y × [0, 1]→ X . f and g are said tobe homotopic

X and Y are homotopy equivalent if there are f : X → Y andg : Y → X , s.t. f ◦ g is homotopic to idY and g ◦ f is homotopic toidX

Every homeomorphism is a homotopy equivalence, but the converse isnot true

click me

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 5 / 43

Page 17: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy

connectivity: x , y ∈ X , say x ∼ y i� ∃ continuous map f : [0, 1]→ X ,such that f (0) = x , f (1) = y

also, equivalence classes of maps: f ∼ g i� ∃ continuous mapF : [0, 1]2 → X , such that F (t, 0) = f (t), F (t, 1) = g(t)

In general, f , g : Y → X , and F : Y × [0, 1]→ X . f and g are said tobe homotopic

X and Y are homotopy equivalent if there are f : X → Y andg : Y → X , s.t. f ◦ g is homotopic to idY and g ◦ f is homotopic toidX

Every homeomorphism is a homotopy equivalence, but the converse isnot true

click me

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 5 / 43

Page 18: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy

connectivity: x , y ∈ X , say x ∼ y i� ∃ continuous map f : [0, 1]→ X ,such that f (0) = x , f (1) = y

also, equivalence classes of maps: f ∼ g i� ∃ continuous mapF : [0, 1]2 → X , such that F (t, 0) = f (t), F (t, 1) = g(t)

In general, f , g : Y → X , and F : Y × [0, 1]→ X . f and g are said tobe homotopic

X and Y are homotopy equivalent if there are f : X → Y andg : Y → X , s.t. f ◦ g is homotopic to idY and g ◦ f is homotopic toidX

Every homeomorphism is a homotopy equivalence, but the converse isnot true

click me

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 5 / 43

Page 19: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy

connectivity: x , y ∈ X , say x ∼ y i� ∃ continuous map f : [0, 1]→ X ,such that f (0) = x , f (1) = y

also, equivalence classes of maps: f ∼ g i� ∃ continuous mapF : [0, 1]2 → X , such that F (t, 0) = f (t), F (t, 1) = g(t)

In general, f , g : Y → X , and F : Y × [0, 1]→ X . f and g are said tobe homotopic

X and Y are homotopy equivalent if there are f : X → Y andg : Y → X , s.t. f ◦ g is homotopic to idY and g ◦ f is homotopic toidX

Every homeomorphism is a homotopy equivalence, but the converse isnot true

click me

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 5 / 43

Page 20: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy

connectivity: x , y ∈ X , say x ∼ y i� ∃ continuous map f : [0, 1]→ X ,such that f (0) = x , f (1) = y

also, equivalence classes of maps: f ∼ g i� ∃ continuous mapF : [0, 1]2 → X , such that F (t, 0) = f (t), F (t, 1) = g(t)

In general, f , g : Y → X , and F : Y × [0, 1]→ X . f and g are said tobe homotopic

X and Y are homotopy equivalent if there are f : X → Y andg : Y → X , s.t. f ◦ g is homotopic to idY and g ◦ f is homotopic toidX

Every homeomorphism is a homotopy equivalence, but the converse isnot true

click me

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 5 / 43

Page 21: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy

connectivity: x , y ∈ X , say x ∼ y i� ∃ continuous map f : [0, 1]→ X ,such that f (0) = x , f (1) = y

also, equivalence classes of maps: f ∼ g i� ∃ continuous mapF : [0, 1]2 → X , such that F (t, 0) = f (t), F (t, 1) = g(t)

In general, f , g : Y → X , and F : Y × [0, 1]→ X . f and g are said tobe homotopic

X and Y are homotopy equivalent if there are f : X → Y andg : Y → X , s.t. f ◦ g is homotopic to idY and g ◦ f is homotopic toidX

Every homeomorphism is a homotopy equivalence, but the converse isnot true

click me

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 5 / 43

Page 22: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy groups

n-th order topological information: homotopy classes of equivalence ofcontinuous maps f from the n-dimensional sphere Sn to X s.t.f (s) = x

b

x

Classes of equivalence form a group structure πn(X ); for n = 1,fundamental group, e.g., π1(Rn) = {0}, π1(Rn\{0}) = π1(S1) = Z.

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 6 / 43

Page 23: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy groups

n-th order topological information: homotopy classes of equivalence ofcontinuous maps f from the n-dimensional sphere Sn to X s.t.f (s) = x

b

x

Classes of equivalence form a group structure πn(X ); for n = 1,fundamental group, e.g., π1(Rn) = {0}, π1(Rn\{0}) = π1(S1) = Z.

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 6 / 43

Page 24: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homotopy groups

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 7 / 43

Page 25: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology

higher-dimensional homotopy groups extremely di�cult to compute,even πn(S i ) for n > i is a di�cult problem / Hopf �bration

a friendlier alternative: homology groups, with an extendedequivalence relation

e.g., two loops are equivalent if there is a surface with boundary equalto the di�erence of two loops

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 8 / 43

Page 26: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology

higher-dimensional homotopy groups extremely di�cult to compute,even πn(S i ) for n > i is a di�cult problem / Hopf �bration

a friendlier alternative: homology groups, with an extendedequivalence relation

e.g., two loops are equivalent if there is a surface with boundary equalto the di�erence of two loops

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 8 / 43

Page 27: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology

higher-dimensional homotopy groups extremely di�cult to compute,even πn(S i ) for n > i is a di�cult problem / Hopf �bration

a friendlier alternative: homology groups, with an extendedequivalence relation

e.g., two loops are equivalent if there is a surface with boundary equalto the di�erence of two loops

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 8 / 43

Page 28: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Simplices and chains

A 0-simplex is a point [i ]

A 1-simplex is an edge [ij ]

An n-simplex is a hyperedge σ = [i0 . . . in]. A face of σ is an(n − 1)-dimensional simplex [i0 . . . ij−1ij+1 . . . in] =

[i\j]

An n-chain c is a formal sum of n-simplices, e.g.,[12] + [23] + [34] ∈ C1 (may occur with a multiplicity or with anopposite orientation - winding numbers):

c =∑k

αkσk , αk ∈ A, σk ∈ Sn

A is an abelian group (such as Z,Fp); Sn is a �nite set of n-simplices

The set of all n-chains is denoted Cn; (Cn,+) forms a free abeliangroup: c + c ′ =

∑(αk + α′k

)σk (abelian group with a �basis�)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 9 / 43

Page 29: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Simplices and chains

A 0-simplex is a point [i ]

A 1-simplex is an edge [ij ]

An n-simplex is a hyperedge σ = [i0 . . . in]. A face of σ is an(n − 1)-dimensional simplex [i0 . . . ij−1ij+1 . . . in] =

[i\j]

An n-chain c is a formal sum of n-simplices, e.g.,[12] + [23] + [34] ∈ C1 (may occur with a multiplicity or with anopposite orientation - winding numbers):

c =∑k

αkσk , αk ∈ A, σk ∈ Sn

A is an abelian group (such as Z,Fp); Sn is a �nite set of n-simplices

The set of all n-chains is denoted Cn; (Cn,+) forms a free abeliangroup: c + c ′ =

∑(αk + α′k

)σk (abelian group with a �basis�)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 9 / 43

Page 30: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Simplices and chains

A 0-simplex is a point [i ]

A 1-simplex is an edge [ij ]

An n-simplex is a hyperedge σ = [i0 . . . in]. A face of σ is an(n − 1)-dimensional simplex [i0 . . . ij−1ij+1 . . . in] =

[i\j]

An n-chain c is a formal sum of n-simplices, e.g.,[12] + [23] + [34] ∈ C1 (may occur with a multiplicity or with anopposite orientation - winding numbers):

c =∑k

αkσk , αk ∈ A, σk ∈ Sn

A is an abelian group (such as Z,Fp); Sn is a �nite set of n-simplices

The set of all n-chains is denoted Cn; (Cn,+) forms a free abeliangroup: c + c ′ =

∑(αk + α′k

)σk (abelian group with a �basis�)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 9 / 43

Page 31: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Simplices and chains

A 0-simplex is a point [i ]

A 1-simplex is an edge [ij ]

An n-simplex is a hyperedge σ = [i0 . . . in]. A face of σ is an(n − 1)-dimensional simplex [i0 . . . ij−1ij+1 . . . in] =

[i\j]

An n-chain c is a formal sum of n-simplices, e.g.,[12] + [23] + [34] ∈ C1 (may occur with a multiplicity or with anopposite orientation - winding numbers):

c =∑k

αkσk , αk ∈ A, σk ∈ Sn

A is an abelian group (such as Z,Fp); Sn is a �nite set of n-simplices

The set of all n-chains is denoted Cn; (Cn,+) forms a free abeliangroup: c + c ′ =

∑(αk + α′k

)σk (abelian group with a �basis�)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 9 / 43

Page 32: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Simplices and chains

A 0-simplex is a point [i ]

A 1-simplex is an edge [ij ]

An n-simplex is a hyperedge σ = [i0 . . . in]. A face of σ is an(n − 1)-dimensional simplex [i0 . . . ij−1ij+1 . . . in] =

[i\j]

An n-chain c is a formal sum of n-simplices, e.g.,[12] + [23] + [34] ∈ C1 (may occur with a multiplicity or with anopposite orientation - winding numbers):

c =∑k

αkσk , αk ∈ A, σk ∈ Sn

A is an abelian group (such as Z,Fp); Sn is a �nite set of n-simplices

The set of all n-chains is denoted Cn; (Cn,+) forms a free abeliangroup: c + c ′ =

∑(αk + α′k

)σk (abelian group with a �basis�)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 9 / 43

Page 33: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Simplices and chains

A 0-simplex is a point [i ]

A 1-simplex is an edge [ij ]

An n-simplex is a hyperedge σ = [i0 . . . in]. A face of σ is an(n − 1)-dimensional simplex [i0 . . . ij−1ij+1 . . . in] =

[i\j]

An n-chain c is a formal sum of n-simplices, e.g.,[12] + [23] + [34] ∈ C1 (may occur with a multiplicity or with anopposite orientation - winding numbers):

c =∑k

αkσk , αk ∈ A, σk ∈ Sn

A is an abelian group (such as Z,Fp); Sn is a �nite set of n-simplices

The set of all n-chains is denoted Cn; (Cn,+) forms a free abeliangroup: c + c ′ =

∑(αk + α′k

)σk (abelian group with a �basis�)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 9 / 43

Page 34: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Boundary map

Simplicial complex is a collection C of simplices with a specialstructure:

σ ∈ C ⇒ any face of σ ∈ C

Boundary of an n-simplex is de�ned to be the sum of its faces:

∂n [i0 . . . in] =n−1∑j=0

(−1A)j[i\j]

Boundary of a general chain:

∂n∑k

αkσk =∑k

αk∂nσk

Boundary map ∂n : Cn → Cn−1 is a group homomorphism

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 10 / 43

Page 35: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Boundary map

Simplicial complex is a collection C of simplices with a specialstructure:

σ ∈ C ⇒ any face of σ ∈ C

Boundary of an n-simplex is de�ned to be the sum of its faces:

∂n [i0 . . . in] =n−1∑j=0

(−1A)j[i\j]

Boundary of a general chain:

∂n∑k

αkσk =∑k

αk∂nσk

Boundary map ∂n : Cn → Cn−1 is a group homomorphism

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 10 / 43

Page 36: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Boundary map

Simplicial complex is a collection C of simplices with a specialstructure:

σ ∈ C ⇒ any face of σ ∈ C

Boundary of an n-simplex is de�ned to be the sum of its faces:

∂n [i0 . . . in] =n−1∑j=0

(−1A)j[i\j]

Boundary of a general chain:

∂n∑k

αkσk =∑k

αk∂nσk

Boundary map ∂n : Cn → Cn−1 is a group homomorphism

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 10 / 43

Page 37: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Boundary map

Simplicial complex is a collection C of simplices with a specialstructure:

σ ∈ C ⇒ any face of σ ∈ C

Boundary of an n-simplex is de�ned to be the sum of its faces:

∂n [i0 . . . in] =n−1∑j=0

(−1A)j[i\j]

Boundary of a general chain:

∂n∑k

αkσk =∑k

αk∂nσk

Boundary map ∂n : Cn → Cn−1 is a group homomorphism

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 10 / 43

Page 38: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Boundary map

Example:

∂1 ([12] + [23] + [34]) = [2]− [1] + [3]− [2] + [4]− [3]

= [4]− [1].

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 11 / 43

Page 39: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Fundamental Lemma of Homology

The boundary of the boundary of a simplex is empty:

∂n∂n+1 [i0 . . . in+1] = ∂n

n+1∑j=0

(−1)j[i\j]

=∑j<l

[(−1)j+l−1 − (−1)l+j

] [i\l ,j]

= 0

Therefore, the boundary of the boundary of a chain is also empty, i.e.,∂n∂n+1Cn+1 ≡ 0 ⇒im∂n+1 ⊂ ker ∂n

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 12 / 43

Page 40: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Fundamental Lemma of Homology

The boundary of the boundary of a simplex is empty:

∂n∂n+1 [i0 . . . in+1] = ∂n

n+1∑j=0

(−1)j[i\j]

=∑j<l

[(−1)j+l−1 − (−1)l+j

] [i\l ,j]

= 0

Therefore, the boundary of the boundary of a chain is also empty, i.e.,∂n∂n+1Cn+1 ≡ 0 ⇒im∂n+1 ⊂ ker ∂n

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 12 / 43

Page 41: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Cycles and boundaries

An n-cycle is a chain with no boundary, e.g., [12] + [23] + [34] + [41].The set of cycles: Zn = ker ∂n is a subgroup of Cn

im∂n+1 ⊂ ker ∂n means that all boundaries of higher order chains arecycles

Some cycles (not all) are boundaries of higher order chains, e.g.,[23] + [31] + [12] = ∂2 [123]. The set of n-boundaries: Bn = im∂n+1

is a subgroup of Zn

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 13 / 43

Page 42: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Cycles and boundaries

An n-cycle is a chain with no boundary, e.g., [12] + [23] + [34] + [41].The set of cycles: Zn = ker ∂n is a subgroup of Cn

im∂n+1 ⊂ ker ∂n means that all boundaries of higher order chains arecycles

Some cycles (not all) are boundaries of higher order chains, e.g.,[23] + [31] + [12] = ∂2 [123]. The set of n-boundaries: Bn = im∂n+1

is a subgroup of Zn

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 13 / 43

Page 43: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Cycles and boundaries

An n-cycle is a chain with no boundary, e.g., [12] + [23] + [34] + [41].The set of cycles: Zn = ker ∂n is a subgroup of Cn

im∂n+1 ⊂ ker ∂n means that all boundaries of higher order chains arecycles

Some cycles (not all) are boundaries of higher order chains, e.g.,[23] + [31] + [12] = ∂2 [123]. The set of n-boundaries: Bn = im∂n+1

is a subgroup of Zn

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 13 / 43

Page 44: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Cycles and boundaries

n-th Homology group: Hn = Zn/Bn = ker ∂n/im∂n+1, i.e., it is afactor group of equivalence classes, given by:

z ∼= z ′ i� z ′ − z ∈ Bn

two cycles are equivalent (homologous) if they di�er by a boundary,say z1 = [12] + [23] + [34] + [41], z2 = [12] + [23] + [34] + [45] + [51],then:

z2 − z1 = [45] + [51] + [14]

= ∂2[145]

rank of Hn (roughly) counts the number of n-dimensional holes in thespace

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 14 / 43

Page 45: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Cycles and boundaries

n-th Homology group: Hn = Zn/Bn = ker ∂n/im∂n+1, i.e., it is afactor group of equivalence classes, given by:

z ∼= z ′ i� z ′ − z ∈ Bn

two cycles are equivalent (homologous) if they di�er by a boundary,say z1 = [12] + [23] + [34] + [41], z2 = [12] + [23] + [34] + [45] + [51],then:

z2 − z1 = [45] + [51] + [14]

= ∂2[145]

rank of Hn (roughly) counts the number of n-dimensional holes in thespace

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 14 / 43

Page 46: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Cycles and boundaries

n-th Homology group: Hn = Zn/Bn = ker ∂n/im∂n+1, i.e., it is afactor group of equivalence classes, given by:

z ∼= z ′ i� z ′ − z ∈ Bn

two cycles are equivalent (homologous) if they di�er by a boundary,say z1 = [12] + [23] + [34] + [41], z2 = [12] + [23] + [34] + [45] + [51],then:

z2 − z1 = [45] + [51] + [14]

= ∂2[145]

rank of Hn (roughly) counts the number of n-dimensional holes in thespace

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 14 / 43

Page 47: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Chains, cycles and boundaries

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 15 / 43

Page 48: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology groups

In general Hn = Hn(X ,A) is a free abelian group which depends onthe underlying topological space and the choice of the underlyingabelian group A of winding numbers in the de�nition of the chain

Functoriality: tranforming topological problems into algebraicproblems. If f : X → Y is continuous then there is an inducedhomomorphism Hn(f ,A) : Hn(X ,A)→ Hn(Y,A), with

Hn(idX ,A) = idHn(X ,A)Hn(f ,A) ◦ Hn(g ,A) = Hn(f ◦ g ,A)

If f and g are homotopic then Hn(f ,A) = Hn(g ,A), i.e., iftopological spaces X and Y are homotopy equivalent then

Hn(g ,A) ◦ Hn(f ,A) = Hn(g ◦ f ,A) = Hn(idX ,A) = idHn(X ,A), i.e.,their homology groups Hn(X ,A) and Hn(Y,A) are isomorphic

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 16 / 43

Page 49: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology groups

In general Hn = Hn(X ,A) is a free abelian group which depends onthe underlying topological space and the choice of the underlyingabelian group A of winding numbers in the de�nition of the chain

Functoriality: tranforming topological problems into algebraicproblems. If f : X → Y is continuous then there is an inducedhomomorphism Hn(f ,A) : Hn(X ,A)→ Hn(Y,A), with

Hn(idX ,A) = idHn(X ,A)Hn(f ,A) ◦ Hn(g ,A) = Hn(f ◦ g ,A)

If f and g are homotopic then Hn(f ,A) = Hn(g ,A), i.e., iftopological spaces X and Y are homotopy equivalent then

Hn(g ,A) ◦ Hn(f ,A) = Hn(g ◦ f ,A) = Hn(idX ,A) = idHn(X ,A), i.e.,their homology groups Hn(X ,A) and Hn(Y,A) are isomorphic

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 16 / 43

Page 50: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology groups

In general Hn = Hn(X ,A) is a free abelian group which depends onthe underlying topological space and the choice of the underlyingabelian group A of winding numbers in the de�nition of the chain

Functoriality: tranforming topological problems into algebraicproblems. If f : X → Y is continuous then there is an inducedhomomorphism Hn(f ,A) : Hn(X ,A)→ Hn(Y,A), with

Hn(idX ,A) = idHn(X ,A)Hn(f ,A) ◦ Hn(g ,A) = Hn(f ◦ g ,A)

If f and g are homotopic then Hn(f ,A) = Hn(g ,A), i.e., iftopological spaces X and Y are homotopy equivalent then

Hn(g ,A) ◦ Hn(f ,A) = Hn(g ◦ f ,A) = Hn(idX ,A) = idHn(X ,A), i.e.,their homology groups Hn(X ,A) and Hn(Y,A) are isomorphic

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 16 / 43

Page 51: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology vector spaces

If underlying group of winding numbers is a �eld A = F , thenHn(X ,F ) is a vector space over F

βn(X ,F ) = dimHn(X ,F ) is called the n-th Betti number of X w.r.t.F

If two spaces are homotopy equivalent, then all their Betti numbersare equal

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 17 / 43

Page 52: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology vector spaces

If underlying group of winding numbers is a �eld A = F , thenHn(X ,F ) is a vector space over F

βn(X ,F ) = dimHn(X ,F ) is called the n-th Betti number of X w.r.t.F

If two spaces are homotopy equivalent, then all their Betti numbersare equal

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 17 / 43

Page 53: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology vector spaces

If underlying group of winding numbers is a �eld A = F , thenHn(X ,F ) is a vector space over F

βn(X ,F ) = dimHn(X ,F ) is called the n-th Betti number of X w.r.t.F

If two spaces are homotopy equivalent, then all their Betti numbersare equal

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 17 / 43

Page 54: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology vector spaces

Given the sets of n-simplices Sn, we form

the chain �nite-dimensional vector spaces Cn

boundary homomorphisms (linear maps) ∂n : Cn → Cn−1, which can beexpressed as a sequence of matrices Dn, with

(Dn)τσ =

{(−1)j τ is a face ofσ

0 otherwise

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 18 / 43

Page 55: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology vector spaces

Given the sets of n-simplices Sn, we form

the chain �nite-dimensional vector spaces Cn

boundary homomorphisms (linear maps) ∂n : Cn → Cn−1, which can beexpressed as a sequence of matrices Dn, with

(Dn)τσ =

{(−1)j τ is a face ofσ

0 otherwise

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 18 / 43

Page 56: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology vector spaces

Given the sets of n-simplices Sn, we form

the chain �nite-dimensional vector spaces Cn

boundary homomorphisms (linear maps) ∂n : Cn → Cn−1, which can beexpressed as a sequence of matrices Dn, with

(Dn)τσ =

{(−1)j τ is a face ofσ

0 otherwise

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 18 / 43

Page 57: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Homology vector spaces

βn(X ,F ) = dimHn(X ,F )

= dim ker ∂n − dim im∂n+1

= dimCn(X ,F )− dim im∂n − dim im∂n+1

= dimCn(X ,F )− rankDn − rankDn+1

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 19 / 43

Page 58: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

rank-nullity in graph theory

D1 = incidence matrix,S0-vertices, S1-edges

#connectedcomponents=#nodes-rank(D1)

#loops=#edges-rank(D1)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 20 / 43

Page 59: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

rank-nullity in graph theory

D1 = incidence matrix,S0-vertices, S1-edges

#connectedcomponents=#nodes-rank(D1)

#loops=#edges-rank(D1)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 20 / 43

Page 60: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

rank-nullity in graph theory

D1 = incidence matrix,S0-vertices, S1-edges

#connectedcomponents=#nodes-rank(D1)

#loops=#edges-rank(D1)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 20 / 43

Page 61: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Betti numbers

β0 = 1, β1 = 1, βk = 0, for k ≥ 2

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 21 / 43

Page 62: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Betti numbers

β0 = 1, β1 = 1, βk = 0, for k ≥ 2

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 21 / 43

Page 63: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Betti numbers

β0 = 1, β1 = 2, β2 = 1, βk = 0 for k ≥ 3

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 22 / 43

Page 64: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Betti numbers

β0 = 1, β1 = 2, β2 = 1, βk = 0 for k ≥ 3

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 22 / 43

Page 65: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

The story so far

If someone gave us the topological space X which consists of sets ofpoints, edges, triangles, ..., n-simplices, we can compute its Bettinumbers over, say, F2 using linear algebra (simplicial homology)

Only got 0-simplices - we have to build the higher order structure intodata, i.e. form the simplicial complex

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 23 / 43

Page 66: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

The story so far

If someone gave us the topological space X which consists of sets ofpoints, edges, triangles, ..., n-simplices, we can compute its Bettinumbers over, say, F2 using linear algebra (simplicial homology)

Only got 0-simplices - we have to build the higher order structure intodata, i.e. form the simplicial complex

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 23 / 43

Page 67: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

�ech complex

�ech complex (nerve) C(ε) of data {Yi}Ni=1 contains:

0-simplices [i ]1-simplices [ij ] whenever ‖Yi − Yj‖ ≤ εn-simplices [i0 . . . in] whenever

⋂n

j=0Uij 6= Ø,

Uij ={y ∈ Y :

∥∥y − Yij

∥∥ ≤ ε}

The nerve theorem: In a general topological space X , the nerveN(U) is associated to an open covering U = {Ui}i∈I . N(U) ishomotopy equivalent to X whenever every Ui is contractible(homotopy equivalent to a point).

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 24 / 43

Page 68: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

�ech complex

�ech complex (nerve) C(ε) of data {Yi}Ni=1 contains:

0-simplices [i ]1-simplices [ij ] whenever ‖Yi − Yj‖ ≤ εn-simplices [i0 . . . in] whenever

⋂n

j=0Uij 6= Ø,

Uij ={y ∈ Y :

∥∥y − Yij

∥∥ ≤ ε}The nerve theorem: In a general topological space X , the nerveN(U) is associated to an open covering U = {Ui}i∈I . N(U) ishomotopy equivalent to X whenever every Ui is contractible(homotopy equivalent to a point).

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 24 / 43

Page 69: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

�ech complex

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 25 / 43

Page 70: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Vietoris-Rips complex

VR complex VR(ε) contains:

0-simplices [i ]1-simplices [ij ] whenever ‖Yi − Yj‖ ≤ εn-simplices [i0 . . . in] whenever all its faces are in VR(ε).

C(ε) ⊆ VR(2ε) ⊆ C(2ε)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 26 / 43

Page 71: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Vietoris-Rips complex

VR complex VR(ε) contains:

0-simplices [i ]1-simplices [ij ] whenever ‖Yi − Yj‖ ≤ εn-simplices [i0 . . . in] whenever all its faces are in VR(ε).

C(ε) ⊆ VR(2ε) ⊆ C(2ε)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 26 / 43

Page 72: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

C(ε) vs VR(ε)

Vietoris-Rips is the maximal simplicial complex that can be built ontop of the 1-simplicial skeleton (�ag complex)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 27 / 43

Page 73: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Witness complexes

Choose a set of landmark points L ⊂ {Yi}Ni=1 - this is the set of0-simplices

Strong witness complex:

[l0 . . . ln] ∈W s(ε) i� ∃Y (a strong witness): d(Y , lj) ≤ d(Y ,L) + ε,∀j = 0, . . . , n

Weak witness complex:

[l0 . . . ln] ∈W w (ε) i� ∃Y (a weak witness):d(Y , lj) ≤ d(Y ,L\{l0 . . . ln}) + ε, ∀j = 0, . . . , n

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 28 / 43

Page 74: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

How to choose ε?

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 29 / 43

Page 75: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

How to choose ε?

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 30 / 43

Page 76: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Persistence

C(ε) ⊂ C(ε′) whenever ε ≤ ε′

Using inclusion ι : C(ε)→ C(ε′), we get a homomorphismHn(ι,F ) : Hn(C(ε),F )→ Hn(C(ε′),F ) (and can study the image ofthe homology of a smaller complex in the homology of a largercomplex)

two small cycles in the smaller complex vanish in the larger complex,the small cycle in the larger complex is not in the image of Hn(ι,F ),only the largest cycle persists

incremental computation of Betti numbers

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 31 / 43

Page 77: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Persistence

C(ε) ⊂ C(ε′) whenever ε ≤ ε′

Using inclusion ι : C(ε)→ C(ε′), we get a homomorphismHn(ι,F ) : Hn(C(ε),F )→ Hn(C(ε′),F ) (and can study the image ofthe homology of a smaller complex in the homology of a largercomplex)

two small cycles in the smaller complex vanish in the larger complex,the small cycle in the larger complex is not in the image of Hn(ι,F ),only the largest cycle persists

incremental computation of Betti numbers

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 31 / 43

Page 78: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Persistence

C(ε) ⊂ C(ε′) whenever ε ≤ ε′

Using inclusion ι : C(ε)→ C(ε′), we get a homomorphismHn(ι,F ) : Hn(C(ε),F )→ Hn(C(ε′),F ) (and can study the image ofthe homology of a smaller complex in the homology of a largercomplex)

two small cycles in the smaller complex vanish in the larger complex,the small cycle in the larger complex is not in the image of Hn(ι,F ),only the largest cycle persists

incremental computation of Betti numbers

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 31 / 43

Page 79: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Persistence

C(ε) ⊂ C(ε′) whenever ε ≤ ε′

Using inclusion ι : C(ε)→ C(ε′), we get a homomorphismHn(ι,F ) : Hn(C(ε),F )→ Hn(C(ε′),F ) (and can study the image ofthe homology of a smaller complex in the homology of a largercomplex)

two small cycles in the smaller complex vanish in the larger complex,the small cycle in the larger complex is not in the image of Hn(ι,F ),only the largest cycle persists

incremental computation of Betti numbers

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 31 / 43

Page 80: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Persistent homology barcode

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 32 / 43

Page 81: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Natural image statistics

3x3 patches from a database of black and white images - eachdatapoint is a vector in R9

remove the low contrast (nearly constant) patches

mean-center - �turning the brightness knob�

normalize the contrast - �turning the contrast knob�

the points sit on a 7D ellipsoid in R8, but not uniformly

exploring the high-density regions, using the k-codensity proxyδk(x) = ‖x − νk(x)‖

Carlsson et al, On the local behaviour of spaces of natural images, International Journal

of Computer Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 33 / 43

Page 82: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Natural image statistics

3x3 patches from a database of black and white images - eachdatapoint is a vector in R9

remove the low contrast (nearly constant) patches

mean-center - �turning the brightness knob�

normalize the contrast - �turning the contrast knob�

the points sit on a 7D ellipsoid in R8, but not uniformly

exploring the high-density regions, using the k-codensity proxyδk(x) = ‖x − νk(x)‖

Carlsson et al, On the local behaviour of spaces of natural images, International Journal

of Computer Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 33 / 43

Page 83: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Natural image statistics

3x3 patches from a database of black and white images - eachdatapoint is a vector in R9

remove the low contrast (nearly constant) patches

mean-center - �turning the brightness knob�

normalize the contrast - �turning the contrast knob�

the points sit on a 7D ellipsoid in R8, but not uniformly

exploring the high-density regions, using the k-codensity proxyδk(x) = ‖x − νk(x)‖

Carlsson et al, On the local behaviour of spaces of natural images, International Journal

of Computer Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 33 / 43

Page 84: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Natural image statistics

3x3 patches from a database of black and white images - eachdatapoint is a vector in R9

remove the low contrast (nearly constant) patches

mean-center - �turning the brightness knob�

normalize the contrast - �turning the contrast knob�

the points sit on a 7D ellipsoid in R8, but not uniformly

exploring the high-density regions, using the k-codensity proxyδk(x) = ‖x − νk(x)‖

Carlsson et al, On the local behaviour of spaces of natural images, International Journal

of Computer Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 33 / 43

Page 85: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Natural image statistics

3x3 patches from a database of black and white images - eachdatapoint is a vector in R9

remove the low contrast (nearly constant) patches

mean-center - �turning the brightness knob�

normalize the contrast - �turning the contrast knob�

the points sit on a 7D ellipsoid in R8, but not uniformly

exploring the high-density regions, using the k-codensity proxyδk(x) = ‖x − νk(x)‖

Carlsson et al, On the local behaviour of spaces of natural images, International Journal

of Computer Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 33 / 43

Page 86: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Natural image statistics

3x3 patches from a database of black and white images - eachdatapoint is a vector in R9

remove the low contrast (nearly constant) patches

mean-center - �turning the brightness knob�

normalize the contrast - �turning the contrast knob�

the points sit on a 7D ellipsoid in R8, but not uniformly

exploring the high-density regions, using the k-codensity proxyδk(x) = ‖x − νk(x)‖

Carlsson et al, On the local behaviour of spaces of natural images, International Journal

of Computer Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 33 / 43

Page 87: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Natural image statistics

k = 300, top 25% �densest points� - the underlying space appears toform a circle

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 34 / 43

Page 88: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Three-circle model

k = 15, top 25% �densest points� leads to β1 = 5green and red circles do not touch, each touches the blue circle

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 35 / 43

Page 89: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Three-circle model

primary circle: smooth transitions - angle varies along the circlesecondary circles: vertical and horizontal transitions - contrastarrangement varies along the circle

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 36 / 43

Page 90: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Three-circle model

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 37 / 43

Page 91: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Klein bottle!

A mathematician named Klein

Thought the Möbius band was

divine.

Said he: "If you glue

The edges of two,

You'll get a weird bottle like

mine."

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 38 / 43

Page 92: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data

recordings from 10x10 electrode arrays from the V1 in Macaquemonkeys (20-30 minutes):

spontaneuous / no stimulus presentedevoked / video sequences presented

each data segment consists of 200 50ms bins - for each neuron a �ringcount within such bin is recorded

�ve neurons with highest �ring rate: data point cloud X is 200 pointsin R5

For each data segment, construct a witness complex, and obtain itsBetti signature (β0, β1, β2)

Singh et al, Topological Structure of Population Activity in Primary Visual Cortex,

Journal of Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 39 / 43

Page 93: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data

recordings from 10x10 electrode arrays from the V1 in Macaquemonkeys (20-30 minutes):

spontaneuous / no stimulus presentedevoked / video sequences presented

each data segment consists of 200 50ms bins - for each neuron a �ringcount within such bin is recorded

�ve neurons with highest �ring rate: data point cloud X is 200 pointsin R5

For each data segment, construct a witness complex, and obtain itsBetti signature (β0, β1, β2)

Singh et al, Topological Structure of Population Activity in Primary Visual Cortex,

Journal of Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 39 / 43

Page 94: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data

recordings from 10x10 electrode arrays from the V1 in Macaquemonkeys (20-30 minutes):

spontaneuous / no stimulus presentedevoked / video sequences presented

each data segment consists of 200 50ms bins - for each neuron a �ringcount within such bin is recorded

�ve neurons with highest �ring rate: data point cloud X is 200 pointsin R5

For each data segment, construct a witness complex, and obtain itsBetti signature (β0, β1, β2)

Singh et al, Topological Structure of Population Activity in Primary Visual Cortex,

Journal of Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 39 / 43

Page 95: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data

recordings from 10x10 electrode arrays from the V1 in Macaquemonkeys (20-30 minutes):

spontaneuous / no stimulus presentedevoked / video sequences presented

each data segment consists of 200 50ms bins - for each neuron a �ringcount within such bin is recorded

�ve neurons with highest �ring rate: data point cloud X is 200 pointsin R5

For each data segment, construct a witness complex, and obtain itsBetti signature (β0, β1, β2)

Singh et al, Topological Structure of Population Activity in Primary Visual Cortex,

Journal of Vision 2008

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 39 / 43

Page 96: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data - the observed signatures

the most frequently occurring signatures are 110 (circle) and 101(sphere)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 40 / 43

Page 97: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data - the observed signatures

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 41 / 43

Page 98: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data - the observed signatures

the data sets under two regimes are topologically di�erent

signi�cance validation of observed Betti numbers:

simulate �rings from a Poisson modelfrequency of obtaining persistent segments of β1, β2 is < .005

topology distinguishes both data sets from the Poisson model andfrom each other; the nature of the �circular� topological phenomenon?

not likely due to periodicity of body's natural rhytms - no peaks in theamplitude spectrum observed

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 42 / 43

Page 99: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data - the observed signatures

the data sets under two regimes are topologically di�erent

signi�cance validation of observed Betti numbers:

simulate �rings from a Poisson modelfrequency of obtaining persistent segments of β1, β2 is < .005

topology distinguishes both data sets from the Poisson model andfrom each other; the nature of the �circular� topological phenomenon?

not likely due to periodicity of body's natural rhytms - no peaks in theamplitude spectrum observed

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 42 / 43

Page 100: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data - the observed signatures

the data sets under two regimes are topologically di�erent

signi�cance validation of observed Betti numbers:

simulate �rings from a Poisson modelfrequency of obtaining persistent segments of β1, β2 is < .005

topology distinguishes both data sets from the Poisson model andfrom each other; the nature of the �circular� topological phenomenon?

not likely due to periodicity of body's natural rhytms - no peaks in theamplitude spectrum observed

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 42 / 43

Page 101: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

V1 data - the observed signatures

the data sets under two regimes are topologically di�erent

signi�cance validation of observed Betti numbers:

simulate �rings from a Poisson modelfrequency of obtaining persistent segments of β1, β2 is < .005

topology distinguishes both data sets from the Poisson model andfrom each other; the nature of the �circular� topological phenomenon?

not likely due to periodicity of body's natural rhytms - no peaks in theamplitude spectrum observed

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 42 / 43

Page 102: (Gunnar Carlsson, Bulletin of the AMS, 2009) Dino Sejdinovicsejdinov/talks/pdf/TDA_MLJC.pdfDino Sejdinovic Gatsby Unit MLJC October 24, 2012 Dino Sejdinovic (Gatsby Unit MLJC) opTology

Code

Toolbox: JPlex (http://comptop.stanford.edu/)

Java version of Plex, work with MatlabRips, Witness complex, Persistence Homology, barcodes

Other Choices: Plex 2.5/Matlab (not maintained any more), Dionysus(Dimitry Morozov)

Dino Sejdinovic (Gatsby Unit MLJC) Topology and Data October 24, 2012 43 / 43