generalizing convolutional neural networks to graph ...dzeng/bios740/walker_bios740.pdf ·...

33
Generalizing Convolutional Neural Networks to Graph- structured Data Ben Walker Department of Mathematics, UNC-Chapel Hill 5/4/2018

Upload: others

Post on 28-May-2020

18 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Generalizing Convolutional Neural Networks to Graph-

structured DataBen Walker

Department of Mathematics, UNC-Chapel Hill5/4/2018

Page 2: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Overview• Relational structure in data and how to approach it

• Defferrard, Bresson, Vandergheynst 2016: Fast spectral filter method

• Kipf, Welling 2017: A first-order simplification for improved performance

• Discussion

Page 3: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Unstructured Data

Page 4: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Unstructured DataName Alice Bob

Age 14 65

Gender F M

Smokes? N Y

Gender M F

Smokes? Y N

Age 65 14

Name Bob Alice

Page 5: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Unstructured Data

• The order is irrelevant to processing - there is no prescribed relationship between the variables

Name Alice Bob

Age 14 65

Gender F M

Smokes? N Y

Gender M F

Smokes? Y N

Age 65 14

Name Bob Alice

Page 6: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Unstructured Data

• The order is irrelevant to processing - there is no prescribed relationship between the variables

• Use a fully-connected network to learn the relationships

Name Alice Bob

Age 14 65

Gender F M

Smokes? N Y

Gender M F

Smokes? Y N

Age 65 14

Name Bob Alice

Page 7: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Grid-structured Data

Page 8: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Grid-structured DataA kitten

Page 9: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Grid-structured DataA kitten Google Vision Results

Page 10: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Grid-structured DataA kitten Google Vision Results Same Kitten, Different Order

Page 11: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Grid-structured Data

• Reordered kitten picture is unintelligible

• Use a convolutional neural network to reduce parameters

A kitten Google Vision Results Same Kitten, Different Order

Page 12: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Graph-structured Data

• There is some relationship between data, which is given on an input-specific basis, not known a priori

• What can you use here?

Graph Convolutional Network, (Kipf and Welling 2017)

Page 13: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016

Page 14: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016• Spectral method allows for robust application to the

“neighborhood” of a node.

Page 15: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016• Spectral method allows for robust application to the

“neighborhood” of a node.

L = D �W

Page 16: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016• Spectral method allows for robust application to the

“neighborhood” of a node.

L = D �W y =K�1X

k=0

✓kLkx

Page 17: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016• Spectral method allows for robust application to the

“neighborhood” of a node.

y =K�1X

k=0

✓kTk(L)xL =2

�max

L� In

L = D �W y =K�1X

k=0

✓kLkx

Page 18: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016• Spectral method allows for robust application to the

“neighborhood” of a node.

• This “filtering” that maps x to y is the equivalent of the convolution step in a standard convolutional network - K parameters to learn.

y =K�1X

k=0

✓kTk(L)xL =2

�max

L� In

L = D �W y =K�1X

k=0

✓kLkx

Page 19: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016y =

K�1X

k=0

✓kTk(L)x

Page 20: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016

• Localized - kth term in sum includes contribution up to k hops from the node

y =K�1X

k=0

✓kTk(L)x

Page 21: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016

• Localized - kth term in sum includes contribution up to k hops from the node

• Recursive definition, allowing for efficient computation

y =K�1X

k=0

✓kTk(L)x

Tk+1(L)x = 2LTk(L)x� Tk�1(L)x

Page 22: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Defferrard et al 2016

• Localized - kth term in sum includes contribution up to k hops from the node

• Recursive definition, allowing for efficient computation

• This filter is something we can apply machine learning techniques to

y =K�1X

k=0

✓kTk(L)x

Tk+1(L)x = 2LTk(L)x� Tk�1(L)x

Page 23: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Validation• Chebyshev filter Graph CNN tested on MNIST

• Graph created to represent grid structure

• Comparable performance to classical CNN

• Also validated on 20NEWS text categorization dataset.

Page 24: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Kipf, Welling 2017

Page 25: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Kipf, Welling 2017• Aim to improve the approach from Defferrard

• Linearize the previous filter equation

y = ✓

00x� ✓

01D

� 12AD

� 12x

Page 26: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Kipf, Welling 2017• Aim to improve the approach from Defferrard

• Linearize the previous filter equation

• Simplify and renormalize for improved numerical stability, and generalize to multiple feature maps to get an equation

y = ✓

00x� ✓

01D

� 12AD

� 12x

Z = D� 12 AD� 1

2X⇥

Page 27: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Kipf, Welling 2017• Aim to improve the approach from Defferrard

• Linearize the previous filter equation

• Simplify and renormalize for improved numerical stability, and generalize to multiple feature maps to get an equation

y = ✓

00x� ✓

01D

� 12AD

� 12x

Z = D� 12 AD� 1

2X⇥

Xk+1 = � (MXk⇥k)

Page 28: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Validation• Validation Datasets

• Citeseer, Cora, and Pubmed citation networks

• NELL knowledge graph

Comparison of classification accuracy percentage of different methods. (Kipf Welling 2017)

Page 29: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Discussion

Page 30: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Discussion• Graph-structured data is an interesting new frontier for

machine-learning methods

Page 31: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Discussion• Graph-structured data is an interesting new frontier for

machine-learning methods

• Kipf and Welling GCN is very similar to standard neural network formulations

Xk+1 = � (MXk⇥k)

Page 32: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

Discussion• Graph-structured data is an interesting new frontier for

machine-learning methods

• Kipf and Welling GCN is very similar to standard neural network formulations

• By nature of linearization, it is localized at a distance of 1.

Xk+1 = � (MXk⇥k)

Page 33: Generalizing Convolutional Neural Networks to Graph ...dzeng/BIOS740/Walker_Bios740.pdf · "Convolutional neural networks on graphs with fast localized spectral filtering." Advances

References

Defferrard, Michaël, Xavier Bresson, and Pierre Vandergheynst. "Convolutional neural networks on graphs with fast localized spectral filtering." Advances in Neural Information Processing Systems. 2016.

Kipf, Thomas N., and Max Welling. "Semi-supervised classification with graph convolutional networks." arXiv preprint arXiv:1609.02907 (2016).