two-way mul+modal online matrix...

36
Two-way Mul+modal Online Matrix Factoriza+on JORGE A. VANEGAS MindLab Research Group, Universidad Nacional de Colombia, BogotÁ, Colombia

Upload: others

Post on 02-Sep-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Two-way Mul+modal Online Matrix Factoriza+on

JORGE A. VANEGAS

MindLab Research Group, Universidad

Nacional de Colombia, BogotÁ, Colombia

Page 2: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Outline

1.  Introduction 2.  Two-way Multimodal Online Matrix

Factorization for Multi-label Annotation 3.  Semi-supervised Dimensionality

Reduction via Multimodal Matrix Factorization

Page 3: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Synonymy and polysemy

Seman+c gap

Seman+c similarity Visual

similarity

HARD

DRYBRAVE

RIPE

STRONG

SHARPDIFFICUT

RUDE

SOLID

STUBBORN

EXPENSIVE

ROUGH

Images

Text

Page 4: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Eliminatetheredundancyandthenoisepresentinthemanifoldstructureofthe

originalhighdimensionalfeature

representaGon.

 TacklesthecurseofdimensionalitybycompressingtherepresentaGoninamore

expressivereducedsetofvariables.

Dimensionality reduc+on

Page 5: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

•  Latent Seman+c Analysis (LSA)

[Dumai et al. 2004]

•  Nonnega+ve Matrix Factoriza+on (NMF)

[Lee et al. 1999]

SemanGcrepresentaGonviamatrix

factorizaGon

Page 6: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Two-way Multimodal Online Matrix Factorizationfor Multi-label Annotation

(ICPRAM)

Jorge A. Vanegas

MindLAB Research Group - Universidad Nacional de Colombia

November 24, 2015

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 7: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Multi-label Annotation

The multi-label annotation problem arises in situations such asobject recognition in images where we want to automatically findthe objects present in a given image.The solution consists in learning a classification model able toassign one or many labels to a particular sample.

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 8: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Two-way Multimodal Matrix Factorization

f : Rn ! Rr ,

g : Rr ! Rn

where n � r

Reconstruction of the original feature representation:

Xv t W

0vWvXv (1)

Reconstruction of the original label representation:

Xt t W

0tWtXt (2)

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 9: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Two-way Multimodal Matrix Factorization

Xt = W

0tWvXv

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 10: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Optimization problem

L

⇣Xv ,Xt ,Wv ,W

0

v ,Wt ,W0

t

⌘=

↵���Xv �W

0

vWvXv

���2

F+

+(1� ↵)���Xt �W

0

t WtXt

���2

F

+����Xt �W

0

t WvXv

���2

F

+�

✓kWvk2F +

���W0

v

���2

F+ kWtk2F +

���W0

t

���2

F

◆(3)

↵ controls the relative importance between the reconstruction of the instance

representation and the label representation.

� controls the relative importance of the mapping between instance features

and label information

� controls the relative importance of the regularization terms, which penalizes

large values for the Frobenius norm of the transformation matrices.

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 11: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Online learning

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 12: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Online learning

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 13: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Two-way multimodal online matrix factorization algorithm

input r :latent space size, �0: initial step size, epochs: number of epochs,

Xv 2 Rn⇥l, Xt 2 Rm⇥l

, ↵, �, �Random initialization of transformation matrices:

W

0(0)v = random matrix (r , n)

W

(0)v = random matrix (n, r)

W

0(0)t = random matrix (r ,m)

W

(0)t = random matrix (m, r)

for i = 1 to epochs dofor j = 1 to l do

⌧ = i ⇥ j

x

(⌧)v , x (⌧)

t sample without replacement(Xv ,Xt)

Compute gradients:

g

(⌧)

W 0v= rW 0

vL

⇣x

(⌧)v , x (⌧)

t ,W (⌧)v ,W

0(⌧)v ,W (⌧)

t ,W0(⌧)t

g

(⌧)Wv

= rWv L

⇣x

(⌧)v , x (⌧)

t ,W (⌧)v ,W

0(⌧)v ,W (⌧)

t ,W0(⌧)t

g

(⌧)

W0t= rW

0tL

⇣x

(⌧)v , x (⌧)

t ,W (⌧)v ,W

0(⌧)v ,W (⌧)

t ,W0(⌧)t

g

(⌧)Wt

= rWtL

⇣x

(⌧)v , x (⌧)

t ,W (⌧)v ,W

0(⌧)v ,W (⌧)

t ,W0(⌧)t

...Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 14: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Two-way multimodal online matrix factorization algorithm

...Update term calculation using momentum:

4W0(⌧)

v = ��(⌧)g (⌧)

W 0v+ p4W

0(⌧�1)v

4W (⌧)v = ��(⌧)g (⌧)

Wv+ p4W (⌧�1)

v

4W0(⌧)t = ��(⌧)g (⌧)

W0t

+ p4W0(⌧�1)t

4W (⌧)t = ��(⌧)g (⌧)

Wt+ p4W (⌧�1)

t

Update transformation matrices:

W

0(⌧+1)v = W

0(⌧)v +4W

0(⌧)v

W

(⌧+1)v = W

(⌧)v +4W

(⌧)v

W

0(⌧+1)t = W

0(⌧)t +�W

0(⌧)t

W

(⌧+1)t = W

(⌧)t +�W

(⌧)t

end forend forreturn W

0(N)v ,W (N)

v ,W0(N)t ,W (N)

t

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 15: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Prediction

xt = W

0tWvxv

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 16: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Annotation

The transformation of the input features generates anm�dimensional vector with an smoothed label representation

The final decision to assign a label would be taken by defininga thresholdassign 1 to the j � th label if ˜xt,j = threshold , or we can assign1 to the top�k labels with the highest values in the vector.

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 17: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Implementation details

Pylearn2 library 1

2

1http://deeplearning.net/software/pylearn2/

2http://deeplearning.net/software/theano/

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 18: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Experiments and Results

80% of the images for training

the remaining 20% for test

Were compared against 8 MLLSE3 algorithms:

OVA: One-versus-allCCA: Canonical Correlation AnalysisCS: Compressed SensingPLST: Principal Label Space TransformMME: Multilabel max-margin embeddingANMF, MNMF, OMMF

3multi-label latent space embedding

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 19: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Datasets

Dataset Corel5k Bibtex MediaMillLabels 374 159 101

Features 500 1,836 120Label cardinality 3,522 2,402 4.376

Examples 5,000 7,395 43,907

The method was evaluated on 3 standard multilabel datasetsdistributed by the mulan framework authors(http://mulan.sourceforge.net/datasets.html)

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 20: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

F-Measure for each method

Performance of each method in terms of f-measure

Method Corel5k Bibtex MediaMill

OVA 0.112 0.372CCA 0.150 0.404CS 0.086 (50) 0.332 (50)

PLST 0.074 (50) 0.283 (50)MME 0.178 (50) 0.403 (50) 0.199 (350)ANMF 0.210 (30) 0.297 (140) 0.496 (350)MNMF 0.240 (35) 0.376 (140) 0.510 (350)OMMF 0.263 (40) 0.436 (140) 0.503 (350)

Our Method 0.283 (100) 0.422 (300) 0.540 (300)

Sebastian Otalora-Montenegro et al. [1]Sunho Park et al. [2]

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 21: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

Conclusions and Future Work

We presented a novel multi-label annotation method whichlearns a mapping between the original sample representationand labels by finding a common semantic representation.

We propose a model that finds a mapping from the samplerepresentation space to a semantic space, and simultaneouslyfinds a back-projection from the semantic space to the originalspace.

This method is formulated as an online learning algorithmallowing to deal with large collections.

One important limitation is that the method assumes lineardependencies between the data modalities

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 22: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ProblemStrategy

Conclusions

References

Sebastian Otalora-Montenegro, Santiago A. Perez-Rubiano,and Fabio A. Gonzalez.Online matrix factorization for space embedding multilabelannotation.In CIARP (1), volume 8258 of Lecture Notes in Computer

Science, pages 343–350. Springer, 2013.

Sunho Park and Seungjin Choi.Max-margin embedding for multi-label learning.Pattern Recognition Letters, 34(3):292 – 298, 2013.

Jorge A. Vanegas Two-way Multimodal Online Matrix Factorization for Multi-label Annotation

Page 23: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Semi-supervised Dimensionality Reduc+on via Mul+modal Matrix

Factoriza+on

VIVIANA BELTRÁN, JORGE A. VANEGAS, FABIO A. GONZÁLEZ

MindLab Research Group, Universidad

Nacional de Colombia, BogotÁ , Colombia

Page 24: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Challenges ProposedSolu7on

Highdimensionaldata Lowdimensionalembedding

representaGon

Reducednumberoflabeled

data Semi–supervisedlearningvia

matrixfactorizaGon

Hugeamountofunstructured

data:

•  Massiveunlabeledexamples

available

•  StochasGcgradientdescent

•  GPUimplementaGon

Mo+va+on

Page 25: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Semi-supervised learning

Manifoldlearningforclassifica7on

Page 26: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Semi-supervised learning

AnnotatedinstancesareusedtomaximizethediscriminaGonbetweenclasses,

butalso,non-annotatedinstancescanbeexploitedtoesGmatetheintrinsic

manifoldstructureofthedata.

Page 27: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

n>>r

Model

Page 28: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Model

Xifeaturevectorofthei-thinstanceinthedatacollecGonXtibinarylabelvectorofthei-thinstancekinstancesfortrainingllabeledinstances

k>>l

Page 29: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Experimental Setup: ●  Parameter exploration by using 5-fold cross

validation. ●  Average classification accuracies for 10

runs evaluated by using 1-KNN setup similar as in [Zhao et al. 2014].

●  Linear supervised, semi-supervised and unsupervised dimensionality reduction methods as baselines.

Experiments

Page 30: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Datasets

Page 31: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Datasets par++ons

Datasets

Page 32: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

ClassificaGonaccuracyfordifferentpercentagesofannotated

instances

Page 33: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Avg. Classification Accuracy

Labeledsamples(inthousands)

Trainingsamples(inthousands)

Accuracy

Covtype Mnist

Foralltrainingsizesonly30%ofinstancesareannotated

Page 34: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Avg. Training time

Labeledsamples(inthousands)

Training7meinseconds

Trainingsamples(inthousands)

Covtype Mnist

Foralltrainingsizesonly30%ofinstancesareannotated

Page 35: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

Conclusions

We presented a method whose main

characterisGcsare:

●  ModelingasemanGclow-spacerepresentaGon

●  Preservingtheseparabilityoftheoriginalclasses●  Ability to exploit unlabeled instances for

modelingthemanifoldstructureofthedata

●  OnlineformulaGon

Page 36: Two-way Mul+modal Online Matrix Factoriza+onfagonzalezo.github.io/ml/Two-wayMatrixFactorization.pdf · 2017. 11. 2. · Sebastian Ot´alora-Montenegro, Santiago A. P´erez-Rubiano,

References

[Zhaoetal.2014]MingboZhao,ZhaoZhang,TommyWS

Chow, and Bing Li. A general so[ label based linear

discriminant analysis for semi-supervised dimensionality

reducGon.NeuralNetworks,55:83–97,2014.