eib seminar from esteban vegas, ph.d

45
Morten Mørup

Upload: statistics-and-bioinformatics-eib-ub

Post on 09-May-2015

418 views

Category:

Technology


0 download

DESCRIPTION

Applications of tensor (multiway array) factorizations and decomposition in data mining presentation

TRANSCRIPT

Page 1: EiB Seminar from Esteban Vegas, Ph.D

Morten Mørup

Page 2: EiB Seminar from Esteban Vegas, Ph.D

Abstract Tensor (multiway array) factorization and decomposition has become an

important tool for data mining. Fueled by the computational power of modern computer researchers can now

analyze large-scale tensorial structured data that only a few years ago wouldhave been impossible.

Tensor factorizations have several advantages over two-way matrixfactorizations including uniqueness of the optimal solution and componentidentification even when most of the data is missing.

Furthermore, multiway decomposition techniques explicitly exploit themultiway structure that is lost when collapsing some of the modes of the tensorin order to analyze the data by regular matrix factorization approaches.

Multiway decomposition is being applied to new fields every year and there isno doubt that the future will bring many exciting new applications.

The aim of this overview is to introduce the basic concepts of tensordecompositions and demonstrate some of the many benefits and challenges ofmodeling data multiway for a wide variety of data and problem domains.

Page 3: EiB Seminar from Esteban Vegas, Ph.D

INTRODUCTION Tensors, or multiway arrays, are generalizations of

vectors (first-order tensors) and matrices (second-order tensors) to arrays of higher orders (N > 2). Hence, a third-order tensor is an array with elements xi,j,k

Tensor decompositions are in frequent use today in a variety of fields ranging from psychology, chemometrics, signal processing, bioinformatics, neuroscience, web mining, and computer vision to mention but a few

Page 4: EiB Seminar from Esteban Vegas, Ph.D

Examples Psychometrics: which group of subjects behave differently

on which variables under which conditions? Chemistry: low concentrations to be the physical model of

fluorescence spectroscopy admitting unique recovery of the chemical compounds from sampled mixtures.

Neuroimaging: extract the consistent patterns of activation whereas noisy trials/subjects can be downweighted in the averaging process.

Bioinformatics: understanding of cellular states and biological processes.

Signal processing, computer vision, web mining …

Page 5: EiB Seminar from Esteban Vegas, Ph.D

Factorizing tensors advantages over two-way matrix factorization: uniqueness of the optimal solution (without imposing

constraints such as orthogonality and independence) and component identification even when only a

relatively small fraction of all the data is observed (i.e., due to missing values).

Furthermore, multiway decomposition techniques can explicitly take into account the multiway structure of the data that would otherwise be lost when analyzing the data by matrix factorization approaches by collapsing some of the modes.

Page 6: EiB Seminar from Esteban Vegas, Ph.D

Tensor factorization: changelling and opening problems: its geometry is not yet fully understood the occurrence of degenerate solutions. no guarantee of finding the optimal solution most tensor decomposition models impose a very

restricted structure on the data which in turn require that data exhibit a strong degree of regularity.

understanding the data generating process is key for theformulation of adequate tensor decomposition modelsthat can well extract the inherent multimodal structure.

Page 7: EiB Seminar from Esteban Vegas, Ph.D

Tensor Nomenclatura Real tensor of order N (calligraphed letters):

More compactly

Element of the tensor

Page 8: EiB Seminar from Esteban Vegas, Ph.D

Tensor Operations

It is a mistake?

Page 9: EiB Seminar from Esteban Vegas, Ph.D

Matricizing and unmatricizing

Page 10: EiB Seminar from Esteban Vegas, Ph.D

Matricizing operation

Page 11: EiB Seminar from Esteban Vegas, Ph.D

n-mode multiplication

Singular value decomposition (SVD) can be written as

Page 12: EiB Seminar from Esteban Vegas, Ph.D

Outer, Kronecker and Khatri-Raoproduct

• Outer product of three vectors a, b, c:

• Kronecker product

• Khatri-Rao product is defined as a column-wise Kronecker product

Page 13: EiB Seminar from Esteban Vegas, Ph.D

Moore-Penrose inverse

Page 14: EiB Seminar from Esteban Vegas, Ph.D

Summary

Page 15: EiB Seminar from Esteban Vegas, Ph.D

Tucker Model

Page 16: EiB Seminar from Esteban Vegas, Ph.D

Tucker Model

Page 17: EiB Seminar from Esteban Vegas, Ph.D

Tucker ModelThe Tucker model is not unique, an equivalent representation is:

where are invertible matrices

Page 18: EiB Seminar from Esteban Vegas, Ph.D

Tucker ModelTucker3 model: Using the n-mode matricizing and Kronecker product operation.

Page 19: EiB Seminar from Esteban Vegas, Ph.D

Tucker Model: Estimation

Page 20: EiB Seminar from Esteban Vegas, Ph.D

Tucker Model: Estimation

Page 21: EiB Seminar from Esteban Vegas, Ph.D

Tucker Model: Estimation

Page 22: EiB Seminar from Esteban Vegas, Ph.D

CP ModelAlso known as:

Canonical Decomposition (CANDECOMP) Parallel Factor Analysis (PARAFAC)

.

Page 23: EiB Seminar from Esteban Vegas, Ph.D

CP Model Tucker model on core array G is the same:

1. L = M = N = D2. gi,i,i ≠ 0 .

Page 24: EiB Seminar from Esteban Vegas, Ph.D

CP Model

Page 25: EiB Seminar from Esteban Vegas, Ph.D

CP Model: Estimation

Page 26: EiB Seminar from Esteban Vegas, Ph.D

CP Model: Estimation

Page 27: EiB Seminar from Esteban Vegas, Ph.D

CP Model: Estimation

Page 28: EiB Seminar from Esteban Vegas, Ph.D

Missing values Solutions

1. Impute missing data values:

2. Marginalization : missing values ignored during optimization

Page 29: EiB Seminar from Esteban Vegas, Ph.D

Missing values

Page 30: EiB Seminar from Esteban Vegas, Ph.D

Model Order Estimation

Page 31: EiB Seminar from Esteban Vegas, Ph.D

Model Order Estimation

Page 32: EiB Seminar from Esteban Vegas, Ph.D

Model Order Estimation

Page 33: EiB Seminar from Esteban Vegas, Ph.D

Common constraints

Page 34: EiB Seminar from Esteban Vegas, Ph.D

Some Available Tensor Software Matlab:

N-way toolbox, CuBatch, PLS_Toolbox, Tensor toolbox Mathematica and Maple support tensors R language:

PTAk: Spatio-Temporal Multiway Decompositions Using Principal Tensor Analysis on k-Modes

Page 35: EiB Seminar from Esteban Vegas, Ph.D
Page 36: EiB Seminar from Esteban Vegas, Ph.D

Bioinformatics Multiway modeling has recently found use within bioinformatics. Here, the HOSVD has been shown to enable the interpretation of

cellular states and biological processes by defining the significance of each combination of extracted patterns.

Page 37: EiB Seminar from Esteban Vegas, Ph.D

Bioinformatics

it was demonstrated that HOSVD computationally can remove experimental artifacts from the global mRNA expression

Page 38: EiB Seminar from Esteban Vegas, Ph.D

Bioinformatics

Page 39: EiB Seminar from Esteban Vegas, Ph.D

Bioinformatics

The Tucker analysis was here demonstrated to have the advantage of easily interpretable time profiles and extraction of metabolic perturbations with common time profiles only.

Page 40: EiB Seminar from Esteban Vegas, Ph.D

Chemistry

Page 41: EiB Seminar from Esteban Vegas, Ph.D

Chemistry

Page 42: EiB Seminar from Esteban Vegas, Ph.D

Neuroscience

Page 43: EiB Seminar from Esteban Vegas, Ph.D

Neuroscience

Page 44: EiB Seminar from Esteban Vegas, Ph.D

Computer vision

Page 45: EiB Seminar from Esteban Vegas, Ph.D

CONCLUSION