DIG Seminar. September 6th, 2018
The Statistical Learning Theory in Practical Problems
Rodrigo Fernandes de Mello
Universidade de São Paulo
Instituto de Ciências Matemáticas e de Computação
Departamento de Ciências de Computação
http://www.icmc.usp.br/~mello
mello@icmc.usp.br
Theoretical Aspects
● Interested in introducing my research interests
● What better way than the Statistical Learning Theory?
Statistical Learning Theory
● So many classification algorithms: MLP, SVM, ID3, KNN, J48, C4.5, DWNN, CNN, TensorFlow, ...
● How can we assess any of those?
– K-fold cross validation, leave-one-out, ...
● How can we prove any of those learn?
Statistical Learning Theory
● Vapnik proposed the Statistical Learning Theory
● Defined in the context of supervised learning
● Provides learning guarantees and the conditions under which they hold
● What is the main claim?
Sample error → Expected error: as the sample grows, the error measured on the sample approaches the error expected on unseen data
This is the concept of Generalization
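To state the claim precisely, here is the textbook formulation (standard SLT notation; this block is my paraphrase, not text from the slides). Learning is guaranteed when the sample (empirical) risk converges uniformly to the expected risk over the chosen function class:

```latex
% Expected risk: average loss of f under the (unknown) joint distribution P(X, Y)
R(f) = \int \ell(f(x), y) \, dP(x, y)

% Empirical (sample) risk over n training examples
R_{\mathrm{emp}}(f) = \frac{1}{n} \sum_{i=1}^{n} \ell(f(x_i), y_i)

% Generalization: uniform convergence over the class \mathcal{F} as n grows
P\left( \sup_{f \in \mathcal{F}} \big| R(f) - R_{\mathrm{emp}}(f) \big| > \varepsilon \right) \to 0
\quad \text{as } n \to \infty
```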
Statistical Learning Theory
● So what is generalization?
● Consider that someone has given you a textbook
– If you only memorize its exercises, you may fail on new ones; learning means performing well on unseen problems
Statistical Learning Theory: Bias-Variance Dilemma
● An example based on regression:
Figure from von Luxburg and Schölkopf, Statistical Learning Theory: Models, Concepts, and Results
Which function is the best?
To answer it we need to assess their Expected Risks
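A minimal numerical sketch of that comparison (my own illustration, not from the slides; the target function, noise level, and polynomial degrees are assumptions): fit polynomials of increasing degree to a small noisy sample, then approximate the Expected Risk with a large fresh sample.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample(n):
    # Noisy observations of an assumed underlying function y = sin(3x)
    x = rng.uniform(-1, 1, n)
    y = np.sin(3 * x) + rng.normal(0, 0.2, n)
    return x, y

x_train, y_train = sample(30)        # the sample we learn from
x_test, y_test = sample(10000)       # large fresh sample ~ expected risk

for degree in (1, 3, 15):
    coeffs = np.polyfit(x_train, y_train, degree)
    sample_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    expected_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree={degree:2d}  sample error={sample_mse:.3f}  expected error~{expected_mse:.3f}")
```

Low degrees underfit (both errors are high); a very high degree drives the sample error toward zero while the expected error grows, which is exactly the dichotomy the figure illustrates.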
Statistical Learning Theory: Bias-Variance Dilemma
● The dichotomy associated with the Bias-Variance Dilemma
[Figure: the space of all functions F_all and, inside it, the restricted space of admissible functions F]
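The textbook decomposition behind this dilemma (a standard result for squared loss, not taken from the slide), where f is the underlying function and \hat{f} the estimator learned from a random sample:

```latex
\mathbb{E}\left[ (\hat{f}(x) - y)^2 \right]
  = \underbrace{\left( \mathbb{E}[\hat{f}(x)] - f(x) \right)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\left[ \left( \hat{f}(x) - \mathbb{E}[\hat{f}(x)] \right)^2 \right]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{noise}}
```

Restricting the admissible space F typically increases the bias term while reducing the variance term, and enlarging F toward F_all does the opposite; neither extreme minimizes the expected risk.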
Distance-Weighted Nearest Neighbors
● Based on the same principles as the k-Nearest Neighbors
[Figure: training instances of two classes plotted over Attribute 1 × Attribute 2; a new instance is classified by majority vote among its K nearest neighbors, with K=3, K=5, and K=6 yielding different votes; see the sketch below]
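A minimal sketch of that vote (hypothetical data; the coordinates, labels, and function names are mine): with the same new instance, different values of K can flip the predicted class.

```python
import numpy as np
from collections import Counter

def knn_vote(x_query, x_train, labels, k):
    """Classify by majority vote among the k nearest training instances."""
    dists = np.linalg.norm(x_train - x_query, axis=1)   # Euclidean distances
    nearest = np.argsort(dists)[:k]                     # indices of the k closest
    return Counter(labels[nearest]).most_common(1)[0][0]

# Hypothetical instances over (Attribute 1, Attribute 2), two classes
x_train = np.array([[1, 1], [1, 2], [2, 1], [5, 5], [6, 5], [5, 6], [6, 6]], dtype=float)
labels = np.array(["A", "A", "A", "B", "B", "B", "B"])

for k in (3, 5, 6):
    pred = knn_vote(np.array([3.5, 3.5]), x_train, labels, k)
    print(f"K={k}: new instance (3.5, 3.5) -> class {pred}")
```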
Distance-Weighted Nearest Neighbors
● It is based on radial functions centered at the new instance, a.k.a. the query point
● Classification output:
● Given the weight function:
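The two formulas referenced above appear to have been lost in extraction; the standard DWNN definitions, which I believe match what the slide showed, are:

```latex
% Output for a query point x_q: the weighted average of all n training labels
\hat{y}(x_q) = \frac{\sum_{i=1}^{n} w_i \, y_i}{\sum_{i=1}^{n} w_i}

% Radial (Gaussian) weight centered at the query point, where d(\cdot, \cdot)
% is the Euclidean distance and \sigma controls the radius of influence
w_i = \exp\left( -\frac{d(x_i, x_q)^2}{2\sigma^2} \right)
```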
Distance-Weighted Nearest Neighbors
● After implementing it, test it on this simple example of an identity function:
Two main questions:
– What happens if sigma is too big?
– What happens if sigma is too small?
So, how can we define the best value for sigma?
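A minimal Python sketch of that exercise (my own implementation of the standard DWNN above; the function names and test values are assumptions):

```python
import numpy as np

def dwnn(x_query, x_train, y_train, sigma):
    """DWNN: Gaussian-weighted average of all training labels,
    with the radial weight centered at the query point."""
    d2 = (x_train - x_query) ** 2                  # squared distances
    w = np.exp(-d2 / (2 * sigma ** 2))             # radial weights
    return np.sum(w * y_train) / np.sum(w)

# Identity function: y = x
x_train = np.linspace(0, 10, 11)
y_train = x_train.copy()

for sigma in (0.1, 1.0, 100.0):
    y_hat = dwnn(8.3, x_train, y_train, sigma)
    print(f"sigma={sigma:>6}: prediction for x=8.3 -> {y_hat:.3f}")
```

With sigma too small, the nearest training point dominates and the model simply memorizes the sample; with sigma too large, all weights become nearly equal and the prediction collapses to the mean of the labels. The two slides below examine exactly these extremes.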
Distance-Weighted Nearest Neighbors
● When sigma tends to infinity, every training instance receives nearly the same weight
[Figure: inside F_all, the space of admissible functions shrinks to a single point]
The space of admissible functions (the bias) will contain a single function
In this case, the average function: the prediction is simply the mean of all training labels
Distance-Weighted Nearest Neighbors
● When sigma tends to zero, only the nearest training instance receives non-negligible weight
[Figure: the space of admissible functions grows toward F_all]
The space of admissible functions (the bias) will tend to the whole space
What is the problem with that?
It will most probably contain at least one memory-based classifier, i.e., one that merely memorizes the training sample instead of generalizing
Theoretical Aspects
● Putting it in simple terms:
● Interested in proving learning in different scenarios
● Building up a theoretical framework to prove learning guarantees for time series modeling
– Idea: use it to ensure concept drift detection
● A theoretical framework to design Deep Learning architectures
Living in the Past...
● Some practical results:
● Bionic hand
– Hardware under development in the northeast of Brazil
– Mainly to support people living in extreme poverty
● Time Series Visualization tool
– www.tsviz.com.br
● Cover song identification
– Supported the ECAD system, the copyright office for songs in Brazil
● Go Digital, Brazil – incorporated by Acxiom, USA
– Machine Learning for geopositioning systems
– Marketing based on ML
References
● Vapnik, V. N. The Nature of Statistical Learning Theory. Springer, 2000.
● von Luxburg, U. and Schölkopf, B. Statistical Learning Theory: Models, Concepts, and Results. In Handbook of the History of Logic, Volume 10: Inductive Logic (eds. Dov M. Gabbay, Stephan Hartmann, and John Woods). Elsevier, 2009.
● Schölkopf, B. and Smola, A. J. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, 2002.